Sorry, you need to enable JavaScript to visit this website.

Immersive Optical-See-Through Augmented Reality. Augmented Reality has been getting ready for the last 20 years, and is finally becoming real, powered by progress in enabling technologies such as graphics, vision, sensors, and displays. In this talk I’ll provide a personal retrospective on my journey, working on all those enablers, getting ready for the coming AR revolution. At Meta, we are working on immersive optical-see-through AR headset, as well as the full software stack. We’ll discuss the differences of optical vs.

Categories:
139 Views

In this supplementary material for "SCIGS: 3D GAUSSIANS SPLATTING FROM A SNAPSHOT COMPRESSIVE IMAGE", the results of comparative experiments on all the datasets from dynamic scenes and static scenes are shown. The experiment compares our SCIGS against current state-of-the-art SCI decoding methods and state-of-the-art SCI image-based reconstruction method. An additional experiment is conducted to assess the impact of various mask overlapping rates during the SCI image modulated.

Categories:
12 Views

Accurate 3D modeling of humans and high-fidelity garments is crucial in computer vision and graphics, impacting gaming, virtual, and augmented reality applications.

Categories:
20 Views

Facial Expression Recognition (FER) has achieved significant success in recent years due to the rise of deep learning. Meanwhile, latent semantic information is crucial for recognizing facial expressions with subtle differences. Inspired by inconsistencies in learning intensity across different layers of deep learning networks — where shallow-layer features lack generalization and task relevance compared to deep-layer features — we propose a novel Hierarchical Semantic Transfer (HST) method.

Categories:
35 Views

Facial Expression Recognition (FER) has achieved significant success in recent years due to the rise of deep learning. Meanwhile, latent semantic information is crucial for recognizing facial expressions with subtle differences. Inspired by inconsistencies in learning intensity across different layers of deep learning networks — where shallow-layer features lack generalization and task relevance compared to deep-layer features — we propose a novel Hierarchical Semantic Transfer (HST) method.

Categories:
1 Views

Visual Question Answering with Natural Language Explanation (VQA-NLE) task is challenging due to its high demand for reasoning-based inference. Recent VQA-NLE studies focus on enhancing model networks to amplify the model’s reasoning capability but this approach is resource consuming and unstable. In this work, we introduce a new VQA-NLE model, ReRe (Retrieval-augmented natural language Reasoning), using leverage retrieval information from the memory to aid in generating accurate answers and persuasive explanations without relying on complex networks and extra datasets.

Categories:
3 Views

Visual anomaly detection in computer vision is an essential one-class classification and segmentation problem. The student-teacher (S-T) approach has proven effective in addressing this challenge. However, previous studies based on S-T underutilize the feature representations learned by the teacher network, which restricts anomaly detection performance.

Categories:
36 Views

Driver Action Recognition (DAR) is crucial in vehicle cabin monitoring systems. In real-world applications, it is common for vehicle cabins to be equipped with cameras featuring different modalities. However, multi-modality fusion strategies for the DAR task within car cabins have rarely been studied. In this paper, we propose a novel yet efficient multi-modality driver action recognition method based on dual feature shift, named DFS. DFS first integrates complementary features across modalities by performing modality feature interaction.

Categories:
36 Views

Pages