Virtual reality and 3D imaging

USER-IN-THE-LOOP VIEW SAMPLING WITH ERROR PEAKING VISUALIZATION

Augmented reality (AR) provides ways to visualize missing view samples for novel view synthesis. Existing approaches present 3D annotations for new view samples and task users with taking images by aligning the AR display. This data collection task is known to be mentally demanding, and it restricts capture to small pre-defined areas because the underlying sampling theory is ideal but restrictive. To free users from 3D annotations and limited scene exploration, we propose using locally reconstructed light fields and visualizing the errors that inserting new views would remove.

dummy.txt

Categories: Virtual reality and 3D imaging

119 Views

Regularizing Neural Radiance Fields from Sparse RGB-D Inputs

myposter.pdf

myposter.pdf (228)

Categories: Virtual reality and 3D imaging

15 Views

Dynamic Point Cloud Interpolation

Dense photorealistic point clouds can depict real-world dynamic objects in high resolution and with a high frame rate. Frame interpolation of such dynamic point clouds would enable the distribution, processing, and compression of such content. In this work, we propose a first point cloud interpolation framework for photorealistic dynamic point clouds. Given two consecutive dynamic point cloud frames, our framework aims to generate intermediate frame(s) between them.

5010_Akhtar_Poster_pdf.pdf

Poster (329)

Categories: Virtual reality and 3D imaging

46 Views

Refining the bounding volumes for lossless compression of voxelized point clouds geometry

ICIP_SUNUM_videosuz.pptx

ICIP_SUNUM_videosuz.pptx (869)

poster.pdf

poster.pdf (382)

Categories: Virtual reality and 3D imaging, Other

35 Views

Translation of a Higher Order Ambisonics Sound Scene Based on Parametric Decomposition

This paper presents a novel 3DoF+ system that allows the listener to navigate, i.e., change position, in scene-based spatial audio content beyond the sweet spot of a Higher Order Ambisonics recording. It is one of the first such systems based on sound captured at a single spatial position. The system uses a parametric decomposition of the recorded sound field. For the synthesis, only coarse distance information about the sources is needed as side information; the exact number of sources is not required.

handout.pdf

handout.pdf (756)

Categories: Spatial and Multichannel Audio, Source Separation and Signal Enhancement, Audio for Multimedia, Loudspeaker and Microphone Array Signal Processing, Virtual reality and 3D imaging

84 Views

Interactive Low Latency Video Streaming Of Volumetric Content

Low latency video streaming of volumetric content is an emerging technology to enable immersive media experiences on mobile devices. Unlike 3DoF scenarios, where users are restricted to changing their head orientation at a single position, volumetric content allows users to move freely within the scene in 6DoF. Although the processing power of mobile devices has increased considerably, streaming volumetric content directly to such devices is still challenging. High-quality volumetric content requires a high data rate and substantial network bandwidth.

icassp2020_podborski.pdf

icassp2020_podborski.pdf (624)

Categories: Multimedia communications and networking, Virtual reality and 3D imaging

278 Views

TOWARDS MODELLING OF VISUAL SALIENCY IN POINT CLOUDS FOR IMMERSIVE APPLICATIONS

Modelling human visual attention is of great importance in the field of computer vision and has been widely explored for 3D imaging. Yet, in the absence of ground truth data, it is unclear whether such predictions are in alignment with the actual human viewing behavior in virtual reality environments. In this study, we work towards solving this problem by conducting an eye-tracking experiment in an immersive 3D scene that offers 6 degrees of freedom. A wide range of static point cloud models is inspected by human subjects, while their gaze is captured in real-time.

2019-ICIP-presentation.pdf

2019-ICIP-presentation.pdf (529)

Categories: Virtual reality and 3D imaging

29 Views

BODYFITR: Robust automatic 3D human body fitting

This paper proposes BODYFITR, a fully automatic method to fit a human body model to static 3D scans with complex poses. Automatic and reliable 3D human body fitting is necessary for many applications related to healthcare, digital ergonomics, avatar creation and security, especially in industrial contexts for large-scale product design. Existing works either make prior assumptions on the pose, require manual annotation of the data or have difficulty handling complex poses.

bodyfitr_poster_icip19-final.pdf

bodyfitr poster (533)

Categories: Virtual reality and 3D imaging, Other applications of machine learning (MLR-APPL)

127 Views

FAST: Flow-Assisted Shearlet Transform for Densely-sampled Light Field Reconstruction

The Shearlet Transform (ST) is one of the most effective methods for Densely-Sampled Light Field (DSLF) reconstruction from a Sparsely-Sampled Light Field (SSLF). However, ST requires a precise disparity estimation of the SSLF. To this end, this paper employs a state-of-the-art optical flow method, PWC-Net, to estimate bidirectional disparity maps between neighboring views in the SSLF. Moreover, to take full advantage of optical flow and ST for DSLF reconstruction, a novel learning-based method, referred to as Flow-Assisted Shearlet Transform (FAST), is proposed.

ICIP2019_FAST.pdf

ICIP2019_FAST.pdf (553)

Categories: Image/Video Processing, Multimodal signal processing, Virtual reality and 3D imaging

57 Views

PPSAN: PERCEPTUAL-AWARE 3D POINT CLOUD SEGMENTATION VIA ADVERSARIAL LEARNING

Point cloud segmentation is a key problem in 3D multimedia signal processing. Existing methods usually use a single network structure trained with a per-point loss. These methods mainly focus on the geometric similarity between the prediction results and the ground truth, ignoring differences in visual perception. In this paper, we present a segmentation adversarial network to overcome these drawbacks. A discriminator is introduced to provide a perceptual loss that judges how plausible a prediction is and guides the further optimization of the segmentator.

ICASSP2019_Poster-lihy.pdf

ICASSP2019_Poster-lihy.pdf (633)

Categories: Virtual reality and 3D imaging

30 Views
