Neural network learning (MLR-NNLR)

SAPT - ICIP2024 supplementary material

Read more about SAPT - ICIP2024 supplementary material
Log in to post comments

Spatiality-Aware Prompt Tuning for Few-shot Small Object Detection - supplementary material for ICIP 2024

SpatialityPromptTuning_ICIP2024_SupplementaryMaterial.pdf

SpatialityPromptTuning_ICIP2024_SupplementaryMaterial.pdf (107)

Categories:: Neural network learning (MLR-NNLR)

36 Views

Supplementary Material for ``WrappingNet: Mesh Autoencoder via Deep Sphere Deformation''

There have been recent efforts to learn more meaningful representations via fixed length codewords from mesh data, since a mesh serves as a complete model of underlying 3D shape compared to a point cloud. However, the mesh connectivity presents new difficulties when constructing a deep learning pipeline for meshes. Previous mesh unsupervised learning approaches typically assume category-specific templates, e.g., human face/body templates.

WrappingNet_ICIP_SM.pdf

Supplementary Material for ``WrappingNet: Mesh Autoencoder via Deep Sphere Deformation'' (103)

Categories:: Neural network learning (MLR-NNLR)

26 Views

COVARIANCE-AWARE FEATURE ALIGNMENT WITH PRE-COMPUTED SOURCE STATISTICS FOR TEST-TIME ADAPTATION TO MULTIPLE IMAGE CORRUPTIONS

Real-world image recognition systems often face corrupted input images, which cause distribution shifts and degrade the performance of models. These systems often use a single prediction model in a central server and process images sent from various environments, such as cameras distributed in cities or cars. Such single models face images corrupted in heterogeneous ways in test time. Thus, they require to instantly adapt to the multiple corruptions during testing rather than being re-trained at a high cost.

presentation_ICIP2023.pdf

Presentation slide (137)

eposter_ICIP2023.pdf

Poster (129)

Categories:: Pattern recognition and classification (MLR-PATT)
Neural network learning (MLR-NNLR)

136 Views

ENABLING THE ENCODER-EMPOWERED GAN-BASED VIDEO GENERATORS FOR LONG VIDEO GENERATION

Read more about ENABLING THE ENCODER-EMPOWERED GAN-BASED VIDEO GENERATORS FOR LONG VIDEO GENERATION
Log in to post comments

We propose Recall Encoder-empowered GAN3 (REncGAN3), employing the recall mechanism to enable a standard short video (16-frame) generation model EncGAN3 for generating long videos of hundreds of frames.
The recall mechanism utilizes simple changes that enable the generation of connectable short video clips for merging into long sequences, maintaining long-duration consistency.

e-poster_TP1PC4_REncGAN3.pptx

e-poster (132)

Enabling_the_Encoder-Empowered_GAN-based_Video_Generators_for_Long_Video_Generation.pdf

paper (123)

Categories:: Neural network learning (MLR-NNLR)

41 Views

A Gradient Boosting Approach for Training Convolutional and Deep Neural Networks

Read more about A Gradient Boosting Approach for Training Convolutional and Deep Neural Networks
Log in to post comments

ICIP2023_Eposter.pptx

E-poster (149)

Categories:: Neural network learning (MLR-NNLR)

28 Views

Interpreting Intermediate Convolutional Layers of Generative CNNs Trained on Waveforms

This paper presents a technique to interpret and visualize intermediate layers in generative CNNs trained on raw speech data in an unsupervised manner. We argue that averaging over feature maps after ReLU activation in each transpose convolutional layer yields interpretable time-series data. This technique allows for acoustic analysis of intermediate layers that parallels the acoustic analysis of human speech data: we can extract F0, intensity, duration, formants, and other acoustic properties from intermediate layers in order to test where and how CNNs encode various types of information.

begus ICASSP 2023 talk.pdf

begus ICASSP 2023 talk.pdf (238)

Categories:: Neural network learning (MLR-NNLR)
Cognitive information processing (MLR-COGP)
Speech Synthesis and Generation, including TTS (SPE-SYNT)
Human Spoken Language Acquisition, Development and Learning (SLP-LADL)
Language Modeling, for Speech and SLP (SLP-LANG)

37 Views

Learning Gradients of Convex Functions with Monotone Gradient Networks

Read more about Learning Gradients of Convex Functions with Monotone Gradient Networks
Log in to post comments

While much effort has been devoted to deriving and analyzing effective convex formulations of signal processing problems, the gradients of convex functions also have critical applications ranging from gradient-based optimization to optimal transport. Recent works have explored data-driven methods for learning convex objective functions, but learning their monotone gradients is seldom studied. In this work, we propose C-MGN and M-MGN, two monotone gradient neural network architectures for directly learning the gradients of convex functions.

ICASSP Poster PDF.pdf

ICASSP Poster PDF.pdf (232)

Categories:: Neural network learning (MLR-NNLR)

32 Views

Chord-Conditioned Melody Harmonization with Controllable Harmonicity

Read more about Chord-Conditioned Melody Harmonization with Controllable Harmonicity
Log in to post comments

Melody harmonization has long been closely associated with chorales composed by Johann Sebastian Bach. Previous works rarely emphasised chorale generation conditioned on chord progressions, and there has been a lack of focus on assistive compositional tools. In this paper, we first designed a music representation that encoded chord symbols for chord conditioning, and then proposed DeepChoir, a melody harmonization system that can generate a four-part chorale for a given melody conditioned on a chord progression.

slide.pdf

slide.pdf (182)

Categories:: Applications in Music and Audio Processing (MLR-MUSI)
Neural network learning (MLR-NNLR)

9 Views

Bilateral Coarse-To-Fine Network For Point Cloud Completion

Read more about Bilateral Coarse-To-Fine Network For Point Cloud Completion
Log in to post comments

Point cloud completion aims to accurately estimate complete point clouds from partial observations. Existing methods often directly infer the missing points from the partial shape, but they suffer from limited structural information. To address this, we propose the Bilateral Coarse-to-Fine Network (BCFNet), which leverages 2D images as guidance to compensate for structural information loss. Our method introduces a multi-level codeword skip-connection to estimate structural details.

SIG-Port.zip

Paper, presentation slides, poster, and video with subtitle (149)

Categories:: Neural network learning (MLR-NNLR)

9 Views

MEET: A Monte Carlo Exploration-Exploitation Trade-off for Buffer Sampling

Read more about MEET: A Monte Carlo Exploration-Exploitation Trade-off for Buffer Sampling
Log in to post comments

Data selection is essential for any data-based optimization technique, such as Reinforcement Learning. State-of-the-art sampling strategies for the experience replay buffer improve the performance of the Reinforcement Learning agent. However, they do not incorporate uncertainty in the Q-Value estimation. Consequently, they cannot adapt the sampling strategies, including exploration and exploitation of transitions, to the complexity of the task.

MEET_Poster.pdf

Poster (143)

icassp_2023 (7).pdf

Paper (131)

Categories:: Neural network learning (MLR-NNLR)

15 Views

Neural network learning (MLR-NNLR)

Pages