Deep Learning for Computer Vision

ICIP2025_Supplementary_ESCANet

Read more about ICIP2025_Supplementary_ESCANet
Log in to post comments

While deep learning based solutions, including CNNs or transformer-based architectures, have demonstrated promising results for image super-resolution (SR) tasks, their substantial depth and parameters challenge deployment on edge computing AI-enabled devices. To address this issue, we propose a lightweight single image super-resolution (SISR) model named Efficient Spatial and Channel Attentive Network (ESCANet), comprised of Spatial Enhancement Module (SEM) and Channel-wise Enhancement Module (CEM).

ID_1046_Publish_Ready_Supplementary.pdf

ID_1046_Publish_Ready_Supplementary.pdf (109)

Categories:: Image/Video Processing

46 Views

A UNIFIED DNN-BASED SYSTEM FOR INDUSTRIAL PIPELINE SEGMENTATION

Read more about A UNIFIED DNN-BASED SYSTEM FOR INDUSTRIAL PIPELINE SEGMENTATION
Log in to post comments

This paper presents a unified system tailored for autonomous pipe segmentation within an industrial setting. To this end, it is designed to analyze RGB images captured by Unmanned Aerial Vehicle (UAV)-mounted cameras to predict binary pipe segmentation maps.

ICASSP_2024_Psarras_Poster.pdf

ICASSP_2024_Psarras_Poster.pdf (153)

Categories:: Other

36 Views

OPEN-SET RECOGNITION FOR FACIAL-EXPRESSION RECOGNITION

Read more about OPEN-SET RECOGNITION FOR FACIAL-EXPRESSION RECOGNITION
Log in to post comments

We address distinguishing whether an input is a facial image by learning only a facial-expression recognition (FER) dataset.

ICIP2023_1606_CameraReady (1).pdf

camera ready paper (292)

Categories:: Other

62 Views

OPEN-SET RECOGNITION FOR FACIAL-EXPRESSION RECOGNITION

Read more about OPEN-SET RECOGNITION FOR FACIAL-EXPRESSION RECOGNITION
Log in to post comments

We address distinguishing whether an input is a facial image by learning only a facial-expression recognition (FER) dataset.

ICIP2023_1606_CameraReady (1).pdf

camera ready paper (307)

Categories:: Other

28 Views

Investigating the Potential of Auxiliary-Classifier GANs for Image Classification in Low Data Regimes

Generative Adversarial Networks (GANs) have shown promise in augmenting datasets and boosting convolutional neural networks' (CNN) performance on image classification tasks. But they introduce more hyperparameters to tune as well as the need for additional time and computational power to train supplementary to the CNN. In this work, we examine the potential for Auxiliary-Classifier GANs (AC-GANs) as a 'one-stop-shop' architecture for image classification, particularly in low data regimes.

Dravid_ICASSP_oral_2022_Slides.pdf

Dravid_ICASSP_oral_2022_Slides.pdf (471)

Categories:: Other
Other

51 Views

GENERATING THERMAL HUMAN FACES FOR PHYSIOLOGICAL ASSESSMENT. USING THERMAL SENSOR AUXILIARY LABELS

Thermal images reveal medically important physiological information about human stress, signs of inflammation, and emotional mood that cannot be seen on visible images. Providing a method to generate thermal faces from visible images would be highly valuable for the telemedicine community in order to show this medical information. To the best of our knowledge, there are limited works on visible-to-thermal (VT) face translation, and many current works go the opposite direction to generate visible faces from thermal surveillance images (TV) for law enforcement applications.

icip_ordun_PDF.pdf

PDF Presentation Slides (342)

Categories:: Image/Video Processing

12 Views

WSO-CAPS: An Automated Framework for Diagnosis of COVID-19 disease from Low and Ultra-Low Dose CT scans using Capsule Networks and Window Setting Optimization

The automatic diagnosis of lung infections using chest computed
tomography (CT) scans has been recently obtained remarkable significance,
particularly during the COVID-19 pandemic that the early
diagnosis of the disease is of utmost importance. In addition, infection
diagnosis is the main building block of most automated diagnostic/
prognostic frameworks. Recently, due to the devastating effects
of the radiation on the body caused by the CT scan, there has been
a surge in acquiring low and ultra-low-dose CT scans instead of the

ICAS2021-Presentation.pdf

Presentation slides (360)

Categories:: Image/Video Processing

18 Views

GLAUCOMA DETECTION FROM RAW CIRCUMPAPILLARY OCT IMAGES USING FULLY CONVOLUTIONAL NEURAL NETWORKS

ICIP_2020_GG_RA_AC_VN.pptx

ICIP_2020_GG_RA_AC_VN.pptx (486)

Categories:: Neural network learning (MLR-NNLR)

32 Views

CONTEXT-AWARE AUTOMATIC OCCLUSION REMOVAL

Read more about CONTEXT-AWARE AUTOMATIC OCCLUSION REMOVAL
Log in to post comments

Occlusion removal is an interesting application of image enhancement, for which, existing work suggests manually-annotated or domain-specific occlusion removal. No work tries to address automatic occlusion detection and removal as a context-aware generic problem. In this paper, we present a novel methodology to identify objects that do not relate to the image context as occlusions and remove them, reconstructing the space occupied coherently.

ICIP2019_#2011.pdf

CONTEXT-AWARE AUTOMATIC OCCLUSION REMOVAL (467)

Categories:: Image/Video Processing

17 Views

Deep Learning-based Obstacle Detection and Depth Estimation

Read more about Deep Learning-based Obstacle Detection and Depth Estimation
Log in to post comments

This paper proposed a modified YOLOv3 which has an extra object depth prediction module for obstacle detection and avoidance. We use a pre-processed KITTI dataset to train the proposed, unified model for (i) object detection and (ii) depth prediction and use the AirSim flight simulator to generate synthetic aerial images to verify that our model can be applied in different data domains.

ICIP_Deep Learning-based Obstacle Detection and Depth Estimation.pdf

ICIP_Deep Learning-based Obstacle Detection and Depth Estimation.pdf (885)

Categories:: Image/Video Processing
Neural network learning (MLR-NNLR)

319 Views

Deep Learning for Computer Vision

Pages