AIMC Topic: Visual Perception

Clear Filters Showing 41 to 50 of 348 articles

An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits.

IEEE transactions on pattern analysis and machine intelligence
Audio-visual approaches involving visual inputs have laid the foundation for recent progress in speech separation. However, the optimization of the concurrent usage of auditory and visual inputs is still an active research area. Inspired by the corti...

Using machine learning to predict judgments on Western visual art along content-representational and formal-perceptual attributes.

PloS one
Art research has long aimed to unravel the complex associations between specific attributes, such as color, complexity, and emotional expressiveness, and art judgments, including beauty, creativity, and liking. However, the fundamental distinction be...

IdeNet: Making Neural Network Identify Camouflaged Objects Like Creatures.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Camouflaged objects often blend in with their surroundings, making the perception of a camouflaged object a more complex procedure. However, most neural-network-based methods that simulate the visual information processing pathway of creatures only r...

A developmental model of audio-visual attention (MAVA) for bimodal language learning in infants and robots.

Scientific reports
A social individual needs to effectively manage the amount of complex information in his or her environment relative to his or her own purpose to obtain relevant information. This paper presents a neural architecture aiming to reproduce attention mec...

Multi-view scene matching with relation aware feature perception.

Neural networks : the official journal of the International Neural Network Society
For scene matching, the extraction of metric features is a challenging task in the face of multi-source and multi-view scenes. Aiming at the requirements of multi-source and multi-view scene matching, a siamese network model for Spatial Relation Awar...

DiagSWin: A multi-scale vision transformer with diagonal-shaped windows for object detection and segmentation.

Neural networks : the official journal of the International Neural Network Society
Recently, Vision Transformer and its variants have demonstrated remarkable performance on various computer vision tasks, thanks to its competence in capturing global visual dependencies through self-attention. However, global self-attention suffers f...

Manipulating and measuring variation in deep neural network (DNN) representations of objects.

Cognition
We explore how DNNs can be used to develop a computational understanding of individual differences in high-level visual cognition given their ability to generate rich meaningful object representations informed by their architecture, experience, and t...

SmartDetector: Automatic and vision-based approach to point-light display generation for human action perception.

Behavior research methods
Over the past four decades, point-light displays (PLD) have been integrated into psychology and psychophysics, providing a valuable means to probe human perceptual skills. Leveraging the inherent kinematic information and controllable display paramet...

Neural activity shaping in visual prostheses with deep learning.

Journal of neural engineering
The visual perception provided by retinal prostheses is limited by the overlapping current spread of adjacent electrodes. This reduces the spatial resolution attainable with unipolar stimulation. Conversely, simultaneous multipolar stimulation guided...

Adaptative machine vision with microsecond-level accurate perception beyond human retina.

Nature communications
Visual adaptive devices have potential to simplify circuits and algorithms in machine vision systems to adapt and perceive images with varying brightness levels, which is however limited by sluggish adaptation process. Here, the avalanche tuning as f...