AIMC Topic: Video Recording

Clear Filters Showing 371 to 380 of 708 articles

Designing Interpretable Recurrent Neural Networks for Video Reconstruction via Deep Unfolding.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Deep unfolding methods design deep neural networks as learned variations of optimization algorithms through the unrolling of their iterations. These networks have been shown to achieve faster convergence and higher accuracy than the original optimiza...

Mask-Guided Attention Network and Occlusion-Sensitive Hard Example Mining for Occluded Pedestrian Detection.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Pedestrian detection relying on deep convolution neural networks has made significant progress. Though promising results have been achieved on standard pedestrians, the performance on heavily occluded pedestrians remains far from satisfactory. The ma...

Real-Time 3D Facial Tracking via Cascaded Compositional Learning.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
We propose to learn a cascade of globally-optimized modular boosted ferns (GoMBF) to solve multi-modal facial motion regression for real-time 3D facial tracking from a monocular RGB camera. GoMBF is a deep composition of multiple regression models wi...

AR3D: Attention Residual 3D Network for Human Action Recognition.

Sensors (Basel, Switzerland)
At present, in the field of video-based human action recognition, deep neural networks are mainly divided into two branches: the 2D convolutional neural network (CNN) and 3D CNN. However, 2D CNN's temporal and spatial feature extraction processes are...

FASHE: A FrActal Based Strategy for Head Pose Estimation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Head pose estimation (HPE) represents a topic central to many relevant research fields and characterized by a wide application range. In particular, HPE performed using a singular RGB frame is particular suitable to be applied at best-frame-selection...

Multi-View Gait Image Generation for Cross-View Gait Recognition.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Gait recognition aims to recognize persons' identities by walking styles. Gait recognition has unique advantages due to its characteristics of non-contact and long-distance compared with face and fingerprint recognition. Cross-view gait recognition i...

GreenSea: Visual Soccer Analysis Using Broad Learning System.

IEEE transactions on cybernetics
Modern soccer increasingly places trust in visual analysis and statistics rather than only relying on the human experience. However, soccer is an extraordinarily complex game that no widely accepted quantitative analysis methods exist. The statistics...

Application of a Computer Vision Tool for Automated Glottic Tracking to Vocal Fold Paralysis Patients.

Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery
OBJECTIVES: (1) Demonstrate true vocal fold (TVF) tracking software (AGATI [Automated Glottic Action Tracking by artificial Intelligence]) as a quantitative assessment of unilateral vocal fold paralysis (UVFP) in a large patient cohort. (2) Correlate...