Abir Mohammad Hadi

Work place: Faculty of Science and Technology, American International University – Bangladesh (AIUB), Dhaka, Bangladesh

E-mail: abir45pro@gmail.com

Website:

Research Interests: Image Processing, Neural Networks, Computer Vision, Artificial Intelligence

Biography

Abir Mohammad Hadi is an undergraduate student of Computer Science and Engineering (CSE) program at the Faculty of Science and Information Technology of American International University – Bangladesh (AIUB). He is currently also serving as Research Assistant in the Department of Computer Science. His area of research is machine intelligence, computer vision, image processing, autonomous system, neural network and artificial intelligence

Author Articles
Aggressive Action Estimation: A Comprehensive Review on Neural Network Based Human Segmentation and Action Recognition

By A. F. M. Saifuddin Saif Md. Akib Shahriar Khan Abir Mohammad Hadi Rahul Prashad Karmoker Julian Gomes

DOI: https://doi.org/10.5815/ijeme.2019.01.02, Pub. Date: 8 Jan. 2019

Human action recognition has been a talked topic since machine vision was coined. With the advent of neural networks and deep learning methods, various architectures were suggested to address the problems within a context. Convolutional neural network has been the primary go-to architecture for image segmentation, flow estimation and action recognition in recent days. As the problem itself is an extended version of various sub-problems, such as frame segmentation, spatial and temporal feature extraction, motion modeling and action classification as a whole, some methods reviewed in this paper addressed sub-problems and some tried to address a single architecture to the action recognition problem. While being a success, convolution neural networks have drawbacks in its pooling methods. CapsNet, on the other hand, uses squashing function to determine the activation. Also it addresses spatiotemporal information with the normalized vector maps while CNN-based methods extracts feature map for spatial and temporal information and later augment them in a fusion layer for combining two separate feature maps. Critical review of papers provided in this work can contribute significantly in addressing human action recognition problem as a whole.

[...] Read more.
Other Articles