Anoop Cherian

- Phone: 617-621-7519
- Email:
-
Position:
Research / Technical Staff
Principal Research Scientist -
Education:
Ph.D., University of Minnesota, 2013 -
Research Areas:
- Computer Vision
- Artificial Intelligence
- Machine Learning
- Speech & Audio
- Human-Computer Interaction
External Links:
Anoop's Quick Links
-
Biography
Anoop was a postdoctoral researcher in the LEAR group at Inria from 2012-2015 where his research was on the estimation and tracking of human poses in videos. From 2015-2017, he was a Research Fellow at the Australian National University, where he worked on the problem of recognizing human activities in video sequences. Anoop is the recipient of the Best Student Paper award at the Intl. Conference on Image Processing in 2012. Currently, his research focus is on modeling the semantics of video data.
-
Recent News & Events
-
NEWS MERL presenting 8 papers at ICASSP 2022 Date: May 22, 2022 - May 27, 2022
Where: Singapore
MERL Contacts: Anoop Cherian; Chiori Hori; Toshiaki Koike-Akino; Jonathan Le Roux; Tim K. Marks; Philip V. Orlik; Kuan-Chuan Peng; Pu (Perry) Wang; Gordon Wichern
Research Areas: Artificial Intelligence, Computer Vision, Signal Processing, Speech & AudioBrief- MERL researchers are presenting 8 papers at the IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP), which is being held in Singapore from May 22-27, 2022. A week of virtual presentations also took place earlier this month.
Topics to be presented include recent advances in speech recognition, audio processing, scene understanding, computational sensing, and classification.
ICASSP is the flagship conference of the IEEE Signal Processing Society, and the world's largest and most comprehensive technical conference focused on the research advances and latest technological development in signal and information processing. The event attracts more than 2000 participants each year.
- MERL researchers are presenting 8 papers at the IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP), which is being held in Singapore from May 22-27, 2022. A week of virtual presentations also took place earlier this month.
-
NEWS MERL work on scene-aware interaction featured in IEEE Spectrum Date: March 1, 2022
MERL Contacts: Anoop Cherian; Chiori Hori; Jonathan Le Roux; Tim K. Marks; Alan Sullivan; Anthony Vetro
Research Areas: Artificial Intelligence, Computer Vision, Machine Learning, Speech & AudioBrief- MERL's research on scene-aware interaction was recently featured in an IEEE Spectrum article. The article, titled "At Last, A Self-Driving Car That Can Explain Itself" and authored by MERL Senior Principal Research Scientist Chiori Hori and MERL Director Anthony Vetro, gives an overview of MERL's efforts towards developing a system that can analyze multimodal sensing information for highly natural and intuitive interaction with humans through context-dependent generation of natural language. The technology recognizes contextual objects and events based on multimodal sensing information, such as images and video captured with cameras, audio information recorded with microphones, and localization information measured with LiDAR.
Scene-Aware Interaction for car navigation, one target application that the article focuses on, will provide drivers with intuitive route guidance. Scene-Aware Interaction technology is expected to have wide applicability, including human-machine interfaces for in-vehicle infotainment, interaction with service robots in building and factory automation systems, systems that monitor the health and well-being of people, surveillance systems that interpret complex scenes for humans and encourage social distancing, support for touchless operation of equipment in public areas, and much more. MERL's Scene-Aware Interaction Technology had previously been featured in a Mitsubishi Electric Corporation Press Release.
IEEE Spectrum is the flagship magazine and website of the IEEE, the world’s largest professional organization devoted to engineering and the applied sciences. IEEE Spectrum has a circulation of over 400,000 engineers worldwide, making it one of the leading science and engineering magazines.
- MERL's research on scene-aware interaction was recently featured in an IEEE Spectrum article. The article, titled "At Last, A Self-Driving Car That Can Explain Itself" and authored by MERL Senior Principal Research Scientist Chiori Hori and MERL Director Anthony Vetro, gives an overview of MERL's efforts towards developing a system that can analyze multimodal sensing information for highly natural and intuitive interaction with humans through context-dependent generation of natural language. The technology recognizes contextual objects and events based on multimodal sensing information, such as images and video captured with cameras, audio information recorded with microphones, and localization information measured with LiDAR.
See All News & Events for Anoop -
-
Research Highlights
-
MERL Publications
- "Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2022.BibTeX TR2022-019 PDF
- @inproceedings{Shah2022apr,
- author = {Shah, Ankit Parag and Geng, Shijie and Gao, Peng and Cherian, Anoop and Hori, Takaaki and Marks, Tim K. and Le Roux, Jonathan and Hori, Chiori},
- title = {Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning},
- booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
- year = 2022,
- month = apr,
- url = {https://www.merl.com/publications/TR2022-019}
- }
, - "Overview of Audio Visual Scene-Aware Dialog with Reasoning Track for Natural Language Generation in DSTC10", The 10th Dialog System Technology Challenge Workshop at AAAI, February 2022.BibTeX TR2022-016 PDF
- @inproceedings{Hori2022feb,
- author = {Hori, Chiori and Shah, Ankit Parag and Geng, Shijie and Gao, Peng and Cherian, Anoop and Hori, Takaaki and Le Roux, Jonathan and Marks, Tim K.},
- title = {Overview of Audio Visual Scene-Aware Dialog with Reasoning Track for Natural Language Generation in DSTC10},
- booktitle = {The 10th Dialog System Technology Challenge Workshop at AAAI},
- year = 2022,
- month = feb,
- url = {https://www.merl.com/publications/TR2022-016}
- }
, - "(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering", AAAI Conference on Artificial Intelligence, February 2022.BibTeX TR2022-014 PDF Video Presentation
- @inproceedings{Cherian2022feb,
- author = {Cherian, Anoop and Hori, Chiori and Marks, Tim K. and Le Roux, Jonathan},
- title = {(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering},
- booktitle = {AAAI Conference on Artificial Intelligence},
- year = 2022,
- month = feb,
- url = {https://www.merl.com/publications/TR2022-014}
- }
, - "Max-Margin Contrastive Learning", AAAI Conference on Artificial Intelligence, February 2022.BibTeX TR2022-013 PDF
- @inproceedings{Shah2022feb,
- author = {Shah, Anshul and Sra, Suvrit and Chellappa, Rama and Cherian, Anoop},
- title = {Max-Margin Contrastive Learning},
- booktitle = {AAAI Conference on Artificial Intelligence},
- year = 2022,
- month = feb,
- url = {https://www.merl.com/publications/TR2022-013}
- }
, - "MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation", AAAI Conference on Artificial Intelligence, February 2022.BibTeX TR2022-011 PDF Video
- @inproceedings{Medin2022feb,
- author = {Medin, Safa C. and Egger, Bernhard and Cherian, Anoop and Wang, Ye and Tenenbaum, Joshua B. and Liu, Xiaoming and Marks, Tim K.},
- title = {MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation},
- booktitle = {AAAI Conference on Artificial Intelligence},
- year = 2022,
- month = feb,
- url = {https://www.merl.com/publications/TR2022-011}
- }
,
- "Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2022.
-
Other Publications
- "Second-order Temporal Pooling for Action Recognition", International Journal of Computer Vision (IJCV), 2018.BibTeX
- @Article{cherian2018ijcv,
- author = {Cherian, Anoop and Gould, Stephen},
- title = {Second-order Temporal Pooling for Action Recognition},
- journal = {International Journal of Computer Vision (IJCV)},
- year = 2018,
- publisher = {Springer}
- }
, - "Visual Permutation Learning", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2018.BibTeX
- @Article{cherian2018permutation,
- author = {Santa Cruz, Rodrigo and Fernando, Basura and Cherian, Anoop and Gould, Stephen},
- title = {Visual Permutation Learning},
- journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)},
- year = 2018,
- publisher = {IEEE}
- }
, - "Video Representation Learning Using Discriminative Pooling", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.BibTeX
- @Inproceedings{cherian_representation_cvpr18,
- author = {Wang, Jue and Cherian, Anoop and Porikli, Fatih and Gould, Stephen},
- title = {Video Representation Learning Using Discriminative Pooling},
- booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
- year = 2018
- }
, - "Scalable Dense Non-rigid Structure-from-Motion: A Grassmannian Perspective", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.BibTeX
- @Inproceedings{cherian_rigid_cvpr18,
- author = {Kumar, Suryansh and Cherian, Anoop and Dai, Yuchao and Li, Hongdong},
- title = {Scalable Dense Non-rigid Structure-from-Motion: A Grassmannian Perspective},
- booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
- year = 2018
- }
, - "Non-Linear Temporal Subspace Representations for Activity Recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.BibTeX
- @Inproceedings{cherian_temporal_cvpr18,
- author = {Cherian, Anoop and Sra, Suvrit and Gould, Stephen and Hartley, Richard},
- title = {Non-Linear Temporal Subspace Representations for Activity Recognition},
- booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
- year = 2018
- }
, - "Generalized Rank Pooling for Activity Recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.BibTeX
- @Inproceedings{cherian2017generalized,
- author = {Cherian, Anoop and Fernando, Basura and Harandi, Mehrtash and Gould, Stephen},
- title = {Generalized Rank Pooling for Activity Recognition},
- booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
- year = 2017
- }
, - "Learning Discriminative Alpha-Beta Divergences for Positive Definite Matrices", International Conference on Computer Vision (ICCV), 2017.BibTeX
- @Inproceedings{cherian_rigid_iccv17,
- author = {Cherian, Anoop and Stanitsas, Panagiotis and Harandi, Mehrtash and Morellas, Vassilios and Papanikolopoulos, Nikolaos},
- title = {Learning Discriminative Alpha-Beta Divergences for Positive Definite Matrices},
- booktitle = {International Conference on Computer Vision (ICCV)},
- year = 2017
- }
, - "DeepPermNet: Visual Permutation Learning", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.BibTeX
- @Inproceedings{cruz2017deeppermnet,
- author = {Cruz, Rodrigo Santa and Fernando, Basura and Cherian, Anoop and Gould, Stephen},
- title = {DeepPermNet: Visual Permutation Learning},
- booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
- year = 2017
- }
, - "Bayesian Non-Parametric clustering for positive definite matrices", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2016.BibTeX
- @Article{cherian2016bayesian,
- author = {Cherian, Anoop and Morellas, Vassilios and Papanikolopoulos, Nikolaos},
- title = {Bayesian Non-Parametric clustering for positive definite matrices},
- journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)},
- year = 2016,
- publisher = {IEEE}
- }
, - "Sparse coding for third-order super-symmetric tensor descriptors with application to texture recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.BibTeX
- @Inproceedings{koniusz2016sparse,
- author = {Koniusz, Piotr and Cherian, Anoop},
- title = {Sparse coding for third-order super-symmetric tensor descriptors with application to texture recognition},
- booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
- year = 2016
- }
, - "Tensor representations via kernel linearization for action recognition from 3D skeletons", European Conference on Computer Vision (ECCV), 2016.BibTeX
- @Inproceedings{koniusz2016tensor,
- author = {Koniusz, Piotr and Cherian, Anoop and Porikli, Fatih},
- title = {Tensor representations via kernel linearization for action recognition from 3D skeletons},
- booktitle = {European Conference on Computer Vision (ECCV)},
- year = 2016,
- organization = {Springer}
- }
, - "Mixing body-part sequences for human pose estimation", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014.BibTeX
- @Inproceedings{cherian2014mixing,
- author = {Cherian, Anoop and Mairal, Julien and Alahari, Karteek and Schmid, Cordelia},
- title = {Mixing body-part sequences for human pose estimation},
- booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
- year = 2014
- }
, - "Nearest neighbors using compact sparse codes", International Conference on Machine Learning (ICML), 2014.BibTeX
- @Inproceedings{cherian2014nearest,
- author = {Cherian, Anoop},
- title = {Nearest neighbors using compact sparse codes},
- booktitle = {International Conference on Machine Learning (ICML)},
- year = 2014
- }
, - "Riemannian sparse coding for positive definite matrices", European Conference on Computer Vision (ECCV), 2014.BibTeX
- @Inproceedings{cherian2014riemannian,
- author = {Cherian, Anoop and Sra, Suvrit},
- title = {Riemannian sparse coding for positive definite matrices},
- booktitle = {European Conference on Computer Vision (ECCV)},
- year = 2014,
- organization = {Springer}
- }
, - "Jensen-Bregman logdet divergence with application to efficient similarity search for covariance matrices", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2013.BibTeX
- @Article{cherian2013jensen,
- author = {Cherian, Anoop and Sra, Suvrit and Banerjee, Arindam and Papanikolopoulos, Nikolaos},
- title = {Jensen-Bregman logdet divergence with application to efficient similarity search for covariance matrices},
- journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)},
- year = 2013,
- publisher = {IEEE}
- }
, - "Dirichlet process mixture models on symmetric positive definite matrices for appearance clustering in video surveillance applications", Computer Vision and Pattern Recognition (CVPR), 2011.BibTeX
- @Inproceedings{cherian2011dirichlet,
- author = {Cherian, Anoop and Morellas, Vassilios and Papanikolopoulos, Nikolaos and Bedros, Saad J},
- title = {Dirichlet process mixture models on symmetric positive definite matrices for appearance clustering in video surveillance applications},
- booktitle = {Computer Vision and Pattern Recognition (CVPR)},
- year = 2011
- }
, - "Efficient similarity search for covariance matrices via the Jensen-Bregman LogDet divergence", International Conference on Computer Vision (ICCV), 2011.BibTeX
- @Inproceedings{cherian2011efficient,
- author = {Cherian, Anoop and Sra, Suvrit and Banerjee, Arindam and Papanikolopoulos, Nikolaos},
- title = {Efficient similarity search for covariance matrices via the Jensen-Bregman LogDet divergence},
- booktitle = {International Conference on Computer Vision (ICCV)},
- year = 2011
- }
, - "Generalized dictionary learning for symmetric positive definite matrices with application to nearest neighbor retrieval", Machine Learning and Knowledge Discovery in Databases (ECML), 2011.BibTeX
- @Article{sra2011generalized,
- author = {Sra, Suvrit and Cherian, Anoop},
- title = {Generalized dictionary learning for symmetric positive definite matrices with application to nearest neighbor retrieval},
- journal = {Machine Learning and Knowledge Discovery in Databases (ECML)},
- year = 2011
- }
, - "Accurate 3D ground plane estimation from a single image", International Conference on Robotics and Automation, 2009.BibTeX
- @Inproceedings{cherian2009accurate,
- author = {Cherian, Anoop and Morellas, Vassilios and Papanikolopoulos, Nikolaos},
- title = {Accurate 3D ground plane estimation from a single image},
- booktitle = {International Conference on Robotics and Automation},
- year = 2009
- }
,
- "Second-order Temporal Pooling for Action Recognition", International Journal of Computer Vision (IJCV), 2018.
-
Software Downloads
-
Videos
-
MERL Issued Patents
-
Title: "System and Method for a Dialogue Response Generation System"
Inventors: Hori, Chiori; Cherian, Anoop; Marks, Tim; Hori, Takaaki
Patent No.: 11,264,009
Issue Date: Mar 1, 2022 -
Title: "Scene-Aware Video Dialog"
Inventors: Geng, Shijie; Gao, Peng; Cherian, Anoop; Hori, Chiori; Le Roux, Jonathan
Patent No.: 11,210,523
Issue Date: Dec 28, 2021
-
Title: "System and Method for a Dialogue Response Generation System"