-
ST2083: Deep Learning for Radar Perception
The Computation Sensing team at MERL is seeking a highly motivated intern to conduct fundamental research in radar perception. Expertise in deep learning-based object detection, multiple object tracking, data association, and representation learning (detection points, heatmaps, and raw radar waveforms) is required. Previous hands-on experience on open indoor/outdoor radar datasets is a plus. Familiarity with the concept of FMCW, MIMO, and range-Doppler-angle spectrum is an asset. The intern will collaborate with a small group of MERL researchers to develop novel algorithms, design experiments with MERL in-house testbed, and prepare results for patents and publication. The expected duration of the internship is 3 months with a flexible start date.
- Research Areas: Artificial Intelligence, Computational Sensing, Computer Vision, Dynamical Systems, Machine Learning, Optimization, Signal Processing
- Host: Perry Wang
- Apply Now
-
SA2073: Multimodal scene-understanding
We are looking for a graduate student interested in helping advance the field of multimodal scene understanding, with a focus on scene understanding using natural language for robot dialog and/or indoor monitoring using a large language model. The intern will collaborate with MERL researchers to derive and implement new models and optimization methods, conduct experiments, and prepare results for publication. Internships regularly lead to one or more publications in top-tier venues, which can later become part of the intern''s doctoral work. The ideal candidates are senior Ph.D. students with experience in deep learning for audio-visual, signal, and natural language processing. Good programming skills in Python and knowledge of deep learning frameworks such as PyTorch are essential. Multiple positions are available with flexible start date (not just Spring/Summer but throughout 2024) and duration (typically 3-6 months).
- Research Areas: Artificial Intelligence, Computer Vision, Machine Learning, Robotics, Speech & Audio
- Host: Chiori Hori
- Apply Now
-
CV2119: Conditional Video Generation
We seek a highly motivated intern to conduct original research in generative models for conditional video generation. We are interested in applications to various tasks such as video generation from text, images, and diagrams. The successful candidate will collaborate with MERL researchers to design and implement new models, conduct experiments, and prepare results for publication. The candidate should be a PhD student (or postdoc) in computer vision and machine learning with a strong publication record including at least one paper in a top-tier computer vision or machine learning venue such as CVPR, ECCV, ICCV, ICML, ICLR, NeurIPS, AAAI, or TPAMI. Strong programming skills, experience developing and implementing new models in deep learning platforms such as PyTorch, and broad knowledge of machine learning and deep learning methods are expected, including experience in the latest advances in conditional video generation. Start date is flexible; duration should be at least 3 months.
- Research Areas: Artificial Intelligence, Computer Vision, Machine Learning
- Host: Tim Marks
- Apply Now
-
CV2089: Visual Localization and Mapping
MERL is looking for a highly motivated intern to work on an original research project on visual localization and mapping. A strong background in 3D computer vision is required. Experience in robot vision and/or deep learning will be valued. The successful candidate is expected to have published at least one paper in a top-tier computer vision or robotics venues, such as CVPR, ECCV, ICCV, ICRA, IROS, or RSS, along with solid programming skills in Python and/or C/C++. The position is available for graduate students on a Ph.D. track. Duration and start dates are flexible.
- Research Area: Computer Vision
- Host: Pedro Miraldo
- Apply Now