Internship Openings

21 / 75 Intern positions were found.

Mitsubishi Electric Research Labs, Inc. "MERL" provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability or genetics. In addition to federal law requirements, MERL complies with applicable state and local laws governing nondiscrimination in employment in every location in which the company has facilities. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.

MERL expressly prohibits any form of workplace harassment based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status. Improper interference with the ability of MERL's employees to perform their job duties may result in discipline up to and including discharge.

Working at MERL requires full authorization to work in the U.S and access to technology, software and other information that is subject to governmental access control restrictions, due to export controls. Employment is conditioned on continued full authorization to work in the U.S and the availability of government authorization for the release of these items, which might include without limitation, obtaining an export license or other documentation. MERL may delay commencement of employment, rescind an offer of employment, terminate employment, and/or modify job responsibilities, compensation, benefits, and/or access to MERL facilities and information systems, as MERL deems appropriate, to ensure practical compliance with applicable employment law and government access control restrictions.


  • OR2116: Collaborative robotic manipulation

    • MERL is offering a new research internship opportunity in the field of robotic manipulation. The position requires a robotics background, excellent programming skills and experience with Deep RL and Computer Vision. The position is open to graduate students on a PhD track only, and the length of the internship is three months with the possibility of extending if required. The intern is expected to disseminate this research in top tier scientific conferences such as RSS, IROS, ICRA etc., and if applicable, help with filing associated patents. Start and end dates are flexible.

    • Research Areas: Computer Vision, Machine Learning, Robotics
    • Host: Radu Corcodel
    • Apply Now
  • OR2110: Shared Autonomy for Human-Robot Interaction

    • MERL is looking for a highly motivated and qualified intern to work on human-robot interaction (HRI) research. The ideal candidate would be a Ph.D. student with a strong background in HRI, focusing on robotic manipulation, deep learning, probabilistic modeling, or reinforcement learning. Several topics are available for consideration, including Intent Recognition in Multi-Object Scenes, Shared Autonomy, Cooperative Manipulation, Human-Robot Handovers, and Representation Learning for HRI. Experience working with robotics hardware and physics engine simulators like PyBullet, Issac Gym, or Mujoco is preferred. Proficiency in Python programming is necessary, and experience with ROS is a plus. The successful candidate will collaborate with MERL researchers, and publication of the relevant results is expected. The start date is flexible, and the expected duration of the internship is 3-4 months. Interested candidates are encouraged to apply with their recent CV and list of publications in related topics.

    • Research Areas: Artificial Intelligence, Computer Vision, Robotics
    • Host: Siddarth Jain
    • Apply Now
  • OR2111: Deep Learning for Robotic Manipulation

    • MERL is seeking a highly motivated and qualified intern to work on deep learning for visual feedback in robotic manipulation. The ideal candidate would be a Ph.D. student with a strong background in deep learning and robotic manipulation. Several topics are available for consideration, including Object Pose Estimation, Goal-driven Grasping, Diffusion policy for Industrial Tasks, and Deformable Object Manipulation. The project requires the development of novel algorithms with implementation and evaluation on a robotic platform. Preferred qualifications include experience working with a physics engine simulator like PyBullet, Isaac Gym, or Mujoco, proficiency in Python programming, and experience with ROS. The successful candidate will collaborate with MERL researchers, and publication of relevant results is expected. The start date is flexible, and the expected duration of the internship is 3-4 months. Interested candidates are encouraged to apply with their recent CV and a list of publications in related topics

    • Research Areas: Artificial Intelligence, Computer Vision, Robotics
    • Host: Siddarth Jain
    • Apply Now
  • EA2093: Control for High Precision Motion Systems

    • MERL is seeking a highly motivated and qualified individual to conduct research in the intersection of control theory and learning to achieve high precision motion with guaranteed safety and robustness. The ideal candidate should have solid backgrounds in mechanics, uncertainty quantification, control theory, and reinforcement learning, and strong coding skills. Prior experience on ultra-high precision motion control system and visual servoing is a big plus. Ph.D. students in mechatronics and control are encouraged to apply. Start date for this internship is flexible and the duration is about 3 months.

    • Research Areas: Computer Vision, Control, Machine Learning
    • Host: Yebin Wang
    • Apply Now
  • ST2083: Deep Learning for Radar Perception

    • The Computation Sensing team at MERL is seeking a highly motivated intern to conduct fundamental research in radar perception. Expertise in deep learning-based object detection, multiple object tracking, data association, and representation learning (detection points, heatmaps, and raw radar waveforms) is required. Previous hands-on experience on open indoor/outdoor radar datasets is a plus. Familiarity with the concept of FMCW, MIMO, and range-Doppler-angle spectrum is an asset. The intern will collaborate with a small group of MERL researchers to develop novel algorithms, design experiments with MERL in-house testbed, and prepare results for patents and publication. The expected duration of the internship is 3 months with a flexible start date.

    • Research Areas: Artificial Intelligence, Computational Sensing, Computer Vision, Dynamical Systems, Machine Learning, Optimization, Signal Processing
    • Host: Perry Wang
    • Apply Now
  • CA2129: Perception-Aware Control

    • MERL is seeking a highly motivated and qualified intern to collaborate with the Control for Autonomy team in research on planning and control algorithms accounting for perception and sensing uncertainty, e.g., the surrounding environment of a vehicle. The ideal candidate is expected to be working towards a PhD with strong emphasis in control or planning algorithms, and to have interest and background in as many as possible of: predictive control algorithms for linear and nonlinear systems, stochastic constrained control, e.g., chance constraints, stochastic optimization, statistical estimation, perception system modeling, Bayesian inference, and vehicle modeling and control. Good programming skills in MATLAB, Python or C/C++ are required. The expected start of of the internship is flexible, with duration of 3--6 months.

    • Research Areas: Computer Vision, Control, Optimization
    • Host: Karl Berntorp
    • Apply Now
  • SA2072: Multimodal Representation Learning

    • MERL is offering internship positions for PhD candidates interested in audio-visual-language multimodal learning. The role involves understanding the complex interplay between sound, visuals, and language, aiming to drive next-generation AI applications. Interns will work closely with a group of researchers at MERL to develop and implement models, with an emphasis on integrating different sensory modalities. Internships regularly lead to one or more publications in top-tier venues, which can later become part of the intern''s doctoral work. Ideal candidates are senior Ph.D. students in fields such as Audio Machine Learning, Computer Vision, or Natural Language Processing. Experience in multimodal learning is preferable. Good programming skills in Python and knowledge of deep learning frameworks such as PyTorch are essential. Multiple positions are available with flexible start date (not just Spring/Summer but throughout 2024) and duration (typically 3-6 months).

    • Research Areas: Artificial Intelligence, Computer Vision, Machine Learning, Speech & Audio
    • Host: Sameer Khurana
    • Apply Now
  • SA2073: Multimodal scene-understanding

    • We are looking for a graduate student interested in helping advance the field of multimodal scene understanding, with a focus on scene understanding using natural language for robot dialog and/or indoor monitoring using a large language model. The intern will collaborate with MERL researchers to derive and implement new models and optimization methods, conduct experiments, and prepare results for publication. Internships regularly lead to one or more publications in top-tier venues, which can later become part of the intern''s doctoral work. The ideal candidates are senior Ph.D. students with experience in deep learning for audio-visual, signal, and natural language processing. Good programming skills in Python and knowledge of deep learning frameworks such as PyTorch are essential. Multiple positions are available with flexible start date (not just Spring/Summer but throughout 2024) and duration (typically 3-6 months).

    • Research Areas: Artificial Intelligence, Computer Vision, Machine Learning, Robotics, Speech & Audio
    • Host: Chiori Hori
    • Apply Now
  • CV2118: Vital Signs from video using computer vision and machine learning

    • MERL is seeking a highly motivated intern to conduct original research in estimating vital signs such as heart rate, heart rate variability, and blood pressure from video of a person. The successful candidate will use the latest methods in deep learning, computer vision, and signal processing to derive and implement new models, collect data, conduct experiments, and prepare results for publication, all in collaboration with MERL researchers. The candidate should be a Ph.D. student in computer vision with a strong publication record and experience in computer vision, signal processing, machine learning, and health monitoring. The successful candidate is expected to have published at least one paper in a top-tier computer vision or machine learning venue, such as CVPR, ECCV, ICCV, ICML, ICLR, NeurIPS, or AAAI, and possess strong programming skills in Python and Pytorch. Start date is flexible; duration should be at least 3 months.

    • Research Areas: Computer Vision, Machine Learning, Signal Processing
    • Host: Tim Marks
    • Apply Now
  • CV2100: Novel View Synthesis of Dynamic Scenes

    • MERL is looking for a highly motivated intern to work on an original research project in rendering dynamic scenes from novel views, with a potential use-case in outer space settings. A strong background in 3D computer vision and/or computer graphics is required. Experience in the latest advances of deep learning, such as neural radiance fields and Gaussian splatting, is an added plus and will be valued. The successful candidate is expected to have published at least one paper in a top-tier computer vision/graphics or machine learning venue, such as CVPR, ECCV, ICCV, SIGGRAPH, 3DV, ICML, ICLR, NeurIPS or AAAI, and possess solid programming skills in Python and popular deep learning frameworks like Pytorch. The goal would be for such a candidate to collaborate with MERL researchers to develop algorithms and prepare manuscripts for scientific publications. Successful applicants are typically graduate students on a Ph.D. track or recent Ph.D. graduates. Duration and start dates are flexible but internship is expected to last for at least 3 months.

    • Research Areas: Computer Vision
    • Host: Moitreya Chatterjee
    • Apply Now
  • CV2121: Simulators and Generative AI for Task Driven Data Generation

    • MERL is looking for a self-motivated intern to develop a general-purpose simulation platform for generating computer vision datasets defined by downstream tasks. The ideal intern must have strong background in computer graphics, computer vision, and machine learning, as well as experience in using the latest graphics simulation toolboxes and physics engines. Working knowledge of recent multimodal generative AI methods will be a strong plus. The intern is expected to collaborate with researchers in the computer vision team at MERL to develop algorithms and prepare manuscripts for scientific publications.

    • Research Areas: Artificial Intelligence, Computer Vision
    • Host: Anoop Cherian
    • Apply Now
  • CV2071: Video Anomaly Detection

    • MERL is looking for a self-motivated intern to work on the problem of video anomaly detection. The intern will help to develop new ideas for improving the state of the art in detecting anomalous activity in videos. The ideal candidate would be a Ph.D. student with a strong background in machine learning and computer vision and some experience with video anomaly detection in particular. Proficiency in Python programming and Pytorch is necessary. The successful candidate is expected to have published at least one paper in a top-tier computer vision or machine learning venue, such as CVPR, ECCV, ICCV, ICML, ICLR, NeurIPS or AAAI. The intern will collaborate with MERL researchers to develop and test algorithms and prepare manuscripts for scientific publications. The internship is for 3 months and the start date is flexible.

    • Research Areas: Computer Vision
    • Host: Mike Jones
    • Apply Now
  • CV2119: Conditional Video Generation

    • We seek a highly motivated intern to conduct original research in generative models for conditional video generation. We are interested in applications to various tasks such as video generation from text, images, and diagrams. The successful candidate will collaborate with MERL researchers to design and implement new models, conduct experiments, and prepare results for publication. The candidate should be a PhD student (or postdoc) in computer vision and machine learning with a strong publication record including at least one paper in a top-tier computer vision or machine learning venue such as CVPR, ECCV, ICCV, ICML, ICLR, NeurIPS, AAAI, or TPAMI. Strong programming skills, experience developing and implementing new models in deep learning platforms such as PyTorch, and broad knowledge of machine learning and deep learning methods are expected, including experience in the latest advances in conditional video generation. Start date is flexible; duration should be at least 3 months.

    • Research Areas: Artificial Intelligence, Computer Vision, Machine Learning
    • Host: Tim Marks
    • Apply Now
  • CV2084: Deep Learning for Cloud Removal from Satellite Images

    • MERL is seeking an intern to conduct research for cloud removal from satellite images. The focus will be on building novel deep learning algorithms for this application. A good candidate is a PhD student with experience in deep learning and computational imaging with a publication record. Prior knowledge and experience in deep image restoration algorithms e.g., deep algorithm unrolling, using deep priors such as diffusion models are strongly preferred. Good Python and Pytorch skills are required. Publication of results in a conference or a journal is expected. The expected duration of the internship is 3 months and the start date is flexible.

    • Research Areas: Computational Sensing, Computer Vision, Machine Learning, Signal Processing
    • Host: Suhas Lohit
    • Apply Now
  • CV2078: Anomaly Localization for Industrial Inspection

    • MERL is looking for a self-motivated intern to work on anomaly localization in the industrial inspection setting using computer vision. The relevant topics in the scope include (but are not limited to): cross-view image anomaly localization, how to train one model for multiple views and defect types, how to incorporate large foundation models in image anomaly localization, etc. Candidates with experience in image anomaly localization in industrial inspection settings (e.g., MVTec-AD or VisA datasets) are strongly preferred. The ideal candidate would be a PhD student with a strong background in computer vision and machine learning, and the candidate is expected to have published at least one paper in a top-tier computer vision, machine learning, or artificial intelligence venue, such as CVPR, ECCV, ICCV, ICML, ICLR, NeurIPS, or AAAI. Proficiency in Python programming and familiarity in at least one deep learning framework are necessary. The ideal candidate is required to collaborate with MERL researchers to develop algorithms and prepare manuscripts for scientific publications. The duration of the internship is ideally to be at least 3 months with a flexible start date.

    • Research Areas: Artificial Intelligence, Computer Vision, Machine Learning
    • Host: Kuan-Chuan Peng
    • Apply Now
  • CV2113: Embodied Multimodal Large Language Models

    • MERL is looking for a self-motivated intern to work on problems at the intersection of multimodal large language models and embodied AI in dynamic indoor environments. The ideal candidate would be a PhD student with a strong background in machine learning and computer vision, as demonstrated by top-tier publications. The candidate must have prior experience in audio-visual AI, large language models, and simulators such as Habitat/SoundSpaces. Hands on experience in using animated 3D human shape models (e.g., SMPL and variants) is desired. The intern is expected to collaborate with researchers in computer vision and speech teams at MERL to develop algorithms and prepare manuscripts for scientific publications.

    • Research Areas: Artificial Intelligence, Computer Vision, Machine Learning, Speech & Audio
    • Host: Anoop Cherian
    • Apply Now
  • CV2089: Visual Localization and Mapping

    • MERL is looking for a highly motivated intern to work on an original research project on visual localization and mapping. A strong background in 3D computer vision is required. Experience in robot vision and/or deep learning will be valued. The successful candidate is expected to have published at least one paper in a top-tier computer vision or robotics venues, such as CVPR, ECCV, ICCV, ICRA, IROS, or RSS, along with solid programming skills in Python and/or C/C++. The position is available for graduate students on a Ph.D. track. Duration and start dates are flexible.

    • Research Areas: Computer Vision
    • Host: Pedro Miraldo
    • Apply Now
  • CV2088: Implicit Neural Networks for Object Representation & Reconstruction

    • MERL is looking for a highly motivated intern to work on an original research project in 3D representation and reconstruction using implicit neural networks. A strong background in 3D computer vision is required. Experience in deep learning will be valued. The successful candidate is expected to have published or submitted at least one paper in a top-tier computer vision venue, such as CVPR, ECCV, ICCV, NeurIPS, ICLR, and ICML, along with solid programming skills in Python and/or C++. The position is available for graduate students on a Ph.D. track. Duration and start dates are flexible.

    • Research Areas: Computer Vision
    • Host: Pedro Miraldo
    • Apply Now
  • CV2070: Open-World Object Detection

    • MERL is looking for a highly motivated intern to work on an original research project in open-world object detection. A strong background in computer vision and deep learning is required. Experience in the latest advances in object detection, incremental learning, and open-world object detection is an added plus and will be valued. The successful candidate is expected to have published at least one paper in a top-tier computer vision or machine learning venue, such as CVPR, ECCV, ICCV, ICML, ICLR, NeurIPS or AAAI, and possess solid programming skills in Python and popular deep learning frameworks like Pytorch. The position is available for graduate students on a Ph.D. track. Duration and start dates are flexible but are expected to last for at least 3 months. This internship is preferred to be onsite at MERL’s office in Cambridge, MA

    • Research Areas: Computer Vision
    • Host: Mike Jones
    • Apply Now
  • CV2077: Visual-LiDAR fused object detection and recognition

    • MERL is looking for a self-motivated intern to work on visual-LiDAR fused object detection and recognition using computer vision. The relevant topics in the scope include (but are not limited to): domain adaptation or generalization in visual-LiDAR object detection, data-efficient methods for visual-LiDAR object detection, open-set visual-LiDAR object detection and recognition, small object detection with visual-LiDAR input, etc. The candidates with experiences of object recognition in LiDAR are strongly preferred. The ideal candidate would be a PhD student with a strong background in computer vision and machine learning, and the candidate is expected to have published at least one paper in a top-tier computer vision, machine learning, or artificial intelligence venue, such as CVPR, ECCV, ICCV, ICML, ICLR, NeurIPS, or AAAI. Proficiency in Python programming and familiarity in at least one deep learning framework are necessary. The ideal candidate is required to collaborate with MERL researchers to develop algorithms and prepare manuscripts for scientific publications. The duration of the internship is ideally to be at least 3 months with a flexible start date.

    • Research Areas: Artificial Intelligence, Computer Vision, Machine Learning
    • Host: Kuan-Chuan Peng
    • Apply Now
  • CV2101: Improved Generalizability of Multimodal Learning Techniques

    • MERL is looking for a highly motivated intern to work on an original research project that seeks to improve the generalizability of multimodal learning techniques. A strong background in computer vision and deep learning is required. Experience in audio-visual (multimodal) learning and continual learning are an added plus and will be valued. The successful candidate is expected to have published at least one paper in a top-tier computer vision or machine learning venue, such as CVPR, ECCV, ICCV, ICML, ICLR, NeurIPS or AAAI, and possess solid programming skills in Python and popular deep learning frameworks like Pytorch. The goal would be for such a candidate to collaborate with MERL researchers to develop algorithms and prepare manuscripts for scientific publications. Successful applicants are typically graduate students on a Ph.D. track or recent Ph.D. graduates. Duration and start dates are flexible but the internship is expected to last at least 3 months.

    • Research Areas: Computer Vision
    • Host: Moitreya Chatterjee
    • Apply Now