Research Science: Internship opportunities - Vision, Human-Object Interaction, Robot Learning
Microsoft
Research Science: Internship opportunities - Vision, Human-Object Interaction, Robot Learning
Tokyo, Tokyo-to, Japan
Save
Overview
Microsoft Research Asia – Tokyo is committed to advancing cutting-edge AI technologies that enable deeper understanding and interaction with people, objects, and environments in the 3D real world. Our research spans a diverse range of areas, including computer vision, generative AI, 3D perception, and robotic action learning, among others relevant to Embodied AI. By pushing the boundaries of these domains, we aim to develop innovative solutions that bridge the gap between the digital and physical worlds, empowering AI systems to perceive, comprehend, and navigate complex real-world scenarios.
We are seeking a highly motivated and talented PhD student to join our team as a Research Intern. This internship offers a unique opportunity to work on cutting-edge research in the field of Vision-Language Models (VLMs), fundamental machine learning, computer vision, and Spatial AI, and Robotics, for realization of Embodied AI. The successful candidate will work together as a team of experienced researchers to tackle real-world challenges and contribute to the advancement of AI technologies.
Qualifications
Required Qualifications:
- Currently enrolled in a PhD program in Robotics, Machine Learning, Computer Science, or a related STEM field.
- Research experience in embodied AI, robotics, AI models, natural language, computer vision, demonstrated for example through research in a related PhD program and/or publications in conferences or scientific journals.
- Hands-on experience with Python and modern deep learning frameworks.
- Excellent problem-solving skills and the ability to work independently as part of a team.
- Strong communication skills and the ability to present complex ideas clearly.
Other Requirements:
- Ability to physically work from Microsoft Research Asia – Tokyo (Japan) for the duration of the internship.
- Must obtain permission from your academic advisor and commit to at least four months of internship.
Preferred/Additional Qualifications:
- Proven software engineering skills, evidenced by professional experience, internships, and impactful open-source contributions.
- Practical experience with handling data and robot learning, such as experience in Vision-Language-Action models or Hand Object Interactions.
- Familiarity with 1) robot learning for robot hand manipulation or 2) hand pose estimation techniques or 3) reasoning techniques (Chain-of-Thought) used in LLM.
Responsibilities
- Contribute to a high-impact research agenda within the context of a highly collaborative research culture alongside a team of experts in Embodied AI.
- Design and implement experiments to test new hypotheses and validate research findings.
- Communicate research findings to an interdisciplinary research team.
- Prepare technical papers, presentations, and open-source releases of research code.