Research Intern - Human Intelligence at Microsoft
Redmond, Washington, United States -
Full Time


Start Date

Immediate

Expiry Date

25 Feb, 26

Salary

0.0

Posted On

27 Nov, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Research, Collaboration, Face Recognition, Multimodal Learning, Metric Learning, Representation Modeling, Generative Modeling, Vision Language Models, Large Language Models, Scalable Training, Dataset Curation, MLOps, PyTorch, JAX, Fairness Evaluation, Robustness Evaluation

Industry

Software Development

Description
Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world's best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer. Research & prototype methods in areas such as: Unified face normalization across modalities (e.g., RGB↔NIR), with joint prototype + feature learning and cross-modal alignment. Multimodal face recognition (fusion across RGB, NIR, depth/IR, audio cues where appropriate), with robustness/fairness under distribution shift. Large Language Models-aided face verification: explore Vision Language Models (VLM)/Large Language Models (LLM) pipelines that (i) use visual context in the photo (attributes, scene cues, spatiotemporal hints) to assist verification; (ii) provide interpretable rationales; and (iii) improve failure detection and human-in-the-loop triage. Efficiency & reliability: distillation/quantization/pruning, lightweight encoders/normalizers, calibration and uncertainty, liveness/antispoof integration. Evaluate thoroughly: define datasets and protocols; run ablations and benchmarks (ROC, EER, TPR@FAR, latency/memory, fairness/robustness). Production immersion: learn Windows Hello-style pipelines (signals, constraints, on-device considerations) to align research with deployment. Publish: communicate results via talks, internal tech reports, and submissions to top venues. Currently enrolled in a Master's or Ph.D. program in CS, EE, Applied Math, or related field with a focus in vision/graphics/ML. In addition to the qualifications below, you'll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter. Publications in CVPR/ICCV/ECCV/NeurIPS/ICLR/ICML/SIGGRAPH or related journals. Experience in face recognition/verification, multimodal learning/fusion, metric learning, representation or generative modeling. Depth in multimodal normalization (pre-FR normalizers, prototype learning) and RGB↔NIR FR. Experience with VLMs/LLMs (prompting, fine-tuning, tool-use) for visual reasoning, explainability, or safety. Scalable training (DDP/multi-node), dataset curation, reproducible MLOps; familiarity with liveness/FAS and fairness/robustness evaluation. Proficient PyTorch/JAX background; ability to implement/reproduce SOTA.
Responsibilities
Research Interns will collaborate with mentors and fellow interns to advance research in human intelligence. They are expected to present findings and contribute to the community during the internship.
Loading...