[Hiring] Research Intern - Multimodal Data Retrival (Alibaba Qwen Team)

The Mission Data quality determines the upper bound of model potential. We are building a high-performance Multimodal Semantic Retrieval System designed to diagnose, curate, and filter 100 Billion+ multimodal data points with extreme speed and flexibility.

The Challenge We are looking for collaborators to build the “search engine” for our training data:

  • Scale: Architecting systems to handle 100B+ multimodal data entries.
  • Deep Analysis: Enabling rapid quantification of patterns, bias detection, and live traffic analysis.
  • Retrieval: Developing efficient semantic retrieval algorithms to pinpoint specific high-value data within massive datasets.

Who We Are Looking For We are primarily seeking Research Interns (exceptional Full-time candidates will also be considered):

  • Background: Strong foundation in Information Retrieval (IR), NLP, Computer Vision, or Big Data Systems.
  • Expertise: Experience with Vector Databases, Multimodal Representation Learning (CLIP, SigLIP), or LLM Supervised Fine-Tuning (SFT) pipelines.
  • Mindset: Focused on data-centric AI and infrastructure optimization.

Resources & Benefits

  • Top-tier Infrastructure: Access to massive computing resources and state-of-the-art AI tools (Cursor, Gemini/Claude Pro, etc.).
  • Competitive Package: Market-leading salary, top-spec MacBook, and full catering support.

Apply

  • Email: zhengjie.hsj@alibaba-inc.com
  • Note: Please attach your resume and highlight relevant experience in data systems or model fine-tuning.



Enjoy Reading This Article?

Here are some more articles you might like to read next:

  • [Open Source] Team up Core Contributors - Easymath-wiki
  • [Hiring] Research Intern - Live Avatar (Alibaba Qwen Team)