Career Opportunities at Earlybird Portfolio Companies

Are you seeking a new challenge at a growing startup where you can truly make a difference, take ownership, help build a function and change the world of tomorrow for the better? Below you'll find open jobs from the entire #EBVCGang. We are also hiring at Earlybird! If you want to work with us, please send us your application.

ML Interns - Natural Language Processing: Document Understanding



Software Engineering, Data Science
Munich, Germany
Posted on Sunday, June 9, 2024

We are developing an AI Platform for the Architecture, Engineering, and Construction (AEC) industry. Our platform leverages advanced AI to enable construction domain experts to create complex use cases efficiently.


We are looking for full-time interns (for min. 6 months) to solve some cutting-edge machine learning problems and be a part of our product development.

You should have experience in implementing machine learning models in PyTorch, and be proficient in Python. Prior experience in document understanding, information extraction, OCR / OCR-free methods, or Retrieval Augmented Generation (RAG) with large-language models would be preferred.

The successful candidate will:

  • Extract and pre-process data from transactional documents with varying layouts
  • Collaborate with ML engineers on model design, experimentation and implementation
  • Collaborate with ML engineers to design a system with state-of-the-art ML components that effectively addresses customer KPIs
  • Discuss requirements with customer-facing members to understand the problem and its constraints
  • Proactively propose and implement iterative improvements
  • Propose and implement metrics to evaluate relevant KPIs
  • Integrate the solutions to a common codebase and demonstrate good software engineering practices
  • Communicate results and analysis on regular basis


  • Pursuing a PhD / Master's degree in Computer Science or a related field enrolled in a German or EU university
  • Experience in implementing machine learning models in PyTorch, specifically Large Language Models
  • Experience with document understanding
  • Experience with Retrieval Augmented Generation (RAG)
  • Experience with OCR methods, OCR-free document understanding methods, e.g., Visual Document Question Answering (visual doc-QA) and Information Extraction
  • Proficient in Python and good software engineering skills
  • Good communication and interpersonal skills
  • Ability to work in a team-oriented environment
  • Strong problem-solving skills
  • Fluent in English
  • Eligible to work in Germany


You will be a part of an inclusive start-up culture in a stimulating "work hard, play hard" environment. You will work with (and party with) great colleagues with diverse backgrounds. Team events and after-work activities are frequent at CONXAI. You will be empowered to bring new perspectives and create impact.