Copenhagen

Data Engineer

Join us as part of a team building the scientific intelligence platform powering the future of biotech.

Our customers are R&D teams in biotech, pharma and research institutions. They fight cancer, climate change and the next pandemic using biology. We are born out of the most concentrated life science cluster in the world and have access to all actors in the ecosystem. With Amass, they can triangulate data, evidence across datasets - from patents and trials to internal reports and ELNs - to accelerate decisions and discoveries that human and planetary health needs.

We’re setting our team now and are hiring to build the future-of-work and ecosystem together. You'll work directly with the founders (CTO and CEO) and world leading designers and thinkers to shape the data architecture, ML stack, and applied intelligence layer at the heart of how life science innovation happens in the AI era.

We are backed by world leading funds and operators/founders.

What We’re Building

Scientific knowledge is scattered, siloed, and slow to search. Amass integrates across this fragmented landscape - patents, papers, regulatory filings, proprietary documents and data - and applies AI to create a structured, cross-linked, and queryable knowledge layer.

We build our own domain-specific data pipelines and machine learning models to capture subtle scientific signals. We go beyond simple search and retrieval of information; enabling users to triangulate across multiple sources and trace the evidence behind any output. We’re not just summarizing documents; we’re helping teams reason with them.

The result: faster insight cycles, smarter pipelines, and more effective research-to-decision handoffs.

Your Role 

We’re hiring for ownership and initiative to join our technical staff. You’ll work on the core systems and ideas enabling our future success.

Your core tasks at amass

  • Build fault tolerant and reliable ingestion pipelines across diverse formats (APIs, PDFs, SharePoint, Excel, internal tools)
  • Design a scalable data foundation for real-time scientific querying and reasoning
  • Maintain and optimize (vector) databases, embedding stores, and provenance tracking
  • Develop and maintain pipelines that serve (un)structured data, embeddings and metadata to downstream LLM workflows

What you bring

  • Have proven experience building a modern data foundation in a fast-paced environment
  • Know how to ship data products that blend robustness with rapid iteration and immediate value creation
  • Are excited by the complexity of real-world data (scientific jargon, edge cases, legacy formats)
  • Bonus: You have experience working with data in life science, pharma, or another regulated industry
  • Bonus: You have experience building keyword, semantic or hybrid search systems

How We Work

We are a flat organization with a small team rich in talent. We value face-to-face time at our HQ centrally located in Copenhagen. We are creative and passionate about the work we do. We operate with high trust, enjoy solving problems together, crazy ideas and shipping code without letting ego and status get in the way. This is a rare chance to build the next intelligence layer of scientific research, so join us.

How to Apply

If we see a match, we will reach out to schedule ~2 (technical) interviews. The last step is an onsite in our Copenhagen office where we will work on a project together, discuss ideas, and meet the team.

Apply to role