Ailurus Open Express, Season 1

Toward "1 Million" open data points for protein expression.

Share your safe, non-confidential protein sequences and join us in creating a valuable open dataset to explore protein expression.

Deadline

July 31, 2026

Scope

Soluble, E. coli expression

Protein length

Under 600 aa

License

Ailurus Open License v0.1

Our core thesis

Protein expression needs its "PDB".

Recombinant protein expression is a foundation of modern biology, from academic discovery and drug development to bioproduct manufacturing.

Yet expression remains difficult to predict: a protein’s yield can depend on codon choice, genetic context, expression system, host state, and purification workflow. Most of this knowledge is still scattered across private experiments, failed attempts, and lab notebooks.

Ailurus Open Express is our effort to build a large-scale open dataset, on the order of a million data points, that makes protein expression more measurable, learnable, and useful for the community.

How it works.

Ailurus will combine selected protein-coding DNAs with the Ailurus vec expression vector library to build a large combinatorial screen. We will measure relative soluble intracellular expression and, for selected samples, purified protein readouts using PandaPure.

Library screening

Relative soluble-expression signals for coding sequences tested across Ailurus vec genetic contexts.

PandaPure subset

Estimated protein amount, measured by Bradford absorbance, for selected samples purified by PandaPure.
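
As an illustration of how a Bradford absorbance reading is commonly turned into an estimated protein amount, here is a minimal sketch that fits a linear standard curve and inverts it. It assumes a linear response over the working range and a reference protein standard; the function names and the numbers in the usage note are hypothetical placeholders, not Ailurus' actual protocol or data.

```python
def fit_standard_curve(concentrations, absorbances):
    """Least-squares line: absorbance = slope * concentration + intercept."""
    n = len(concentrations)
    mean_x = sum(concentrations) / n
    mean_y = sum(absorbances) / n
    sxx = sum((x - mean_x) ** 2 for x in concentrations)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(concentrations, absorbances))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_concentration(absorbance, slope, intercept):
    """Invert the fitted line to estimate concentration from one reading."""
    return (absorbance - intercept) / slope


# Placeholder standards (mg/mL) and readings, made up for illustration only.
standards = [0.0, 0.25, 0.5, 1.0]
readings = [0.05, 0.175, 0.30, 0.55]
slope, intercept = fit_standard_curve(standards, readings)
```

Calling `estimate_concentration(reading, slope, intercept)` then gives the estimate for a sample, in the same units as the standards. Readings outside the range of the standards should be treated with caution, since the Bradford response is not linear everywhere.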

Open release

All released data will be shared under the Ailurus Open License v0.1.

How to participate

Contribute proteins to the open expression dataset.

Good fit

  • Soluble proteins or domains
  • Non-confidential sequences
  • Natural, engineered, or de novo designed proteins
  • Clear function and safety rationale
  • Under 600 amino acids

Not this season

  • Integral membrane proteins
  • Intrinsically disordered proteins
  • Confidential or restricted sequences
  • Proteins without interpretable functions
  • Proteins larger than 600 amino acids

DO NOT submit

  • Pathogens or pathogen factors
  • Toxins or virulence factors
  • Regulated or unsafe sequences
  • Anything intended for harmful biology
  • Proprietary sequences you do not have the right to share

Submit a protein sequence.

Submit safe, non-confidential soluble proteins under 600 amino acids using the form below, or email your sequence list to support@ailurus.bio. Ailurus will select a diverse cohort, run soluble E. coli expression screening, and release the resulting data openly.
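
Before sending a list, it can help to run a quick local pre-check for the length limit. The following is a minimal sketch, not an official Ailurus tool: the function name is hypothetical, and restricting input to the 20 standard one-letter amino-acid codes is our assumption rather than a stated rule.

```python
# Assumption: only the 20 standard one-letter amino-acid codes are accepted.
STANDARD_AA = set("ACDEFGHIKLMNPQRSTVWY")
MAX_LENGTH = 600  # the season's stated limit is "under 600 aa"


def precheck_sequence(seq: str) -> list[str]:
    """Return a list of problems; an empty list means the basic checks pass."""
    # Drop whitespace and line breaks, normalize case, and ignore a trailing stop "*".
    cleaned = "".join(seq.split()).upper().rstrip("*")
    problems = []
    if not cleaned:
        problems.append("empty sequence")
        return problems
    if len(cleaned) >= MAX_LENGTH:
        problems.append(f"length {len(cleaned)} is not under {MAX_LENGTH} aa")
    invalid = sorted(set(cleaned) - STANDARD_AA)
    if invalid:
        problems.append("non-standard residues: " + ", ".join(invalid))
    return problems
```

For example, `precheck_sequence("MKTAYIAK")` returns an empty list. This only covers length and alphabet; solubility, safety, and the right to share a sequence still need your own judgment.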

Open Express is selective. Submission does not guarantee selection, screening, expression success, private results, or purified protein delivery.

Help advance open science

Collaborators and sponsors are welcome.

We are especially looking for DNA synthesis, sequencing, biofoundry, and AIxBio ecosystem partners. Email us at support@ailurus.bio.

  • DNA synthesis
    Help expand the cohort and reduce cost per construct.
  • DNA sequencing
    Support NGS readout, quality control, and public data readiness.
  • Biofoundry
    Help validate, automate, or scale future open-expression batches.
  • AIxBio ecosystem
    Support dry-lab compute, data infrastructure, and open access.