
Share your safe, non-confidential protein sequences and join us in creating a valuable open dataset to explore protein expression.

Recombinant protein expression is a foundation of modern biology, from academic discovery, and drug development, to bioproduct manufacturing.
Yet expression remains difficult to predict: a protein’s yield can depend on codon choice, genetic context, expression system, host state, and purification workflow. Most of this knowledge is still scattered across private experiments, failed attempts, and lab notebooks.
Ailurus Open Express is our effort to build a large-scale (at million-level) open dataset that makes protein expression more measurable, learnable, and useful for the community.
Ailurus will combine selected protein-coding DNAs with the Ailurus vec expression vector library to build a large combinatorial screen. We will measure relative soluble intracellular expression and, for selected samples, purified protein readouts using PandaPure.
Relative soluble-expression signals for coding sequences tested across Ailurus vec genetic contexts.
Estimated protein amount, measured by Bradford absorbance, for selected samples purified by PandaPure.
All released data will be shared under the Ailurus Open License v0.1.
Submit safe, non-confidential soluble proteins under 600 amino acids using the form below, or email your sequence list to support@ailurus.bio. Ailurus will select a diverse cohort, run soluble E. coli expression screening, and release the resulting data openly.
Help advance open science
We are especially looking for DNA synthesis, sequencing, biofoundry, and AIxBio ecosystem partners. Email us atsupport@ailurus.bio