Part 2: manual curation

gempipe recon produced a draft pan-GSMM: before proceeding with the derivation of strain-specific GSMMs (gempipe derive), the draft pan-GSMM must be curated. Its manual curation consists of several tasks, such as:

  • ensuring the stoichiometric consistency of the model.

  • removing eventual erroneous energy-generating cycles (EGCs).

  • adding missing reactions, finding unmodeled metabolic genes on the functional annotation table.

  • adapting the biomass assembly reaction, for example removing strain-specific biomass precursors.

  • and so on…

The Gempipe API includes handy functions to speed up the manual curation, like for example check_reactants, which indicates the blocked precursors of a reaction, or perform_gapfilling, which suggests reactions to include to enable the biosynthesis of a particular metabolite. Tutorials are available to show the usage of some of the Gempipe API functions:

The provided API is not aimed to replace the community effort Memote, but just to provide a quick and convenient way to check the main sanity standards and perform gap-fillings before going on deriving strain-specific GSMMs with gempipe derive.

Warning! An additional command line program gempipe autopilot is provided, which internally calls gempipe recon and gempipe derive, linking them together performing an automated gap-filling on the draft pan-GSMM, as a (discouraged) alternative to the manual curation.