Skip to content

MSRUN Data loading  #45

@jcmatese

Description

@jcmatese

FEATURE DESCRIPTION

Feature Inspiration

Load the example data into the MsRun, PeakGroup, and PeakData model classes (and perhaps Protocol)

Feature Description

Update the msrun model, removing the unique name attribute, and adding a researcher attribute (string, required).

Parse and load example peak data from Accucor export and a minimum of command Line arguments.

CLI will take a

  • xlsx file (accucor format, required) with pandas.read_excel?
  • date (default to today) (required)
  • protocol either a integer (protocol.id) or name; required; if non-integer, get_or_create [e.g. capture and insert a new protocol based on command line arguments]
  • researcher name

msrun.name is currently modeled as per-sample, with unique name. To avoid duplicating sample/date propagation into the old name field, we will just drop the name and make a unique constraint on the attribute, and see if that is viable/useable.

msrun.researcher will be added also, for further classification.

Alternatives Considered

file argument(s) : two .tsv files (original and corrected), or

unspecified protocol insertion : find/insert a generic LC-MS protocol, of the "please specify me later" variety?

Comment

Add any other context or screenshots about the feature request here.


ISSUE OWNER SECTION

Assumptions

Requirements

  • Accucor xlsx output dependent
  • sample names are unique with the file
  • sample names must already be in the database (preloaded; no insertions performed)
  • compound labels must already be in the database (preloaded; no insertions performed)

Limitations

  • will not load samples, compounds
  • no facility for association of raw data files (.raw or .mzxml)

Affected/Changed Components

  • models.py will be edited (msrun)
  • utils.pl with be augmented with a new loader class

DESIGN

GUI Change description

command line interface

Code Change Description (Pseudocode optional)

likely all new files and classes

Tests

A test should be planned for each requirement (above), where possible.

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions