-
Notifications
You must be signed in to change notification settings - Fork 37
Open
Labels
IOIssues related to reading and writing common third-party file formatsIssues related to reading and writing common third-party file formatsdocumentationImprovements or additions to documentationImprovements or additions to documentation
Description
We don't currently have any IO functionality for pedigree formats. These are usually tabular but can be quite variable. We should document how to read in some generic examples and add them to an sgkit style dataset.
Basic workflow:
- Read tabular format as pandas dataframe
- Assign sample identifiers to the
sample_id
variable - Assign parental columns to the
parent_id
variable - Optionally set coords for the
parents
dim (['Father', 'Mother']
,['Sire', 'Dam']
, etc.) - Use
parent_indices
to generate theparents
array and explain the 0-based indexing etc. - Do something interesting like calculating kinship.
jeromekelleher
Metadata
Metadata
Assignees
Labels
IOIssues related to reading and writing common third-party file formatsIssues related to reading and writing common third-party file formatsdocumentationImprovements or additions to documentationImprovements or additions to documentation