Project

General

Profile

Actions

Idea #13506

closed

Update Genotype conversion tool

Added by Abram Connelly almost 6 years ago. Updated almost 6 years ago.

Status:
Closed
Priority:
Normal
Assigned To:
Target version:
-
Start date:
06/15/2018
Due date:
Story points:
-

Description

The genotyping conversion tool, gtconv, was never properly tested or used. Update this tool to be able to convert genotyping data files (VCF, 23andMe, Ancestry, etc.). The output needs to be though about as the data is sparse enough that it doesn't warrant creating full FastJ and an intermediate file might be more appropriate.

At the very least, an SGLF file should be created so that it can be merged into the library and an option should be given to allow for band file conversion to be created so that it can be pumped into cgft (or other CGF creation tools).

Batching should be considered as the genotype files are small compared to the auxiliary files needed, such as the reference.

Documentation should be added as well as test cases to make sure things are working properly.

A short summary:

  • Creation of an intermediate format to facilitate conversion or determine it's unnecessary.
  • Creation of SGLF files from input genotype file (23andMe, Ancestry, VCF)
  • Investigate batching option and implement if it seems reasonable
  • Conversion to "band format" (from the intermediate format, say)
  • Documentation
  • Tests

Subtasks 1 (0 open1 closed)

Task #13614: Review branch of 13506Closed06/15/2018Actions
Actions

Also available in: Atom PDF