Thursday, May 23, 2013

Extracting Results from Transnetyx for Upload to Mosaic

The following information is provided courtesy of Dr. Christian Schmedt of the Genomics Institute of the Novartis Research Foundation.

Tables with genotype data linked to specific mouse IDs can be uploaded to Mosaic via the "Genotyping" menu under "Animals".

Uploading genotypes requires a comma-separated values (CSV) file that lists the animal ID in column A and the genotypes in the adjacent columns (B, C, and D if needed). Each genotype must be written the same way as the long genotype name defined under Administration > Genotypes.
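As a rough illustration of the file layout described above, the following Python sketch writes one row per animal, with the animal ID first and the genotypes in the columns after it. The animal IDs and genotype names are hypothetical placeholders, and the sketch assumes no header row is needed; check both against your own Mosaic setup.

```python
import csv
import io


def write_genotype_csv(rows, handle):
    """Write one line per animal: ID in the first column, genotypes after it.

    rows is an iterable of (animal_id, [genotype, ...]) pairs.
    Assumption: no header row is written.
    """
    writer = csv.writer(handle, lineterminator="\n")
    for animal_id, genotypes in rows:
        writer.writerow([animal_id, *genotypes])


# Hypothetical animals and long genotype names, for illustration only.
example_rows = [
    ("M001", ["GeneA wt/wt", "GeneB tg/+"]),
    ("M002", ["GeneA wt/ko", "GeneB +/+"]),
]

buffer = io.StringIO()
write_genotype_csv(example_rows, buffer)
csv_text = buffer.getvalue()
print(csv_text)
```

To write to disk instead of a string buffer, pass a file opened with `open(path, "w", newline="")` as the handle.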

To derive a genotype upload file from Transnetyx results, the genotyping assay on the Transnetyx site first needs to be translated so that its results use the long genotype names defined in Mosaic:

clip_image002

Translation is easiest in the "full matrix" mode:

clip_image004

clip_image006

(matrix shown is incomplete)

Translation will be reflected in the way the results are displayed on the Transnetyx website:

clip_image008

To extract this information and process it for Mosaic genotype upload, export the data as an XML file:

clip_image010

Switch from "XML editor" to "Excel" to open the document:

clip_image012

clip_image014

clip_image016

clip_image018

Then open it as an XML table in Excel:

clip_image020

Click "OK" on the next two dialogs:

clip_image022

clip_image024

The resulting Excel document looks like this:

clip_image026

Column "D" contains the animal ID (the column may differ between Transnetyx users, but the animal ID will be in one of the columns). Column "M" contains the translated result.

Copy columns D and M to a new Excel document.

clip_image028
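For readers who prefer to script this step, the same two fields can be pulled straight from the XML export. The sketch below is only an outline: the element names `Sample`, `AnimalID`, and `TranslatedResult` are hypothetical stand-ins, because the actual Transnetyx XML schema is not shown here. Inspect your own export and substitute the real names.

```python
import xml.etree.ElementTree as ET

# Hypothetical export layout; the real Transnetyx element names may differ.
SAMPLE_XML = """<Results>
  <Sample>
    <AnimalID>M001</AnimalID>
    <Probe>P1</Probe>
    <TranslatedResult>GeneA wt/wt,GeneB tg/+</TranslatedResult>
  </Sample>
  <Sample>
    <AnimalID>M001</AnimalID>
    <Probe>P2</Probe>
    <TranslatedResult>GeneA wt/wt,GeneB tg/+</TranslatedResult>
  </Sample>
  <Sample>
    <AnimalID>M002</AnimalID>
    <Probe>P1</Probe>
    <TranslatedResult>GeneA wt/ko,GeneB +/+</TranslatedResult>
  </Sample>
</Results>"""


def extract_id_and_result(xml_text, row_tag="Sample",
                          id_tag="AnimalID", result_tag="TranslatedResult"):
    """Return (animal_id, translated_result) pairs, one per result row."""
    root = ET.fromstring(xml_text)
    pairs = []
    for row in root.iter(row_tag):
        animal_id = row.findtext(id_tag)
        result = row.findtext(result_tag)
        # Skip rows that lack either field rather than guessing a value.
        if animal_id is not None and result is not None:
            pairs.append((animal_id.strip(), result.strip()))
    return pairs


pairs = extract_id_and_result(SAMPLE_XML)
print(pairs)
```

Like the copied columns in Excel, the returned list still contains one entry per probe-set row, so repeated animal IDs are expected at this stage.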

In the new document, use the "Text to Columns" tool to separate the translated results into two columns:

clip_image030

Choose "Delimited":

clip_image032

Select "Comma" as the delimiter (this may be "Space" or something else, depending on how the translations were defined for the Transnetyx assay):

clip_image034

Then click "Finish":

clip_image036

The result is two genotype columns:

clip_image038
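The "Text to Columns" step amounts to splitting each translated result on the chosen delimiter. A minimal Python sketch of the same idea follows; the sample values are hypothetical, and the comma delimiter is an assumption that must match how the translations were defined for your assay.

```python
def split_translated_result(result, delimiter=","):
    """Split one translated result into separate genotype values."""
    return [part.strip() for part in result.split(delimiter)]


# Hypothetical (animal ID, translated result) pairs.
rows = [
    ("M001", "GeneA wt/wt,GeneB tg/+"),
    ("M002", "GeneA wt/ko, GeneB +/+"),
]

# Animal ID stays in the first column; genotypes fill the columns after it.
split_rows = [
    [animal_id, *split_translated_result(result)]
    for animal_id, result in rows
]
print(split_rows)
```

If the delimiter is a space, genotype names that themselves contain spaces would be split apart, so a delimiter that never occurs inside a genotype name is the safer choice.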

Note that each ID is represented by four rows (one for each Transnetyx probe-set).

Before saving the file, the duplicates need to be removed with the "Remove Duplicates" tool:

clip_image040

Expand the selection beyond column A, which contains the animal IDs:

clip_image042

Deselect the columns with genotypes, so that duplicates are identified by animal ID alone:

clip_image044

clip_image046
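The de-duplication performed above can be sketched in Python as keeping the first row seen for each animal ID and dropping the rest, comparing on the ID column only. The rows are hypothetical, and the sketch assumes every row for a given ID carries the same genotypes, as in the example screenshots.

```python
def remove_duplicate_ids(rows):
    """Keep only the first row for each animal ID (the first column)."""
    seen = set()
    unique = []
    for row in rows:
        animal_id = row[0]
        if animal_id in seen:
            continue  # a later probe-set row for an ID already kept
        seen.add(animal_id)
        unique.append(row)
    return unique


# Hypothetical data: M001 appears four times, once per probe-set.
rows = [
    ["M001", "GeneA wt/wt", "GeneB tg/+"],
    ["M001", "GeneA wt/wt", "GeneB tg/+"],
    ["M001", "GeneA wt/wt", "GeneB tg/+"],
    ["M001", "GeneA wt/wt", "GeneB tg/+"],
    ["M002", "GeneA wt/ko", "GeneB +/+"],
]

unique_rows = remove_duplicate_ids(rows)
print(unique_rows)
```

If rows for the same ID could ever disagree, it would be worth flagging those IDs for review rather than silently keeping the first row.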

Click "OK" and save the file as CSV:

clip_image048

The saved file is ready for upload into Mosaic.