VAT/dataSets
From GersteinInfo
(Difference between revisions)
												
			
		| Line 3: | Line 3: | ||
| __TOC__ | __TOC__ | ||
| - | ==  | + | == Data sets == | 
| === 1000 Genomes Project === | === 1000 Genomes Project === | ||
| Line 12: | Line 12: | ||
|   - Data files: |   - Data files: | ||
| + |      - Source: pilot_data, release: 2010_07, FTP:  ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/ | ||
| + |      - Indels | ||
| + |          - [ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/indels/CEU.low_coverage.2010_07.indel.genotypes.vcf.gz CEU.low_coverage.2010_07.indel.genotypes.vcf.gz] | ||
| + |          - [ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/indels/JPTCHB.low_coverage.2010_07.indel.genotypes.vcf.gz JPTCHB.low_coverage.2010_07.indel.genotypes.vcf.gz] | ||
| + |          - [ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/indels/YRI.low_coverage.2010_07.indel.genotypes.vcf.gz YRI.low_coverage.2010_07.indel.genotypes.vcf.gz] | ||
| + |      - SNPs | ||
| + |          - [ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/snps/CEU.low_coverage.2010_07.genotypes.vcf.gz CEU.low_coverage.2010_07.genotypes.vcf.gz] | ||
| + |          - [ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/snps/CHBJPT.low_coverage.2010_07.genotypes.vcf.gz CHBJPT.low_coverage.2010_07.genotypes.vcf.gz] | ||
| + |          - [ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/snps/YRI.low_coverage.2010_07.genotypes.vcf.gz YRI.low_coverage.2010_07.genotypes.vcf.gz] | ||
| + |  - Annotation file | ||
| + |      - [ftp://ftp.sanger.ac.uk/pub/gencode/release_3b/gencode.v3b.annotation.NCBI36.gtf.gz GENCODE (version 3b, hg18)] using CDS elements where ''gene_type = protein_coding'' and ''transcript_type = protein_coding'' | ||
| + |  - Results | ||
| + |      - [http://dynamic.gersteinlab.org/people/lh372/dev/vat_cgi?mode=process&dataSet=1000genomes_lowCoverage VAT] | ||
| + | |||
| + | <br> | ||
| + | |||
| + | <center>[[#top|Top]]</center> | ||
| + | |||
| + | ==== Low coverage samples from the 1000 Genomes Pilot Project ==== | ||
| + | |||
| + |  - Data files: | ||
| + | |||
|       - Indels |       - Indels | ||
|           - ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/indels/CEU.low_coverage.2010_07.indel.genotypes.vcf.gz |           - ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/indels/CEU.low_coverage.2010_07.indel.genotypes.vcf.gz | ||
Revision as of 18:17, 8 March 2011
| Contents | 
Data sets
1000 Genomes Project
Low coverage samples from the 1000 Genomes Pilot Project
- Data files:
    - Source: pilot_data, release: 2010_07, FTP:  ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/
    - Indels
        - CEU.low_coverage.2010_07.indel.genotypes.vcf.gz
        - JPTCHB.low_coverage.2010_07.indel.genotypes.vcf.gz
        - YRI.low_coverage.2010_07.indel.genotypes.vcf.gz
    - SNPs
        - CEU.low_coverage.2010_07.genotypes.vcf.gz
        - CHBJPT.low_coverage.2010_07.genotypes.vcf.gz
        - YRI.low_coverage.2010_07.genotypes.vcf.gz
- Annotation file
    - GENCODE (version 3b, hg18) using CDS elements where gene_type = protein_coding and transcript_type = protein_coding
- Results
    - VAT
Low coverage samples from the 1000 Genomes Pilot Project
- Data files:
    - Indels
        - ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/indels/CEU.low_coverage.2010_07.indel.genotypes.vcf.gz
        - ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/indels/JPTCHB.low_coverage.2010_07.indel.genotypes.vcf.gz
        - ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/indels/YRI.low_coverage.2010_07.indel.genotypes.vcf.gz
    - SNPs
        - ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/snps/CEU.low_coverage.2010_07.genotypes.vcf.gz
        - ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/snps/CHBJPT.low_coverage.2010_07.genotypes.vcf.gz
        - ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/pilot_data/release/2010_07/low_coverage/snps/YRI.low_coverage.2010_07.genotypes.vcf.gz
- Annotation file
    - GENCODE (version 3b, hg18) using CDS elements where gene_type = protein_coding and transcript_type = protein_coding
- Results
    - VAT
						
						
		