2015高通量測序專題培訓:重測序分析.pdf
《2015高通量測序專題培訓:重測序分析.pdf》由會員分享,可在線閱讀,更多相關《2015高通量測序專題培訓:重測序分析.pdf(81頁珍藏版)》請在匯文網(wǎng)上搜索。
1、 蘇州貝斯派生物科技有限公司 Base pair Genome re-sequencing data analysis:from reads to variants Chao Li,PhD SmartQuerier Biotechnology 蘇州貝斯派生物科技有限公司 Base pair Types of Genetic Variation Single Nucleotide Aberrations Single Nucleotide Polymorphisms(SNPs)-mutations shared amongst a population Single Nucleotide Var
2、iations(SNVs)-private mutations Short Insertions or Deletions(indels)Copy Number Variations(CNVs)Structural Variations(SVs)Large insertions and deletions Inversion Translocation 蘇州貝斯派生物科技有限公司 Base pair Re-sequencing strategy Whole genome sequencing vs Whole exome sequencing Cost Coverage Depth Varia
3、nts type 蘇州貝斯派生物科技有限公司 Base pair Analysis Outline Quality control(QC)Mapping SNP and small INDEL calling Structural variation calling Copy number variation calling Variants function annotation 蘇州貝斯派生物科技有限公司 Base pair Analysis Outline Quality control(QC)Mapping SNP and small INDEL calling Structural
4、variation calling Copy number variation calling Variants function annotation 蘇州貝斯派生物科技有限公司 Base pair FASTQ format Sample_GGCTAC_L004_R1_001.fastq.gz Sample_GGCTAC_L004_R2_001.fastq.gz header line:SEQUENCE_ID sequence line line beginning with+quality score line Illumina raw reads are stored in fastq
5、format Paired-end reads are stored in matched files 4 lines per read 蘇州貝斯派生物科技有限公司 Base pair Illumina identifier HWUSI-EAS100R the unique instrument name 6 flowcell lane 73 tile number within the flowcell lane 941 x-coordinate of the cluster within the tile 1973 y-coordinate of the cluster within th
6、e tile#0 index number for a multiplexed sample(0 for no indexing)/1 the member of a pair,/1 or/2(paired-end or mate-pair reads only)HWUSI-EAS100R:6:73:941:1973#0/1 蘇州貝斯派生物科技有限公司 Base pair Illumina identifier(1.4+)EAS139 the unique instrument name 136 the run id FC706VJ the flowcell id 2 flowcell lan
7、e 2104 tile number within the flowcell lane 15343 x-coordinate of the cluster within the tile 197393 y-coordinate of the cluster within the tile 1 the member of a pair,1 or 2(paired-end or mate-pair reads only)Y Y if the read is filtered,N otherwise 18 0 when none of the control bits are on,otherwis
8、e it is an even number ATCACG index sequence EAS139:136:FC706VJ:2:2104:15343:197393 1:Y:18:ATCACG http:/en.wikipedia.org/wiki/FASTQ_format 蘇州貝斯派生物科技有限公司 Base pair Multiplex A unique barcode is attached to sequence fragments from the same sample.Different samples can be mixed for sequencing to improv
- 配套講稿:
如PPT文件的首頁顯示word圖標,表示該PPT已包含配套word講稿。雙擊word圖標可打開word文檔。
- 特殊限制:
部分文檔作品中含有的國旗、國徽等圖片,僅作為作品整體效果示例展示,禁止商用。設計者僅對作品中獨創(chuàng)性部分享有著作權。
- 關 鍵 詞:
- 2015 通量 專題 培訓 重測序 分析