News from this website, April 5: The authoritative international journal Nature Communications published online the latest collaborative research by the team of Professor Wang Jianxin of the School of Computer Science at Central South University, Professor Luo Feng of Clemson University in the United States, and Associate Researcher Xiao Chuangle of the Zhongshan Ophthalmic Center at Sun Yat-sen University, titled "De novo diploid genome assembly using long noisy reads". The paper proposes a new diploid assembly method based on third-generation sequencing data and presents the corresponding software, PECAT. Nie Fan and Ni Peng of the School of Computer Science, Central South University, are co-first authors of the paper; Professor Wang Jianxin of the same school is a co-corresponding author; and Central South University is the first affiliation. The research was supported by the National Key Research and Development Program, the National Natural Science Foundation of China, Xiangjiang Laboratory, and other projects.
Rapid advances in third-generation sequencing technologies (Oxford Nanopore sequencing and PacBio single-molecule real-time sequencing) yield longer and more accurate reads, bringing new opportunities and challenges to genome assembly research. For diploid assembly, however, third-generation reads still contain a relatively high rate of sequencing errors, and assembly algorithms struggle to distinguish those errors from genuine differences between haplotypes. The result is an assembly that mixes haplotypes, contains a large number of haplotype switch errors, and loses a large portion of the genetic information.
To address this limitation, Professor Wang Jianxin's team and collaborators, in their recently published Nature Communications paper, analyzed in depth how sequencing errors and haplotype differences each appear in third-generation long reads. Based on this analysis they proposed a long-read error correction algorithm that preserves haplotype difference information: it prevents haplotype differences from being removed as sequencing errors and keeps that information consistent, so that the haplotype consistency of the corrected reads can reach 99.4%. On this basis they designed a diploid assembly algorithm based on local haplotype clustering. The first round of assembly produces a haplotype-mixed assembly. In the second round, reads are aligned to the mixed assembly, the single nucleotide polymorphisms (SNPs) carried by each read are identified, overlaps between reads with inconsistent haplotypes are detected through local clustering, and those inconsistent overlaps are filtered out before assembling again to obtain a haplotype-resolved assembly. On multiple test datasets, PECAT, the method proposed in the paper, produces more contiguous haplotype assemblies. On the Nanopore R9 bull data, PECAT achieves a nearly haplotype-resolved assembly. On the Nanopore R10 data for the human HG002 sample, PECAT produces an assembly with a haplotype contiguity metric (phase block NG50) of 59.4/58.0 Mb.
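For readers unfamiliar with the contiguity metric quoted above, the following is a minimal sketch of how an NG50 value is commonly computed: blocks are sorted from longest to shortest, and NG50 is the length of the block at which the running total first reaches half of the estimated genome size. The function name `ng50` and the toy block lengths are illustrative assumptions, not taken from the paper or from PECAT, and the paper's exact evaluation procedure may differ.

```python
from typing import Iterable


def ng50(block_lengths: Iterable[int], genome_size: int) -> int:
    """Return the NG50 of a set of blocks (hypothetical helper).

    NG50 is the largest length L such that blocks of length >= L
    together cover at least half of the estimated genome size.
    Returns 0 when the blocks cover less than half of genome_size.
    """
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    covered = 0
    # Walk the blocks from longest to shortest, accumulating coverage.
    for length in sorted(block_lengths, reverse=True):
        covered += length
        if 2 * covered >= genome_size:
            return length
    return 0


# Toy numbers for illustration only; these are not data from the paper.
blocks = [50, 40, 30, 20, 10]
print(ng50(blocks, genome_size=200))  # prints 30
```

Unlike N50, which uses the total assembled length as the denominator, NG50 uses the estimated genome size, so values are comparable across assemblies of the same genome.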
PECAT assembly algorithm framework diagram
First review: Yu Tao. Second review: Deng Haodi. Third review: Li Yin.