GENSCAN 1.0	Date run: 3-Nov-116	Time: 04:19:32

Sequence gi568815578f:49715008_49987933 : 272926 bp : 46.92% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6629   6673   45  1  0   70   94    28 0.200   2.28
 1.02 Intr +  24298  24423  126  0  0   89   60    24 0.137   0.58
 1.03 Intr +  42870  42986  117  2  0  127   89   -40 0.199   0.66
 1.04 Intr +  46159  46205   47  0  2  103   96     6 0.055   0.11
 1.05 Intr +  81700  81732   33  1  0  120   80    17 0.000   1.44
 1.06 Intr +  97843  97941   99  1  0   87   64    97 0.861   6.43
 1.07 Intr + 100001 100182  182  2  2  101   36   163 0.936  11.91
 1.08 Intr + 108054 108134   81  1  0   98   97    15 0.800   3.01
 1.09 Intr + 114769 114984  216  2  0   53   39   150 0.594   5.08
 1.10 Intr + 115094 115324  231  0  0   26    7   205 0.582   3.94
 1.11 Intr + 115703 115779   77  0  2   70   38    69 0.507  -0.67
 1.12 Term + 115932 116063  132  2  0   16   52   179 0.452   5.19
 1.13 PlyA + 118432 118437    6                               1.05

 2.02 PlyA - 118906 118901    6                               1.05
 2.01 Sngl - 130551 130156  396  0  0   78   54   253 0.959  17.15
 2.00 Prom - 133237 133198   40                              -6.36

 3.00 Prom + 137630 137669   40                              -3.16
 3.01 Init + 139911 140134  224  2  2   78  -13   153 0.776   0.26
 3.02 Intr + 140431 140574  144  1  0   95  108   189 0.958  21.00
 3.03 Intr + 144493 144596  104  1  2  110   61   -12 0.216  -1.88
 3.04 Intr + 149721 149837  117  1  0   84  111    35 0.794   5.84
 3.05 Intr + 159698 159814  117  0  0  112   89   149 0.486  17.84
 3.06 Intr + 168839 169059  221  0  2   89   49   607 0.915  54.92
 3.07 Intr + 171745 171891  147  0  0   69   59   322 0.846  27.83
 3.08 Term + 172732 172929  198  0  0  113   48   245 0.760  20.40
 3.09 PlyA + 176410 176415    6                               1.05

 4.03 PlyA - 180699 180694    6                               1.05
 4.02 Term - 191838 190612 1227  0  0  126   41  1451 0.999 135.82
 4.01 Init - 193483 193148  336  1  0   33  100   473 0.379  40.28
 4.00 Prom - 194965 194926   40                              -7.76

 5.00 Prom + 195266 195305   40                              -3.16
 5.01 Init + 196704 196845  142  2  1   68   77    52 0.510   2.40
 5.02 Term + 200479 200705  227  2  2   59   43   208 0.718  10.34
 5.03 PlyA + 201178 201183    6                               1.05

 6.00 Prom + 203633 203672   40                              -6.76
 6.01 Init + 205694 205767   74  1  2   63   68    70 0.282   1.05
 6.02 Intr + 213718 213808   91  1  1  117  117     9 0.771   6.70
 6.03 Intr + 221271 221528  258  2  0   69   49   214 0.014  13.26
 6.04 Intr + 226576 226704  129  0  0   84   48   128 0.501   9.29
 6.05 Intr + 230390 230481   92  1  2   63   94    85 0.528   5.39
 6.06 Intr + 231129 231243  115  0  1   92   89    56 0.998   6.55
 6.07 Intr + 234241 234348  108  0  0   46  103   125 0.994  10.28
 6.08 Intr + 237069 237104   36  2  0  100   41    87 0.011   3.66
 6.09 Intr + 241796 242230  435  1  0   74  -56   582 0.002  36.18
 6.10 Term + 242558 243025  468  1  0  -32   46   422 0.014  20.97
 6.11 PlyA + 243065 243070    6                               1.05

 7.00 Prom + 248644 248683   40                              -3.76
 7.01 Init + 255746 255880  135  1  0   84   93    53 0.477   5.68
 7.02 Intr + 266511 266593   83  2  2   94   77    62 0.878   4.14
 7.03 Intr + 267721 267879  159  1  0   57   46   151 0.924   6.90
 7.04 Intr + 268076 268134   59  2  2  102   96   127 0.740  13.43
 7.05 Intr + 268817 269344  528  0  0  137   92   512 0.827  49.41

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 237069 237134   66  2  0  100   53   167 0.955  12.34
S.002 Sngl + 242615 243025  411  1  0   74   46   362 0.976  26.89

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:49715008_49987933|GENSCAN_predicted_peptide_1|461_aa
MSPGHMVYKRGFGEQALRPSGGILRNLGGGSHVPTALLGTVHPKPGPTGAASGMAEETGS
CSVTRLEYSGAIMAHCSLNLLGSSNPPSSVPKQLGQVQAILLTPPPDSWEHRCFSLQNGN
NASGSRKQKRVLLAPRLRTRWSWKLRRMGEKMAEEERFPNTTHEGFNVTLHTTLVVTTKL
VLPTPGKPILPVQTGEQAQQEEQSSGMTIFFSLLVLAICIILVHLLIRYRLHFLPESVAV
VSLVRREAGQTQVMRSANSLSEGKVKKIMGDGKKGVSSGSSETGSKSPLKRIQEQSPQKR
GQPPKNEKNVTIVESAKQAISVCYQAITKKLKICEEETGSTSIRAADSTAVNGSITDKKM
GFGGLGLMRSGIVSNLLKMGHTVTVWNRTAKKREVGNAAKMMLIVNIVQGSFIAMITEKD
LCLAIALGNAVNSLTPMAAAANQVYKRAKALDQSNNMSAVS

>gi568815578f:49715008_49987933|GENSCAN_predicted_CDS_1|1386_bp
atgtctccaggtcacatggtttacaagagaggatttggtgaacaggccctgaggccttct
gggggcatccttagaaacttaggtggaggcagccacgtccccacagctttgctgggcaca
gtacaccccaaaccagggcccactggagcggcatctgggatggccgaggagacaggatct
tgctctgtcaccaggctggagtacagtggtgcaatcatggctcactgcagcctcaatctc
ttgggctcaagcaatcctcctagctcagtccccaagcagctaggacaggttcaagccatc
ctcctgactccacctcctgatagctgggagcacagatgtttctcattgcaaaatgggaat
aatgccagcggaagccggaagcaaaagcgggtcctgctagccccgcggctccgaactcgg
tggtcctggaagctccgcaggatgggggagaagatggcggaagaggagaggttccccaat
acaactcatgagggtttcaatgtcaccctccacaccaccctggttgtcacgacgaaactg
gtgctcccgacccctggcaagcccatcctccccgtgcagacaggggagcaggcccagcaa
gaggagcagtccagcggcatgaccattttcttcagcctccttgtcctagctatctgcatc
atattggtgcatttactgatccgatacagattacatttcttgccagagagtgttgctgtt
gtttctttagtgaggagagaagcaggccaaactcaggtgatgagaagtgcaaacagcctg
tctgaaggaaaagtgaagaagatcatgggagatggaaagaagggggtgtcttcgggctct
tcagagacaggctccaaatcccctctgaaaagaatccaagagcaaagtccccagaagcgg
ggtcagcccccaaagaatgagaagaatgtcaccatcgtggagtccgccaaacaagccatc
tctgtctgttaccaggcaatcacaaagaagttgaaaatatgtgaagaggaaactggttcc
acctccatccgggcagctgacagcacggccgtgaatggcagcatcacagacaaaaagatg
ggatttgggggccttggtctcatgagaagtggaatcgtctctaacttgctaaaaatgggt
cacacagtgactgtctggaaccgcactgccaagaaacgtgaagttggcaacgcagccaag
atgatgctgattgtaaacatagtccaagggagcttcatagccatgatcactgagaaggat
ctctgcttagccattgcgctgggcaatgcggtcaactctctgactcccatggcagctgca
gccaaccaggtgtacaaaagagccaaggcactggaccagtccaacaatatgtccgctgtg
tcctga

>gi568815578f:49715008_49987933|GENSCAN_predicted_peptide_2|131_aa
MDGPEKVRKAPVGELKETKVLGHMSPSSQDTVYPLYIYQSVSRKTSQCIICEPLNITSAI
FLGKAPIYGKHWLSEPYCLKPLKQPIHHVCSSPEVRHCSPASTEHNQEGAELLLLLLLNY
TQAFKELALQT

>gi568815578f:49715008_49987933|GENSCAN_predicted_CDS_2|396_bp
atggatgggcctgagaaggttagaaaggccccagtgggggagctgaaagaaacaaaagtc
ctggggcacatgtccccgtcgtcccaggacacagtctatcctctttacatctatcagtct
gtgtccaggaagacgagtcaatgcatcatctgtgagccacttaacatcacttcagccatc
ttcctgggcaaagcacccatttatggcaaacactggctcagtgagccatactgcctgaaa
ccattaaagcagccaattcatcatgtctgctcctctccagaagtcaggcactgcagtcca
gcttccactgagcacaaccaggaaggagcagagctgctgctgctgctgctgctaaattat
acccaagcttttaaggaacttgccttacaaacatga

>gi568815578f:49715008_49987933|GENSCAN_predicted_peptide_3|423_aa
MASVFLSAVYATHPQSTLGLLLLFDTFLDLANLSAGWGRSRTAFQGAGGEGQSRGQMQEL
MGHLALAALTRGTGRFAFGSLISAVDPVATIAIFNALHVDPVLNMLVFGESILNDAVSIV
LTNTEEKYAPGSCCHLSLVIRIIPGFEEVYFINDHLQIVIYVLKHIDLRKTPSLEFGMMI
IFAYLPYGLAEGISLSGIMAILFSGIVMSHYTHHNLSPVTQILMQQTLRTVAFLCGLRGA
IPYALSLHLDLEPMEKRQLIGTTTIVIVLFTILLLGGSTMPLIRLMDIEDAKAHRRNKKD
VNLSKTEKMGNTVESEHLSELTEEEYEAHYIRRQDLKGFVWLDAKYLNPFFTRRLTQETP
HTKHPVASPPFHGPAPDALTAWLCLSTQDLHHGRIQMKTLTNKWYEEVRQGPSGSEDDEQ
ELL

>gi568815578f:49715008_49987933|GENSCAN_predicted_CDS_3|1272_bp
atggcctcagtgtttctctcggctgtctacgccactcacccccaaagcacactggggttg
ctgctgctttttgacactttcttagatcttgcaaatctgagtgcaggctggggtcggtca
cggacagcattccaaggggctggtggcgaggggcagagcagaggtcagatgcaggagctc
atgggccatttggctctagctgctctgacccgtggaactgggcgttttgcgtttggctcc
ctaatatctgctgtcgatccagtggccactattgccattttcaatgcacttcatgtggac
cccgtgctcaacatgctggtctttggagaaagtattctcaacgatgcagtctccattgtt
ctgaccaatacagaagaaaaatatgctcctgggagctgttgtcacctttccttggttatc
agaatcatacctggttttgaagaggtatacttcataaacgatcatctccaaattgtcatt
tacgtgctgaagcatattgacttgaggaaaacgccttccttggagtttggcatgatgatc
atttttgcttatctgccttatgggcttgcagaaggaatctcactctcaggcatcatggcc
atccttttctcaggcatcgtgatgtcccactacacgcaccataacctctccccagtcacc
cagatcctcatgcagcagaccctccgcaccgtggccttcttatgtggcctgcggggagcc
atcccctatgccctgagcctacacctggacctggagcccatggagaagcggcagctcatc
ggcaccaccaccatcgtcatcgtgctcttcaccatcctgctgctgggcggcagcaccatg
cccctcattcgcctcatggacatcgaggacgccaaggcacaccgcaggaacaagaaggac
gtcaacctcagcaagactgagaagatgggcaacactgtggagtcggagcacctgtcggag
ctcacggaggaggagtacgaggcccactacatcaggcggcaggaccttaagggcttcgtg
tggctggacgccaagtacctgaaccccttcttcactcggaggctgacgcaggagacaccc
cacacaaaacacccagtagcatcccctcccttccatggccctgcccctgacgccctgacg
gcttggttgtgtctctcgacccaggacctgcaccacgggcgcatccagatgaaaactctc
accaacaagtggtacgaggaggtacgccagggcccctccggctccgaggacgacgagcag
gagctgctctga

>gi568815578f:49715008_49987933|GENSCAN_predicted_peptide_4|520_aa
MGKPSSMDTKFKDDLFRKYVQFHESKVDTTTSRQRPGSDECLRVAASTLLSLHKVDPFYR
FRLIQFYEVVESSLRSLSSSSLRALHGAFSMLETVGINLFLYPWKKEFRSIKTYTGPFVY
YVKSTLLEEDIRAILSCMGYTPELGTAYKLRELVETLQVKMVSFELFLAKVECEQMLEIH
SQVKDKGYSELDIVSERKSSAEDVRGCSDALRRRAEGREHLTASMSRVALQKSASERAAK
DYYKPRVTKPSRSVDAYDSYWESRKPPLKASLSLRKEPVATDVGDDLKDEIIRPSPSLLT
MASSPHGSPDVLPPASPSNGPALLRGTYFSTQDDVDLYTDSEPRATYRRQDALRPDVWLL
RNDAHSLYHKRSPPAKESALSKCQSCGLSCSSSLCQRCDSLLTCPPASKPSAFPSKASTH
DSLAHGASLREKYPGQTQGLDRLPHLHSKSKPSTTPTSRCGFCNRPGATNTCTQCSKVSC
DACLSAYHYDPCYKKSELHKFMPNNQLNYKSTQLSHLVYR

>gi568815578f:49715008_49987933|GENSCAN_predicted_CDS_4|1563_bp
atggggaagcccagttcaatggatactaaattcaaggatgacttatttcggaagtacgtg
cagttccatgagagcaaagtggataccaccaccagcaggcagcggcctggcagcgatgag
tgcctgcgggtggcagcctcaaccctgctcagcctgcacaaggtggatcccttttatcga
ttccggctgatccagttctatgaggtggtggagagctccttgcgctcgctcagctcctct
agcctgcgggctctgcacggcgccttcagcatgctggagacggtgggcatcaacctcttc
ctctacccgtggaagaaggaattcagaagcatcaagacctacacgggcccttttgtttat
tatgtcaagtcgacattactggaagaggacatccgagccatcctgagctgcatgggctac
acacctgagctgggcactgcatacaagctcagagagctcgtggagaccctccaggtgaag
atggtctcctttgagctctttctggccaaagtcgagtgtgagcagatgctagaaatccac
tcacaagtgaaggacaagggctactccgagctggacattgtgagcgagcgcaagagcagt
gcagaggatgtgcgcggctgctcggacgccctgcggcggcgggcagagggccgggagcac
ctgacggcctccatgtcacgagtggcactccagaagtcggccagcgagcgggcggccaag
gactactacaagccccgcgtgaccaagccctcgaggtcagtggatgcctatgacagctac
tgggagagccggaagccacccctgaaggcctcattgagtcttcggaaggagcctgtggca
acggatgtgggggacgacctcaaggatgagatcatccgcccatccccttcgctgctgacc
atggccagctccccccacggcagcccggatgtgcttccacccgcctcccccagcaacggc
ccggccctgctgcgcggtacctacttctccactcaggatgacgtggatctgtacacagac
tctgaacccagggccacctaccgtcggcaggatgctctgcggccggatgtgtggctgctc
agaaacgatgcccactccctctaccacaagcgctcgccccctgccaaagagtccgccctc
tccaagtgccaaagctgcgggctgtcctgcagctcctccctctgccagcgctgtgacagc
ctgctcacctgtcctccagcttccaagcccagcgccttccccagcaaggcctcgactcat
gacagcctggcccacggggcatctctgcgggagaagtacccaggccagactcagggcctc
gaccgcctcccgcaccttcactccaaatccaagccctccaccacgcccacttcccgctgt
ggcttctgcaaccgcccaggcgccaccaacacctgcacccagtgttcaaaagtctcatgt
gacgcctgcctcagcgcttaccattatgacccctgctacaaaaagagtgagctgcacaag
ttcatgcccaacaaccagctgaactacaagtccacccagctctcccatctcgtgtacaga
tag

>gi568815578f:49715008_49987933|GENSCAN_predicted_peptide_5|122_aa
MDAAKDVCIHDAITLVSHWYLPRILGDQLESKETGSERWHEVRQVAQGVLRPVPGGQQRP
GNTRPGNDVTMRRARREGGPRLRAGSGKRSMPAADGDYNPEAEDKAEGRRARTKPAEPHF
GA

>gi568815578f:49715008_49987933|GENSCAN_predicted_CDS_5|369_bp
atggatgctgcaaaggatgtctgcattcacgatgccatcactttagtttctcattggtac
ctcccaagaatcctgggagatcagctggagagcaaagaaacgggctcagaaagatggcat
gaggtacgacaagtagctcaaggggtactgagaccagtacccgggggccagcagcgaccc
ggaaacactcggcccggaaatgatgtcaccatgaggcgggcccgaagagagggtggacca
cggctgcgcgctggctccgggaagcggtcgatgcccgcggccgacggagactacaaccca
gaggcggaggacaaagcggaaggccgaagagcgaggacgaaaccggcggaaccgcacttt
ggagcctaa

>gi568815578f:49715008_49987933|GENSCAN_predicted_peptide_6|601_aa
MVITGVWLRLCLALGARSAPCTKDGQCTVSHDGQRQEILRELERIKEPVFSSQGPLACVR
GAPRGGPPASPAPNRSPPREPRPLGLLLIGRRCAAQSGSKMAAQQRDCGGAAQLAGPAAE
ADPLGRFTCPVCLEVYEKPVQECLKPKKPVCGVCRSALAPGVRAVELERQIESTETSCHG
CRKNIRSHVATCSKYQNYIMEGVKATIKDASLQPRNVPNRYTFPCPYCPEKNFDQEGLVE
HCKLFHSTDTKSVVCPICASMPWGDPNYRSANFREHIQRRHRFSYDTFVDYDVDEEDMMN
QAPSYGARPVSSMVSVYAGARGSGSRISESHSTSFWGGMGSGDLAGGMAGDLAGMGGIQN
EKETMQSLNDHLASYLDRMRSLETKNWKPESKIREHLEKKGPQVRDWSHHFKTIEDLRAQ
IFTNTVDNACIVLQINNACLAADDFTIEENTTEVTTQSTEVGTAEMTHRTETCSPVLGDR
PGLHEKSEGQLGEQPEGGGGPLFPADGTAQWDTAVPGVRAGTDPGRGTVPGPGVGSPAEH
KVKLEAEITTYCRLLEDSEDFNLGDALDSRNSMQTIQKTTTRQTVDGKVVSETNDTKVLR
H

>gi568815578f:49715008_49987933|GENSCAN_predicted_CDS_6|1806_bp
atggttatcacaggagtgtggctgaggctgtgtttggccctgggtgccaggtctgcaccc
tgcactaaggatgggcaatgtacagtaagccatgatgggcagaggcaggaaatacttaga
gaactggaaagaataaaggagccggtgttctcttcgcagggcccgctcgcttgcgtcaga
ggggccccgaggggcggcccacccgctagccccgcccccaaccgctcaccgccccgcgag
ccccgccccctcggcctcctcctcatcggccgccgttgcgcggcgcagagcggcagcaag
atggcggcgcaacagcgggactgcgggggtgctgcgcagctggcggggccggcggcggag
gctgaccccctaggacgcttcacgtgtcccgtgtgcttagaggtgtacgagaagccggta
caggaatgtctgaagccgaagaagcctgtctgtggggtgtgtcgcagcgctctggcacct
ggcgtccgagccgtggagctcgagcggcagatcgagagcacagagacttcttgccatggc
tgccgtaagaatatccggtcccacgtggctacttgttccaaataccagaattacatcatg
gaaggtgtgaaggccaccattaaggatgcatctcttcagccaaggaatgttccaaaccgt
tacacctttccttgtccttactgtcctgagaagaactttgatcaggaaggacttgtggaa
cactgcaaattattccatagcacggataccaaatctgtggtttgtccgatatgtgcctcg
atgccctggggagaccccaactaccgcagcgccaacttcagagagcacatccagcgccgg
caccggttttcttatgacacttttgtggattatgatgttgatgaagaggacatgatgaat
caggcgcccagctatggcgcccggccggtcagcagtatggttagcgtctatgcaggtgcc
cggggctctggttcccggatctccgagtcccactccaccagcttctggggcggcatgggg
tccggggacctggccggggggatggctggggatctggcaggaatgggaggcatccagaac
gagaaggagaccatgcaaagcctgaacgaccatctggcctcctacctggacagaatgagg
agcctggagaccaagaactggaagccggagagcaaaatccgggagcacctggagaagaag
ggaccccaggtcagagactggagccatcacttcaaaaccatcgaggacctgagggctcag
atcttcacaaatactgtggacaatgcctgcattgttctgcagatcaacaatgcctgtctt
gctgctgatgactttacaattgaggagaacactacagaagtcaccacgcagtccaccgag
gttggaactgctgagatgactcacagaactgagacatgcagtccagtccttggagatcga
cctggactccatgagaaatctgaaggccagcttggagaacagcctgagggaggtggaggc
ccgctatttcctgcagatggaacagctcagtgggatactgctgtacctggagtcagagct
ggcacagacccaggcagagggacagtgccaggcccaggagtaggaagccctgctgaacat
aaggtcaagctggaggctgagatcaccacctactgccgcctgctggaagacagcgaggac
ttcaatcttggtgatgccctggacagccgcaactccatgcaaaccatccaaaagaccacc
acccgccagacagtggatggcaaagtggtgtctgagaccaacgacaccaaagttctgaga
cattaa

>gi568815578f:49715008_49987933|GENSCAN_predicted_peptide_7|322_aa
MERAGLYQCSLMGKGPRPETVCEQVCVYEPLSPPTLQVPGDSLGKLEPGEDDFVHGCHTR
HQVTKQTVVLPFSPSTGDDPRCASEPRLGGVPARALTATRREPGQQPAHLLGEWPSAETS
LRLARRKPSDPNRKPNYSELQDSNPEFTFQQPYDQAHLLAAIPPPEILNPTASLPMLIWD
SVLAPQAQPIAWASLRLQESPRVAELTSLSDEDSGKGSQPPSPPSPAPSSFSSTSVSSLE
AEAYAAFPGLGQVPKQLAQLSEAKDLQARKAFNCKYCNKEYLSLGALKMHIRSHTLPCVC
GTCGKAFSRPWLLQGHVRTHTX

>gi568815578f:49715008_49987933|GENSCAN_predicted_CDS_7|966_bp
atggagagagcagggctctatcagtgcagtctgatgggtaaggggcctaggcccgagaca
gtctgcgagcaagtgtgcgtgtatgagcccctgagcccgcccaccctgcaagtgcctgga
gactcactggggaagctagaaccaggggaggacgattttgttcacggctgtcacacccgg
caccaagtgactaaacagacagtagttctgcccttcagccccagcaccggggacgacccg
cgctgcgccagcgaaccccgcctcggaggagtccccgcccgggctctcaccgccacgcgg
cgcgagcccggccagcagccggcgcacctgctcggggagtggccttcggcggagacgagc
ctccgattggcgcggaggaagccctccgaccccaatcggaagcctaactacagcgagctg
caggactctaatccagagtttaccttccagcagccctacgaccaggcccacctgctggca
gccatcccacctccggagatcctcaaccccaccgcctcgctgccaatgctcatctgggac
tctgtcctggcgccccaagcccagccaattgcctgggcctcccttcggctccaggagagt
cccagggtggcagagctgacctccctgtcagatgaggacagtgggaaaggctcccagccc
cccagcccaccctcaccggctccttcgtccttctcctctacttcagtctcttccttggag
gccgaggcctatgctgccttcccaggcttgggccaagtgcccaagcagctggcccagctc
tctgaggccaaggatctccaggctcgaaaggccttcaactgcaaatactgcaacaaggaa
tacctcagcctgggtgccctcaagatgcacatccgaagccacacgctgccctgcgtctgc
ggaacctgcgggaaggccttctctaggccctggctgctacaaggccatgtccggacccac
actgnn