GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:19:04

Sequence gi568815583r:79861070_80071119 : 210050 bp : 42.77% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +    320    441  122  0  2  128   39    56 0.509   2.36
 1.02 PlyA +   2320   2325    6                               1.05

 2.00 Prom +   2751   2790   40                              -4.35
 2.01 Init +   4295   4456  162  1  0   51   74   102 0.295   4.88
 2.02 Term +   9272   9310   39  1  0   70   48    84 0.179  -1.19
 2.03 PlyA +   9469   9474    6                               1.05

 3.10 PlyA -  11080  11075    6                               1.05
 3.09 Term -  12240  12175   66  0  0  110   34    56 0.093  -0.64
 3.08 Intr -  12727  12597  131  2  2   50  110    46 0.073   2.49
 3.07 Intr -  28285  28024  262  1  1   61  109   203 0.235  15.84
 3.06 Intr -  36013  35803  211  0  1   72   86   329 0.988  29.09
 3.05 Intr -  36319  36092  228  0  0   48   99   152 0.826   8.36
 3.04 Intr -  36538  36477   62  1  2   69   89    22 0.166  -2.99
 3.03 Intr -  53606  53519   88  1  1   57   89    89 0.169   4.95
 3.02 Intr -  56880  56729  152  0  2  -24   48   132 0.080  -3.96
 3.01 Init -  58293  58234   60  0  0   69   59    53 0.139   1.84
 3.00 Prom -  59852  59813   40                              -6.65

 4.00 Prom +  60355  60394   40                              -5.65
 4.01 Init +  61577  62703 1127  1  2   59   45   652 0.515  51.70
 4.02 Intr +  65629  65801  173  1  2   52   31   133 0.409   2.66
 4.03 Intr +  65954  66130  177  0  0  -39   33   206 0.018   1.37
 4.04 Intr +  68098  68132   35  2  2   87   75    31 0.018  -1.28
 4.05 Term +  74223  74360  138  2  0   85   43   140 0.048   6.18
 4.06 PlyA +  76598  76603    6                               1.05

 5.00 Prom +  81670  81709   40                              -4.85
 5.01 Init +  83453  83665  213  1  0   56    4   249 0.797  12.09
 5.02 Intr +  83830  84025  196  0  1  -41   87   217 0.382   6.77
 5.03 Term +  84123  84511  389  1  2 -108   43   568 0.387  26.82
 5.04 PlyA +  84548  84553    6                               1.05

 6.00 Prom +  85416  85455   40                              -6.05
 6.01 Init +  93793  93845   53  0  2   72   53    70 0.557   2.58
 6.02 Term +  99431  99647  217  2  1   83   40   188 0.980   9.13
 6.03 PlyA + 100115 100120    6                               1.05

 7.05 PlyA - 100357 100352    6                               1.05
 7.04 Term - 110057 109559  499  1  1   65   41   248 0.419  10.81
 7.03 Intr - 120995 120873  123  1  0   26   56   162 0.713   5.58
 7.02 Intr - 126807 126642  166  1  1   28   99   116 0.761   4.80
 7.01 Init - 144993 144849  145  0  1   68   45   160 0.019  10.05
 7.00 Prom - 149467 149428   40                              -6.55

 8.00 Prom + 149852 149891   40                              -6.35
 8.01 Init + 156492 156731  240  2  0   64   42   233 0.496  14.26
 8.02 Intr + 164221 164278   58  0  1  125  101    26 0.945   5.14
 8.03 Term + 165522 165724  203  1  2  101   42   139 0.996   7.17
 8.04 PlyA + 167044 167049    6                               1.05

 9.05 PlyA - 167912 167907    6                               1.05
 9.04 Term - 170798 170678  121  1  1   95   43    83 0.689   1.47
 9.03 Intr - 171931 171877   55  2  1   77  115    21 0.599   0.82
 9.02 Intr - 172527 172376  152  2  2   80   33   106 0.624   3.19
 9.01 Init - 173333 173218  116  2  2   84   47   102 0.689   5.43
 9.00 Prom - 178562 178523   40                              -0.85

10.02 PlyA - 179167 179162    6                               1.05
10.01 Term - 198358 197791  568  0  1   65   32   243 0.512   9.30

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:79861070_80071119|GENSCAN_predicted_peptide_1|40_aa
XHHMEATTQLVVINARSFVISAIHSLRERSTYSHVLVMST
>gi568815583r:79861070_80071119|GENSCAN_predicted_CDS_1|123_bp
nggcatcacatggaggctaccacacaactggtagtcattaatgcacgttcatttgttatt
tcggccattcactcactcagggagaggtcaacatactctcatgtgttggtgatgagtacg
taa
>gi568815583r:79861070_80071119|GENSCAN_predicted_peptide_2|66_aa
MPYLSRKLRGISARYNLGKREFDNEIHVKEMAALGRAGRKPVWNCSQAPSQEEQWHNRAA
VKDSMA
>gi568815583r:79861070_80071119|GENSCAN_predicted_CDS_2|201_bp
atgccatatctaagtcggaaacttcgaggtatttctgcaagatataatttaggaaaaagg
gaatttgataatgaaatacatgttaaagaaatggctgccttgggaagagccggaaggaaa
ccagtgtggaactgcagtcaagcccccagccaggaggagcagtggcacaatcgagctgct
gtcaaagattccatggcctga
>gi568815583r:79861070_80071119|GENSCAN_predicted_peptide_3|419_aa
MSAPEPGACATQPGSQARQRKEEKLEPAWEGPYLVLLTTETAFRTAEKGWTHHTRVKKVS
SPSQSWDIVPSITPFSHEWPLHIFEDYRRVRADPFSLPHMIHKYHGAYNCLQYSVQNTLY
SPTTEQYKRCQAPWVLPAIFTIPASGAAASAVPVRAQAQGGQAAGQRRRLATAPLTGEGE
PDHGDRTRAGSQGPEARPRIHLGAARGGDRTRGRPGRRGRDWTRLGREMAAAAVSSAKRS
LRGELKQRLRAMSAEERLRQSRVLSQKVIAHSEYQKSKRISIFLSMQDEIETEEIIKDIF
QRGKICFIPRYRFQSNHMDMVRIESPEEISLLPKTSWNIPQPGEGDVREEALSTAHKAPY
HKLSLPLLAAELLFFPASPILVKLPFLRTELGAWEPHQIRKKAQMKYLIDGHTTHEWQK
>gi568815583r:79861070_80071119|GENSCAN_predicted_CDS_3|1260_bp
atgagcgcaccggagccaggagcctgtgccacgcagccaggcagccaggcgcgccagagg
aaagaagaaaaactggagccagcctgggaaggaccctaccttgtgctgctaaccaccgag
actgcctttcgcacagcagaaaaaggatggacacatcatacccgagtcaagaaagtgtca
tcaccttcacaatcatgggacattgttccaagtataaccccgttttcacacgaatggccc
cttcacatatttgaagactaccgtcgtgtccgtgctgaccctttctccctgccacacatg
atacacaagtatcacggtgcttacaattgcctacagtattcggtacaaaatacgctgtac
agtcccactactgaacagtacaagagatgccaggcaccgtgggtactgccggccatcttt
accatccccgccagtggcgctgccgcctccgcggtcccggtgagggcccaggctcaggga
ggtcaggcggcagggcagcgacggcgcctcgccaccgctcccctaaccggggagggcgag
ccggaccacggggaccgcacgagggctgggtcccaagggccagaggcgaggccgcggatc
cacctgggagcggccaggggcggagaccgaacccgagggcgcccagggcgccgagggcgg
gactggactcggcttgggcgtgagatggcggcggcagcggtgagcagcgccaagcggagc
ctgcggggagagctgaagcagcgtctgcgggcgatgagtgccgaggagcggctacgccag
tcccgcgtactgagccagaaggtgattgcccacagtgagtatcaaaagtccaaaagaatt
tccatctttctgagcatgcaagatgaaattgagacagaagagatcatcaaggacattttc
caacgaggcaaaatctgcttcatccctcggtaccggttccagagcaatcacatggatatg
gtgagaatagaatcaccagaggaaatttctttacttcccaaaacatcctggaatatccct
cagcctggtgagggtgatgttcgggaggaggccttgtccacagctcacaaagccccctac
cacaagttaagtctgccccttctggcagcagagctgctgttttttccagccagccccata
ttggtaaaactgccattcttgaggactgagctgggggcctgggaacctcaccagatcaga
aaaaaggctcagatgaagtaccttatcgatggtcacacaactcatgagtggcagaaataa
>gi568815583r:79861070_80071119|GENSCAN_predicted_peptide_4|549_aa
MVSPSTLEIDPKHQPLHRRRRLWSAPPELGDTSSSQTPPQHPKCPDQHPKAGLLYKAESL
PTKQFPTSDSSPRRTALLDPTNPPTPIGPSTQRVHNPEKELQALYLQASNPQLQLTSPLP
EQDHEYPPRADLAGYTKPGARPEGSSGTSSATGASPPSPARRPALTSSPPPPPSLTDAPG
RLASRPPPPPARRWRLAAPRYSSASGPSGPAARLCAETQGEAPRGPRAVGGSGGAGEPRW
SRGPAQRERLLGCSCCFPSFFTGTCSFRTSWSAFTLELTSSIPAPLPPPPAALSLSQQQG
KSRRKEASTPLTPAEAPAYSRARPRRQGSPSAGPEAPLATDEPPPQRRSGIRGGSANIRS
LQPSSLRPSAHRGSALYSGGDLENLCVDTLYLANLVGTWRTFVSSSGIVNAPISALSKQT
TGLSVKWTNQKDVAVTLTGKVCSFTPEPVRPRTHQKEETPNTSGRQEEQTPDTLPLRTVT
LTARVHGFILEVKSCVHGNQMILCVGRETEKRNKTQRQTTEKEQWAQGTGAQHMEDPHQC
PSLSSLSIY
>gi568815583r:79861070_80071119|GENSCAN_predicted_CDS_4|1650_bp
atggtctccccatccacgctagaaattgatcccaaacaccaaccgctgcatagacgtcgg
cgactttggtctgcaccccccgaacttggtgatacctcatcaagccaaacccctccccaa
catccaaaatgcccagaccagcatccgaaagctggtctcctatacaaagcagaatccctg
cctacaaagcagtttcccacctccgactcctccccacggcgaacagcccttctggatcca
accaacccaccaaccccaatagggcccagcacccagagggtccacaatccagagaaggag
ctacaagccctgtacttgcaagcctcgaacccccaactccaactgacctcccctctgcca
gagcaagaccacgagtacccgccgagagcggacctggctggctacacaaagccaggcgcc
cgaccagagggatcctccgggacgtcctccgccaccggggcgtccccgcccagccccgcc
cggcgcccggcgctcaccagctctcctccccctcctcctagcctgacagacgctccaggt
cgtctagcaagccgccctcccccgccgccagcaagaaggtggcgtctcgcagctcctcgt
tattcgtccgcctcgggtccatccggccccgccgcccgactttgcgctgagacccaaggg
gaagccccgaggggaccccgcgccgtaggagggagtggaggagccggcgaaccgcgatgg
agccggggcccagcacagcgcgagcggctcctcggctgttcgtgctgcttccccagcttc
ttcaccggcacctgcagcttccgcacctcctggtccgcgttcacgttagagttaacgtcc
tccatcccggccccgctgcctccgccgcccgcggccctctcgctctcgcagcaacaaggg
aagagccggaggaaagaggcgtccacgccgctcactccagccgaggcccccgcatatagc
agggccaggccccgtcgccagggttcccccagcgcgggtcccgaagcaccgctggccaca
gatgaaccgcctcctcagcgcaggtccgggattcggggaggcagcgccaatatccggtct
ctgcagccatcttccctccgtccctccgcccaccgagggtcggccctctactctggtggg
gacttggagaacctttgtgtggacactctgtatctagctaatctagtggggacgtggaga
acctttgtgtctagctcagggattgtaaacgcaccaatcagcgccctgtcaaaacagacc
actgggctctctgtaaaatggaccaatcagaaggatgtggctgtaacactcacgggaaag
gtctgcagcttcactcctgagccagtgagaccacgaacccaccagaaggaagaaactccg
aacacatccggacgtcaggaggaacaaactccggacacgctgcctttaagaactgtaaca
ctcactgctagggtccacggcttcattcttgaagtcaaatcttgtgtacacggcaaccag
atgatcctttgtgtgggacgagagactgagaaaagaaataagacacagagacaaactaca
gagaaagaacagtgggcccagggaaccggcgctcagcatatggaggacccgcaccagtgc
ccgtctctgagttccctcagtatttattga
>gi568815583r:79861070_80071119|GENSCAN_predicted_peptide_5|265_aa
MNGDQKSDVYAQEKQDFFQHFSQIVGVLTEDEMGHSEIGDAIARLKDVLEYNAIGGKYHR
GLTVLVAFQELDAINYAILLEACIYYLLKLYCQAQPYYLNLIELFLQISCQPEIGQTLDL
ITAPRTMWILADSLKTARQCLEDPTGDREFFQIQDGYLDLFGDPTVTRKVGTDIQDNKCS
WLVVQCLQRSTLEQYQILKENYGQKKAKKVVQVKALYEELDLPAVFLQYEEDGYSHIMGL
TEQYAAPLLPAMFLGLVYKIYKQKK
>gi568815583r:79861070_80071119|GENSCAN_predicted_CDS_5|798_bp
atgaatggggaccagaaatcagatgtttatgcccaagaaaagcaggatttctttcagcac
ttctcccagatcgttggggtgctgactgaggatgagatggggcactcagagataggagat
gctattgcccggctcaaggacgtcctggagtacaatgccattggaggcaagtatcaccgg
ggtttgacggtgctagtagcgttccaggagctggatgccatcaattatgctatccttctg
gaagcatgtatctactacctgctgaagctctattgccaagcacagccctattacctgaac
ctgatcgagctcttcctgcagatttcctgtcagcctgagattgggcagaccctggacctc
atcacagcccctaggacaatgtggattttggcagattcactgaaaacagcacgccaatgc
ttagaagatcctactggagatcgagagttttttcaaattcaggatggttaccttgacctc
tttggggaccccactgtgacccgcaaagttggcactgacatccaggacaacaaatgcagc
tggctggtggttcagtgtctgcaacgatccactctggaacagtaccagatcctgaaggaa
aattacggacagaagaaggccaagaaggtagtccaggtgaaggcactgtatgaggagctg
gatctgccggccgtgttcttgcagtatgaggaagacggttacagccacattatgggtctc
actgaacagtacgcagcgcccctgctcccagccatgtttctggggcttgtgtacaaaatc
tacaagcagaaaaagtga
>gi568815583r:79861070_80071119|GENSCAN_predicted_peptide_6|89_aa
MGSGGSGVGGGRPVTESQISPSILHVGLPYPSTPNVKPEREGCGGGVSGNSKQPGALETA
ESIGKKEGRREEGSRPESASSLSLLLSLA
>gi568815583r:79861070_80071119|GENSCAN_predicted_CDS_6|270_bp
atgggatcaggaggaagtggtgtaggaggaggacgcccagtgactgagagccaaatatcc
ccgagtatcctgcatgtcgggttgccatatccatcaaccccaaatgtcaagccagagaga
gaaggctgcggaggaggcgtcagtggaaactctaagcagccaggggcactggaaactgca
gaaagcataggaaagaaggaaggaaggagggaagagggaagccgacctgagtctgcttct
agcctgagtctgcttctcagtctagcctag
>gi568815583r:79861070_80071119|GENSCAN_predicted_peptide_7|310_aa
MVMDGSPEPVSLQCFSNLTGTECSEAAGGYAKEDEELLVAPLPEPSNLGGSKHYEEDKVK
GKLMRISEGDHSRQQRAVSTKVLRGRVCMGCAGNCKTNGVVGVRKVHPMEATGPTGREPE
GAEPAQLTRSSAAQNRIAPANPANWQKMTDCEFGYIYRLAQDYLQCVLQIPQPGSGPSKT
SRVLQNVAFSVQKEVEKNLKSCLDNVNVVSVDTARTLFNQVMEKEFEDGIINWGRIVTIF
AFEGILIKKLLRQQIAPDVDTYKEISYFVAEFIMNNTGEWIRQNGGWVCVMEKFFIVLSC
EIEIENFLAS
>gi568815583r:79861070_80071119|GENSCAN_predicted_CDS_7|933_bp
atggtgatggatggaagcccggagccagtcagtctgcaatgtttcagcaacctcacaggc
acagagtgctccgaggctgctggagggtatgccaaagaagacgaagaattactagtggct
cctcttccagagccctctaatctgggtggcagtaagcactatgaagaagataaggtgaag
ggaaaactcatgcggatatcagaaggagatcactccaggcagcagagggctgtgagtaca
aaggtgctgaggggcagggtgtgcatggggtgtgctgggaactgcaagacgaatggcgtg
gttggagtcagaaaagtacatcccatggaggccactggtcccacaggacgagagccagag
ggagcagagccagcccagctaacccgtagctctgcagctcaaaacagaattgccccagcc
aatcctgcaaactggcagaagatgacagactgtgaatttggatatatttacaggctggct
caggactatctgcagtgcgtcctacagataccacaacctggatcaggtccaagcaaaacg
tccagagtgctacaaaatgttgcgttctcagtccaaaaagaagtggaaaagaatctgaag
tcatgcttggacaatgttaatgttgtgtccgtagacactgccagaacactattcaaccaa
gtgatggaaaaggagtttgaagacggcatcattaactggggaagaattgtaaccatattt
gcatttgaaggtattctcatcaagaaacttctacgacagcaaattgccccggatgtggat
acctataaggagatttcatattttgttgcggagttcataatgaataacacaggagaatgg
ataaggcaaaacggaggctgggtatgtgtgatggaaaaattcttcattgttctttcctgt
gaaatagaaattgagaatttccttgctagttaa
>gi568815583r:79861070_80071119|GENSCAN_predicted_peptide_8|166_aa
MNNSRNAALRAVTLTMKVCSFTPEPARPRTHQKEETPNTSEHQKEQTPDTLPLRTVTLTV
RVHGFILEVSETKNLPILDTMGPSSCRKTSSGVPLNLRYGILGRVTTSPAGSWYISQQGW
ASISERTRPLCWNQSSGTIRILAEGNKSASCSESASLKASELPGRI
>gi568815583r:79861070_80071119|GENSCAN_predicted_CDS_8|501_bp
atgaacaactccagaaacgccgccttaagagctgtaacactcaccatgaaggtctgcagc
ttcactcctgagccagcgagaccacgaacccaccagaaggaagaaactccgaacacatct
gaacatcagaaggaacaaactccggacacgctgcctttaagaactgtaacactcactgtg
agggtccacggcttcattcttgaagtcagtgagaccaagaacctaccaattctggacaca
atgggaccgtctagttgcaggaaaacaagctcaggggtcccactgaatctacgttatggg
atattaggaagagtaactacctccccagctggaagctggtatatctcgcagcagggctgg
gcttctatttcagaaagaacacggccactttgctggaaccagagcagtggcacaatccga
atcctggcggagggcaacaagtcagcttcatgctcagaatctgcttctctgaaggcctct
gaattaccaggaagaatctaa
>gi568815583r:79861070_80071119|GENSCAN_predicted_peptide_9|147_aa
MEWESEKRLGFRQHAFSGLVPGVLVSRISVATEENLGGGPAVLACVQTCKFSKAHRLQCD
QLSQFAQNWFGFSIVPGNPSSAGLRSLGLKTRSQKGRMNGETDHFEKRQWVGCWDSIGKQ
VSVELIMEEEGDNNQSIISSVISAIMA
>gi568815583r:79861070_80071119|GENSCAN_predicted_CDS_9|444_bp
atggagtgggaaagtgagaaacggctgggtttcaggcagcatgcattttctggcttggtc
cctggggtcttggtgtcccgtatttcagttgcaacagaagagaatttaggaggcggacca
gctgtcctggcgtgtgtgcagacctgcaagttcagcaaagcccacagacttcagtgtgac
caactgtcccagtttgcccagaactggtttggttttagcattgtcccaggaaacccctct
agtgcaggactgaggtccttggggctgaaaactaggagccaaaagggcagaatgaatggt
gaaactgatcattttgaaaagcggcagtgggttgggtgctgggactctattggtaaacaa
gtttctgtggagctcattatggaggaggagggagacaataaccagtcaattatatccagt
gtgataagtgctataatggcctaa
>gi568815583r:79861070_80071119|GENSCAN_predicted_peptide_10|189_aa
XVPCSTALSGALPAAVSLPPIATGRRLRGAAETLRAWLCRFRYLCHLAGNGVSLSWTEAP
KRVALHLPLVQEGILSFCLPMQAGASVCKRKVRAQGAYWKAVLCPVLNWDLEIEARREAC
MHTCAIHTCTGTSTPSSSLFQKHGVSSHLRGARWAAALELGVIGRMAPPEVFQPSFGLTS
ITSRKYVTL
>gi568815583r:79861070_80071119|GENSCAN_predicted_CDS_10|570_bp
nntgtcccttgctccactgccctctctggggcgcttcccgccgcggtttctctacctcca
atagctacgggaaggcgattaagaggtgccgcggagactttaagggcttggttatgtagg
ttcaggtacctttgccacctcgcagggaatggcgtgtccctgtcttggactgaggcacca
aaaagagttgcactccacctgccacttgtgcaggaaggaattctgagcttctgtcttccc
atgcaggccggcgcctcagtctgcaagcgcaaagtccgagctcagggcgcatactggaaa
gcagttctctgccctgtcctgaactgggacttggaaatagaagcaagacgagaagcgtgc
atgcacacgtgtgcaatacatacatgcacaggcacaagcacccccagttccagcctgttc
caaaagcacggggtcagttcccacctgcgtggtgcacgctgggctgctgcgctggaacta
ggggtgataggcagaatggctccccctgaagtttttcagccttcgttcgggctcacgagc
atcacgagcaggaagtacgtgacactttag