GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:18:19 Sequence gi568815586r:50888328_51108625 : 220298 bp : 43.52% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 36712 37209 498 0 0 75 117 570 0.946 51.66 1.02 Term + 41587 41823 237 0 0 110 36 139 0.845 7.07 1.03 PlyA + 43919 43924 6 1.05 2.03 PlyA - 46146 46141 6 1.05 2.02 Term - 49318 49109 210 1 0 89 41 129 0.724 5.59 2.01 Init - 53857 53804 54 1 0 99 64 48 0.586 2.78 2.00 Prom - 56784 56745 40 -8.66 3.00 Prom + 57785 57824 40 -3.06 3.01 Init + 57839 58257 419 1 2 71 53 184 0.125 9.10 3.02 Intr + 64114 64142 29 1 2 105 89 10 0.066 0.56 3.03 Intr + 65664 65765 102 1 0 60 97 57 0.188 3.95 3.04 Intr + 72641 72775 135 0 0 84 59 82 0.777 5.44 3.05 Term + 79731 79843 113 1 2 46 43 143 0.494 4.32 3.06 PlyA + 80670 80675 6 1.05 4.00 Prom + 88035 88074 40 -4.76 4.01 Sngl + 88547 89677 1131 1 0 60 44 336 0.992 22.98 4.02 PlyA + 90315 90320 6 1.05 5.18 PlyA - 90934 90929 6 1.05 5.17 Term - 91366 91301 66 1 0 114 39 29 0.032 -1.46 5.16 Intr - 100108 100055 54 1 0 96 115 19 0.172 4.58 5.15 Intr - 102621 102468 154 2 1 110 111 109 0.999 15.47 5.14 Intr - 103345 103272 74 1 2 107 74 63 0.999 5.00 5.13 Intr - 104042 103863 180 2 0 111 94 150 0.777 17.96 5.12 Intr - 104602 104483 120 1 0 89 94 -5 0.646 0.89 5.11 Intr - 106303 106217 87 1 0 58 86 112 0.991 8.17 5.10 Intr - 107460 107302 159 0 0 88 89 163 0.828 16.58 5.09 Intr - 108581 108490 92 0 2 45 105 68 0.797 3.91 5.08 Intr - 110914 110840 75 2 0 129 37 77 0.935 6.19 5.07 Intr - 111088 111018 71 0 2 84 91 13 0.999 0.03 5.06 Intr - 112092 111986 107 0 2 69 100 124 0.998 10.71 5.05 Intr - 116580 116461 120 0 0 48 105 31 0.718 1.49 5.04 Intr - 117109 116984 126 1 0 99 61 65 0.977 5.78 5.03 Intr - 120217 120149 69 1 0 86 88 126 0.357 11.78 5.02 Intr - 129159 129135 25 2 1 112 62 24 0.091 0.23 5.01 Init - 137699 137599 101 2 2 79 13 190 0.288 8.34 5.00 Prom - 149281 149242 40 -2.46 6.00 Prom + 155378 155417 40 -3.66 6.01 Init + 160030 160151 122 0 2 96 85 88 0.969 6.96 6.02 Intr + 160707 160858 152 0 2 93 93 95 0.944 10.31 6.03 Intr + 165485 165533 49 0 1 50 82 46 0.665 -2.06 6.04 Intr + 167508 167694 187 0 1 121 61 10 0.344 1.29 6.05 Intr + 168023 168175 153 1 0 69 103 56 0.621 5.47 6.06 Term + 171783 171881 99 2 0 80 48 62 0.635 -0.47 6.07 PlyA + 173972 173977 6 1.05 7.13 PlyA - 174850 174845 6 -3.24 7.12 Term - 176342 175419 924 2 0 106 48 768 0.996 66.78 7.11 Intr - 179642 179346 297 2 0 96 82 420 0.995 39.27 7.10 Intr - 180347 180222 126 2 0 65 88 37 0.742 2.28 7.09 Intr - 185755 185496 260 2 2 79 46 300 0.956 22.08 7.08 Intr - 188252 188084 169 2 1 20 94 233 0.138 16.62 7.07 Intr - 189657 189549 109 2 1 70 34 49 0.032 -2.01 7.06 Intr - 195458 195368 91 0 1 64 33 128 0.015 3.95 7.05 Intr - 207713 207662 52 2 1 97 85 18 0.498 0.88 7.04 Intr - 210591 210449 143 1 2 117 61 78 0.974 8.07 7.03 Intr - 211452 211328 125 2 2 115 86 57 0.940 8.43 7.02 Intr - 218286 218198 89 0 2 84 110 14 0.959 1.97 7.01 Init - 218977 218909 69 1 0 81 83 45 0.746 4.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100108 99998 111 1 0 96 41 84 0.819 3.06 S.002 Init - 188234 188084 151 2 1 37 94 227 0.860 18.40 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:50888328_51108625|GENSCAN_predicted_peptide_1|244_aa MELTIFILRLAIYILTFPLYLLNFLGLWSWICKKWFPYFLVRFTVIYNEQMASKKRELFS NLQEFAGPSGKLSLLEVGCGTGANFKFYPPGCRVTCIDPNPNFEKFLIKSIAENRHLQFE RFVVAAGENMHQVADGSVDVVVCTLVLCSVKNQERILREVCRVLRPGGAFYFMEHVAAEC STWNYFWQQVLDPAWHLLFDGCNLTRESWKALERASFSKLKLQHIQAPLSWELVRPHIYG YAVK >gi568815586r:50888328_51108625|GENSCAN_predicted_CDS_1|735_bp atggagcttaccatctttatcctgagactggccatttacatcctgacatttcccttgtac ctgctgaactttctgggcttgtggagctggatatgcaaaaaatggttcccctacttcttg gtgaggttcactgtgatatacaacgaacagatggcaagcaagaagcgggagctcttcagt aacctgcaggagtttgcgggcccctccgggaaactctccctgctggaagtgggctgtggc acgggggccaacttcaagttctacccacctgggtgcagggtgacctgtattgaccccaac cccaactttgagaagtttttgatcaagagcattgcagagaaccgacacctgcagtttgag cgctttgtggtagctgccggggagaacatgcaccaggtggctgatggctctgtggatgtg gtggtctgcaccctggtgctgtgctctgtgaagaaccaggagcggattctccgcgaggtg tgcagagtgctgagaccgggaggggctttctatttcatggagcatgtggcagctgagtgt tcgacttggaattacttctggcaacaagtcctggatcctgcctggcaccttctgtttgat gggtgcaacctgaccagagagagctggaaggccctggagcgggccagcttctctaagctg aagctgcagcacatccaggccccactgtcctgggagttggtgcgccctcatatctatgga tatgctgtgaaatag >gi568815586r:50888328_51108625|GENSCAN_predicted_peptide_2|87_aa MGRVRWLTPVIPGLWEAEDTEQELGTHRGTRRMVGLKELQHKWTEAHTTLLQPFLLVTLQ AMRRKEKLRPFWESRSKGSPSQGCDML >gi568815586r:50888328_51108625|GENSCAN_predicted_CDS_2|264_bp atgggccgggtgcggtggctcacgcctgtaatcccaggactttgggaggctgaggacacg gaacaagaacttgggacccaccgagggacccgccgaatggtgggactgaaagagctgcaa cacaaatggactgaagcacacaccacccttctgcaacccttcctgctcgtcacgttgcag gcgatgagaaggaaagaaaagctgcggcccttctgggagtccagatctaagggctcccca agccagggctgtgacatgctgtaa >gi568815586r:50888328_51108625|GENSCAN_predicted_peptide_3|265_aa MGKDFMSKPPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNKQPTEWEKIFAIYSSDKGL ISRIYKELKQIYKKKSNNPINKWAKDMNRHFSKEDIYAANRHMKKCSSSLVIREMQIKTT MRYHLTPVRMVIIKKSGNNRSTWLQSEGGEKKMSSDNQWSADEDEGQLSRLIRKSRDSPF VPIGIAGFVTVVSCGLYKLKYRRDQKMSIHLIHMRVAAQGFVVGAVTLALATERGIVSNN KKKKEEEEEEEENEEEEEKGFITEI >gi568815586r:50888328_51108625|GENSCAN_predicted_CDS_3|798_bp atgggcaaggacttcatgtctaaaccaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacaagcaacctacagaatgggagaaaatttttgcaatctactcatctgacaaagggtta atatccagaatctacaaagaacttaaacaaatttacaagaaaaaatcaaacaaccccatc aacaagtgggcgaaggatatgaacagacacttctcaaaagaagacatttatgcagccaac agacacatgaaaaaatgctcatcatcactggtcatcagagaaatgcaaatcaaaaccaca atgagataccatctcacaccagttagaatggtgatcattaaaaagtcaggaaacaacagg tccacatggctgcagtcggagggaggcgagaaaaaaatgtcttcagataaccagtggtca gcagatgaggatgaaggccaattatcccgactaatcaggaaatctagagactcccccttt gtccctataggtatagcaggctttgtgactgtggtgtcctgtggtctttacaagctaaag tacagaagagatcagaaaatgtcaattcatcttattcacatgagagttgctgcccaagga tttgttgttggagctgtgactctagccttggccacagagcgaggcattgtctcaaataat aagaagaagaaggaggaggaggaggaggaggaggagaatgaggaggaggaggagaaggga tttattacagagatatga >gi568815586r:50888328_51108625|GENSCAN_predicted_peptide_4|376_aa MSEIPFTIASKRIKYLGIQLTREVKDLSKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV KTAILPKVIYRFNAIPIKLPMTFFTELGKTTLKFIWNQKRACIAKSILSQKNKAGSITLP DFKLYYQATVTKTAWYWCQNRDIDQWNRTEPSEIIPHIYNHLTFDKPDKNEKWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKD FMSKPPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKISAIYSSDKGLISRI YKELKQFHKKKNNPINKWVKDMNRHFSKEDIYAANRHMKKCSSSLAIREMQIKTTMRYHL TPVRMAIIKKSGNNRY >gi568815586r:50888328_51108625|GENSCAN_predicted_CDS_4|1131_bp atgagtgaaatcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggaggtgaaggacctctccaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaacggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattgggaaaaactactttaaagttcatatggaaccaaaaaaga gcctgcattgccaagtcaatcctaagccaaaagaacaaagctggaagcatcacgctacct gacttcaaactatactaccaggctacagtaaccaaaacagcatggtactggtgccaaaac agagatatagaccaatggaacagaacagagccctcagaaataataccacacatctacaac catctgacctttgacaaacctgacaaaaacgagaaatggggaaaggattccctatttaac aaatggtgctgggaaaactggttagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggattaaagacttaaacgttagacctaaaacc ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac ttcatgtctaaaccaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acagaatgggagaaaatttctgcaatctactcatctgacaaagggctaatatccagaatc tacaaagaactcaaacaatttcacaagaaaaaaaacaaccccatcaacaagtgggtgaag gatatgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaaaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc acaccagttagaatggcgatcattaaaaagtcaggaaacaacaggtactag >gi568815586r:50888328_51108625|GENSCAN_predicted_peptide_5|559_aa MAQLPAAPDGAPGLCRGALTCFDASKEADGHRARDGLYYQFLSPGDSEEYFATYFNEKIS IPEEEYSCFSFRKLWAFTGPGFLMSIAYLDPGNIESDLQSGAVAGFKLLWILLLATLVGL LLQRLAARLGVVTGLHLAEVCHRQYPKVPRVILWLMVELAIIGSDMQEVIGSAIAINLLS VGRIPLWGGVLITIADTFVFLFLDKYGLRKLEAFFGFLITIMALTFGYEASGCRTPQIEQ AVGIVGAVIMPHNMYLHSALVKSRQVNRNNKQEVREANKYFFIESCIALFVSFIINVFVV SVFAEAFFGKTNEQVVEVCTNTSSPHAGLFPKDNSTLAVDIYKGGVVLGCYFGPAALYIW AVGILAAGQSSTMTGTYSGQFVMETVICSYVFFQGFLNLKWSRFARVVLTRSIAIIPTLL VAVFQDVEHLTGMNDFLNVLQSLQLPFALIPILTFTSLRPVMSDFANGLGWRIAGGILVL IICSINMYFVVVYVRDLGHVALYVVAAVVSVAYLGFVFYLGWQCLIALGMSFLDCGHTGL LKLQIMRAANPATQAIESF >gi568815586r:50888328_51108625|GENSCAN_predicted_CDS_5|1680_bp atggcccagctcccagctgcaccggatggcgcgcccggcctgtgtcggggtgcgctgacc tgcttcgacgcctcgaaagaggccgacgggcacagggcaagggatggcctttattaccag tttttgtcccctggggactcagaggagtacttcgccacttactttaatgagaagatctcc attcctgaggaggagtactcttgttttagctttcgtaaactctgggctttcaccggacca ggttttcttatgagcattgcctacctggatccaggaaatattgaatccgatttgcagtct ggagcagtggctggatttaagttgctctggatccttctgttggccacccttgtggggctg ctgctccagcggcttgcagctagactgggagtggttactgggctgcatcttgctgaagta tgtcaccgtcagtatcccaaggtcccacgagtcatcctgtggctgatggtggagttggct atcatcggctcagacatgcaagaagtcattggctcagccattgctatcaatcttctgtct gtaggaagaattcctctgtggggtggcgttctcatcaccattgcagatacttttgtattt ctcttcttggacaaatatggcttgcggaagctagaagcattttttggctttctcatcact attatggccctcacatttggatatgaggcaagtggctgtcgcactccacagattgaacag gctgtgggcatcgtgggagctgtcatcatgccacacaacatgtacctgcattctgcctta gtcaagtctagacaggtaaaccggaacaataagcaggaagttcgagaagccaataagtac tttttcattgaatcctgcattgcactctttgtttccttcatcatcaatgtctttgttgtc tcagtctttgctgaagcattttttgggaaaaccaacgagcaggtggttgaagtctgtaca aataccagcagtcctcatgctggcctctttcctaaagataactcgacactggctgtggac atctacaaagggggtgttgtgctgggatgttactttgggcctgctgcactctacatttgg gcagtggggatcctggctgcaggacagagctccaccatgacaggaacctattctggccag tttgtcatggagacagtcatctgctcctatgttttcttccagggattcctgaacctaaag tggtcacgctttgcccgagtggttctgactcgctctattgccatcatccccactctgctt gttgctgtcttccaagatgtagagcatctaacagggatgaatgactttctgaatgttcta cagagcttacagcttccctttgctctcatacccatcctcacatttacgagcttgcggcca gtaatgagtgactttgccaatggactaggctggcggattgcaggaggaatcttggtcctt atcatctgttccatcaatatgtactttgtagtggtttatgtccgggacctagggcatgtg gcattatatgtggtggctgctgtggtcagcgtggcttatctgggctttgtgttctacttg ggttggcaatgtttgattgcactgggcatgtccttcctggactgtgggcatacgggcctc ttgaaacttcagataatgagagcagccaaccctgcaactcaggctatagagtccttctga >gi568815586r:50888328_51108625|GENSCAN_predicted_peptide_6|253_aa MALSRVCWARSAVWGSAVTPGHFVTRRLQLGRSGLAWGAPRSSKLHLSPKADVKNLMSYV VTKTKAINGKYHRFLGRHFPRFYVLYTIFMKGIISIPPFANYLVFLLMYLFPRQLLIRHF WTPKQQTDFLDIYHAFRKQSHPEIISYLEKVIPLISDAGLRWRLTDLCTKKALSRAMLLT SYLPPPLLRHRLKTHTTVIHQLDKALAKLGIGQLTAQEVKSPSLSIVFSGLYYSPLWIFN SAASTVMQQSNCN >gi568815586r:50888328_51108625|GENSCAN_predicted_CDS_6|762_bp atggcgctctccagggtgtgctgggctcggtcggctgtgtggggctcggcagtcacccct ggacattttgtcacccggaggctgcaacttggtcgctctggcctggcttggggggcccct cggtcttcaaagcttcacctttctccaaaggcagatgtgaagaacttgatgtcttatgtg gtaaccaagacaaaagcgattaatgggaaataccatcgtttcttgggtcgtcatttcccc cgcttctatgtcctgtacacaatcttcatgaaaggtattatttccattccaccttttgcc aactacctggtcttcttgctaatgtacctgtttcccaggcaactactgatcaggcatttc tggaccccaaaacaacaaactgatttcttagatatctatcatgctttccggaagcagtcc cacccagaaattattagttatttagaaaaggtcatccctctcatttctgatgcaggactc cggtggcgtctgacagatctgtgcaccaagaaagccttgagccgggccatgcttctcaca tcttacctgcctcctcccttgttgagacatcgtttgaagactcatacaactgtgattcac caactggacaaggctttggcaaagctggggattggccagctgactgctcaggaagtaaaa tcgccttcactctccattgtcttttctgggctgtattacagccctctgtggatcttcaac tctgctgcctccactgtgatgcagcagtccaactgtaactga >gi568815586r:50888328_51108625|GENSCAN_predicted_peptide_7|817_aa MEKRTPHEKEKYQPSYETTILTECSPWPEITYVNNSPSPGFNSSHSSFSLGEGMVRPRLT IYVCQESLQLREQQQQQQQQQQKHEDGDSNGTFFVYHAIYLEELTAVELTEKIAQLFSIS PCQISQIYKQGPTGIHVLISDEMIQNFQEEACFILDTMKGCSRAAASGAGLRAGFGFAAV TAYNSRHAAGISAEQIRVLLELKSKDGGKRFVPREMDYLRLLGCMKETPLKPMDAFTGSG LKRKFDDVDVGSSVSNSDDEISSSDSADSCDSLNPPTTASFTPTSILKRQKQLRRKNVRF DQVTVYYFARRQGFTSVPSQGGSSLGMAQRHNSVRSYTLCEFAQEQEVNHREILREHLKE EKLHAKKMKKASECKLGDTCLEIDGMFVEIDFRVGGKMRRVFEKLGPRKVMLTKNGTVES VEADGLTLDDVSDEDIDVENVEVDDYFFLQPLPTKRRRALLRASGVHRIDAEEKQELRAI RLSREECGCDCRLYCDPEACACSQAGIKCQVDRMSFPCGCSRDGCGNMAGRIEFNPIRVR THYLHTIMKLELESKRQVSRPAAPDEEPSPTASCSLTGAQGSETQDFQEFIAENETAVMH LQSAEELERLKAEEDSSGSSASLDSSIESLGVCILEEPLAVPEELCPGLTAPILIQAQLP PGSSVLCFTENSDHPTASTVNSPSYLNSGPLVYYQVEQRPVLGVKGEPGTEEGSASFPKE KDLNVFSLPVTSLVACSSTDPAALCKSEVGKTPTLEALLPEDCNPEEPENEDFHPSWSPS SLPFRTDNEEGCGMVKTSQQNEDRPPEDSSLELPLAV >gi568815586r:50888328_51108625|GENSCAN_predicted_CDS_7|2454_bp atggagaaacgaacacctcatgaaaaggagaaatatcagccttcctatgagacaaccata ctcacagagtgttctccatggcccgagatcacgtatgtcaataactccccatcacctggc ttcaacagttcccatagcagtttttctcttggggaagggatggtgcgtccaaggttaacc atttatgtttgtcaggaatcactgcagttgagggagcagcaacaacagcagcagcaacag cagcagaagcatgaggatggagactcaaatggtactttcttcgtttaccatgctatctat ctagaagaactaacagctgttgaattgacagaaaaaattgctcagcttttcagcatttcc ccttgccagatcagccagatttacaagcaggggccaacaggaattcatgtgctcatcagt gatgagatgatacagaactttcaggaagaagcatgttttattctggacacaatgaaaggt tgcagcagagctgccgcctcgggagccggtttgcgcgccggcttcggctttgcagcagtt accgcctacaactcccggcatgctgctggcatttctgctgagcagattcgtgtccttctg gaacttaaatccaaagatggaggaaaaagatttgtacctagggaaatggactatctcaga ctccttggttgtatgaaagaaacccctttgaaaccaatggatgcattcacgggctcgggt ctcaagaggaagtttgatgatgtggatgtgggctcatcagtttccaactcagatgatgag atctccagcagtgatagtgctgacagctgcgacagcctcaatcctcctaccactgccagc ttcacacccacatccatcctgaagcggcagaagcagctgcggaggaagaatgtacgcttt gaccaggtgactgtatactactttgcccggcgccaaggttttaccagtgtgcccagccag ggtggtagctctctgggcatggcccagcgccataactctgtacggagctatacactctgt gagtttgcccaggaacaggaggtgaaccatcgagagattctgcgtgagcacctgaaggaa gagaaactccatgccaagaaaatgaagaaggcaagtgagtgtaaattaggggatacatgt ctggaaatagacggtatgtttgtggaaatagattttcgtgtaggagggaagatgagacgg gtttttgagaagcttggacccagaaaggtgatgctgaccaagaatgggacagtggagtcg gtggaggctgatggcctgacgctggatgatgtgtcagatgaagatattgatgtggaaaat gtggaggtggatgattacttcttcctgcagcctctgcccaccaaacggcgacgggccctg ctgagggcttctggggtccaccgtattgatgctgaagagaagcaagaacttcgagccatc cgcctgtcacgggaagaatgtggttgtgactgccgactgtattgtgacccagaagcgtgt gcctgcagccaggctgggattaaatgccaggtggatcgcatgtcctttccatgtggctgc tcccgggatggctgtgggaacatggcaggacgcattgaatttaatccaatccgggtccgg actcattacctccacaccattatgaagctggagctggagagcaagcggcaggtgagccgc ccagcagccccagatgaggagccctccccgactgccagttgcagcctgacaggagcacag ggctctgagacccaggacttccaggagttcattgctgagaatgagacagcagtgatgcac ctgcagagtgcagaggaactggagcggctcaaggcagaagaagattccagcggctctagt gccagcctggactcgagcatcgagagcctgggtgtgtgcatcctagaggagcctctggct gtccccgaagagctgtgcccaggccttacagcccccattctcatccaggctcagctgccc ccaggctcctctgtcctgtgttttaccgagaactcagaccacccaactgcctcaacggtg aacagcccatcctacttgaacagtgggcccctggtctattatcaagtggagcagaggcca gtcttgggagtgaaaggagagcctggtacggaagaaggctcagcctctttcccaaaggag aaggatctgaatgtcttctctctccctgttacctcactcgtggcttgtagctccacagac ccagctgccctctgtaaatcagaggtggggaaaacacccaccctagaagctctattgccc gaagattgtaaccctgaggagcctgaaaatgaagacttccacccttcctggtccccctca agcctccccttccgcacggacaatgaagagggctgtgggatggtgaagacctcccagcag aatgaggatcggccccctgaagattcttccttagaactccctctggcagtgtga