GENSCAN 1.0	Date run: 3-Nov-116	Time: 03:56:33

Sequence gi568815590r:26407664_26608755 : 201092 bp : 44.27% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    337    440  104  0  2   67   89   102 0.931   7.17
 1.02 Intr +    564    713  150  0  0   91  110    31 0.895   4.88
 1.03 Term +   2701   2749   49  1  1   86   55    64 0.769  -0.32
 1.04 PlyA +   3307   3312    6                               1.05

 2.09 PlyA -   3627   3622    6                               1.05
 2.08 Term -  12340  12278   63  1  0   89   41    48 0.005  -1.91
 2.07 Intr -  26570  26381  190  1  1   35   62   104 0.069   2.09
 2.06 Intr -  33443  33278  166  0  1  120   77    61 0.247   7.22
 2.05 Intr -  40497  40414   84  1  0   99   59    85 0.947   6.49
 2.04 Intr -  41255  41174   82  2  1   80   84    19 0.947  -0.09
 2.03 Intr -  41593  41466  128  2  2   90    1   131 0.586   5.00
 2.02 Intr -  42746  42679   68  1  2   74   66    54 0.927   0.35
 2.01 Init -  45644  45526  119  2  2   62   95    95 0.879   7.31
 2.00 Prom -  70522  70483   40                              -3.06

 3.00 Prom +  71108  71147   40                              -5.56
 3.01 Init +  75851  76003  153  1  0   64   -9   188 0.282   6.97
 3.02 Intr +  77358  77499  142  2  1  100   80    34 0.210   3.73
 3.03 Term +  94381  94475   95  2  2   56   48    78 0.018  -1.41
 3.04 PlyA +  96323  96328    6                               1.05

 4.02 PlyA -  97044  97039    6                               1.05
 4.01 Sngl - 101092  99998 1095  1  0   88   48   838 0.981  76.80
 4.00 Prom - 102055 102016   40                              -6.86

 5.00 Prom + 104121 104160   40                              -8.76
 5.01 Init + 106663 107020  358  0  1   99   50   544 0.342  48.97
 5.02 Term + 140252 140319   68  0  2   70   54   100 0.262   2.90
 5.03 PlyA + 140761 140766    6                               1.05

 6.00 Prom + 145924 145963   40                              -3.26
 6.01 Init + 150158 150326  169  1  1   73   81   138 0.398  11.30
 6.02 Intr + 154115 154251  137  0  2   38   51    94 0.094   0.99
 6.03 Intr + 169123 169284  162  0  0  132   64     9 0.537   3.07
 6.04 Intr + 169677 169901  225  2  0   87   34   120 0.465   4.68
 6.05 Intr + 174306 174394   89  2  2  105  108    96 0.838  12.07
 6.06 Term + 176136 176355  220  0  1   50   49   218 0.885  10.61
 6.07 PlyA + 176373 176378    6                              -3.24

 7.08 PlyA - 176481 176476    6                               1.05
 7.07 Term - 177916 177863   54  1  0   88   43    34 0.589  -3.54
 7.06 Intr - 179044 178921  124  0  1   46   59    76 0.745   1.09
 7.05 Intr - 180213 180067  147  2  0   51   75    57 0.410   0.05
 7.04 Intr - 180871 180690  182  1  2   44  110   107 0.939   7.17
 7.03 Intr - 188191 188063  129  1  0  103   66    12 0.222   1.39
 7.02 Intr - 189294 189258   37  2  1   98   98     2 0.833   0.46
 7.01 Init - 190399 190266  134  1  2   67   53   109 0.601   4.92
 7.00 Prom - 194177 194138   40                              -2.66

 8.03 PlyA - 194339 194334    6                               1.05
 8.02 Term - 195125 195019  107  0  2  113   42    85 0.547   4.97
 8.01 Intr - 200844 200813   32  2  2   87   86    69 0.869   4.37

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:26407664_26608755|GENSCAN_predicted_peptide_1|100_aa
SEEEVVEGEKEVEALKKSADWVSDWSSRPENIPPKEFHFRHPKRSVSLSMRKSGAMKKGG
IFSAEFLKVFIPSLFLSHVLALGLGIYIGKRLSTPSASTY

>gi568815590r:26407664_26608755|GENSCAN_predicted_CDS_1|303_bp
tcagaagaagaagttgtagaaggagagaaggaagtcgaggctttgaagaaaagtgcggac
tgggtatcagactggtccagtagacccgaaaacattccacccaaggagttccacttcaga
caccctaaacgttctgtgtctttaagcatgaggaaaagtggagccatgaagaaagggggt
attttctccgcagaatttctgaaggtgttcattccatctctcttcctttctcatgttttg
gctttggggctaggcatctatattggaaagcgactgagcacaccctctgccagcacctac
tga

>gi568815590r:26407664_26608755|GENSCAN_predicted_peptide_2|299_aa
MTERPLHGGDLGASHCRQHAAHLKEAGLDLLAEVPSDCNLDCPTGICGQVNAQKGYLDAS
HIANRSSPPLGDAPPGNPVAGDARSSPWYVVTVRGPLRVRNPDPGREQRASGDAVTRQGQ
TQSFFPTQGRGQALAAVKLECVCESPEDLPTTQMRIHLVEAATTMELATSLHCAMQSRRQ
RREVGGKRHERHRASRQHCAKEELCRTQGPTARPLRNEAKESTRSRCRHQHSSAWSREEI
RAATPASRHQPPASRHQPPLSSDQYQQSLKTRWMTRKFSAGHQQAPNQAHPERCGLKNQ

>gi568815590r:26407664_26608755|GENSCAN_predicted_CDS_2|900_bp
atgactgagaggccgctccatggaggagacctgggggcctcccactgccgccagcatgct
gcccacctgaaggaggctggcttggaccttctggctgaggtgccatctgactgcaaccta
gattgccccactggcatatgtggacaagtcaacgctcagaaaggttacctcgacgcgtca
cacatagccaatcgctcctctcctccgctcggggacgctcccccggggaatcctgtggct
ggagatgctcggtcgagtccctggtatgtggtcaccgtgcgggggccgctgcgggtgcgg
aatccagacccggggagggagcagcgcgcgtcgggagacgcagtgactcgccagggtcag
acccagagcttcttccccacgcaaggccgagggcaggcccttgccgcagtcaaactcgaa
tgcgtttgtgaatcacctgaggaccttcctacaacacagatgcgaattcacttggtcgag
gccgctacaactatggaattggcaacctcgctgcactgtgcaatgcagagcaggaggcag
aggagggaagttggtgggaaacggcatgaaagacaccgggcgagcagacagcactgtgcg
aaggaggagctgtgccggacccagggccccactgccaggccactgcgaaatgaggcaaag
gaaagcacgaggagccgctgccgccaccagcacagcagtgcctggagccgggaagagatc
cgggccgccacccctgcatcccgccaccagccgcctgcatcccgccaccagccgcctctg
tcctcggaccaataccaacaatctcttaaaacacggtggatgactaggaagttctcggca
ggacaccagcaagcccccaatcaggcacatccagagagatgtgggctcaagaaccagtaa

>gi568815590r:26407664_26608755|GENSCAN_predicted_peptide_3|129_aa
MEMDVELALEANTATLVALAQTEQPMRSGGMAGVAIVSLSTEAAAFVSPVLPCEDVPASS
LPFTMIVSFLRSPQPCFLYNLQNCEPIKSLFFINYPVSVGGGHFGKKPQLLLLLLKAVQT
INGQLQLLS

>gi568815590r:26407664_26608755|GENSCAN_predicted_CDS_3|390_bp
atggaaatggatgtggagctggcactggaagcaaacacagcaactctagtggctctggcc
cagactgaacagccgatgagaagtggggggatggctggagtggccatcgtgtccctgtcc
acggaagcagcagcatttgtgtctcctgtcctgccatgtgaagatgtgcctgcttcctct
ttgcctttcaccatgattgtaagtttcctgaggtctccccagccatgcttcctgtacaac
ctgcagaactgcgagccaattaaatctcttttctttataaattacccagtctcagtggga
ggagggcactttgggaaaaagcctcagctgctgcttctgctgcttaaagcagtccagacg
attaatgggcagctccagctgctatcctga

>gi568815590r:26407664_26608755|GENSCAN_predicted_peptide_4|364_aa
MALALLEDWCRIMSVDEQKSLMVTGIPADFEEAEIQEVLQETLKSLGRYRLLGKIFRKQE
NANAVLLELLEDTDVSAIPSEVQGKGGVWKVIFKTPNQDTEFLERLNLFLEKEGQTVSGM
FRALGQEGVSPATVPCISPELLAHLLGQAMAHAPQPLLPMRYRKLRVFSGSAVPAPEEES
FEVWLEQATEIVKEWPVTEAEKKRWLAESLRGPALDLMHIVQADNPSISVEECLEAFKQV
FGSLESRRTAQVRYLKTYQEEGEKVSAYVLRLETLLRRAVEKRAIPRRIADQVRLEQVMA
GATLNQMLWCRLRELKDQGPPPSFLELMKVIREEEEEEASFENESIEEPEERDGYGRWNH
EGDD

>gi568815590r:26407664_26608755|GENSCAN_predicted_CDS_4|1095_bp
atggcgctggcactgttagaggactggtgcaggataatgagtgtggatgagcagaagtca
ctgatggttacggggataccggcggactttgaggaggctgagattcaggaggtccttcag
gagactttaaagtctctgggcaggtatagactgcttggcaagatattccggaagcaggag
aatgccaatgctgtcttactagagcttctggaagatactgatgtctcggccattcccagt
gaggtccagggaaaggggggtgtctggaaggtgatctttaagacccctaatcaggacact
gagtttcttgaaagattgaacctgtttctagaaaaagaggggcagacggtctcgggtatg
tttcgagccctggggcaggagggcgtgtctccagccacagtgccctgcatctcaccagaa
ttactggcccatttgttgggacaggcaatggcacatgcgcctcagcccctgctacccatg
agataccggaaactgcgagtattctcagggagtgctgtcccagccccagaggaagagtcc
tttgaggtctggttggaacaggccacggagatagtcaaagagtggccagtaacagaggca
gaaaagaaaaggtggctggcggaaagcctgcggggccctgccctggacctcatgcacata
gtgcaggcagacaacccgtccatcagtgtagaagagtgtttggaggcctttaagcaagtg
tttgggagcctagagagccgcaggacagcccaggtgaggtatctgaagacctatcaggag
gaaggagagaaggtctcagcctatgtgttacggctagaaaccctgctccggagagcggtg
gagaaacgcgccatccctcggcgtattgcggaccaggtccgcctggagcaggtcatggct
ggggccactcttaaccagatgctgtggtgccggcttagggagctgaaggatcagggcccg
ccccccagcttccttgagctaatgaaggtaatacgggaagaagaggaggaagaggcctcc
tttgagaatgagagtatcgaagagccagaggaacgagatggctatggccgctggaatcat
gagggagacgactga

>gi568815590r:26407664_26608755|GENSCAN_predicted_peptide_5|141_aa
MAERKQSGKAAEDEEVPAFFKNLGSGSPKPRQKFCGMFCPVEGSSENKTIDFDSLSVGRG
SGQVVAQQRDVAHLGPDPQPPYSRQGRRAGGEPSVESGRKVEIRRASGKEALQNINDQVD
PLSKDDELETDKQEEKEVSLA

>gi568815590r:26407664_26608755|GENSCAN_predicted_CDS_5|426_bp
atggccgagagaaagcaatccgggaaggcggcagaggacgaagaggtccctgcttttttt
aaaaacctgggctccggcagccccaagccccggcagaaattctgtggcatgttctgcccg
gtggaagggtcctcggagaacaagaccatcgacttcgactcgctgtcggtgggccggggc
tcggggcaggtggtggctcagcagcgggacgtcgcccacttgggcccggacccgcagccg
ccgtactcgcggcagggccggcgcgccggcggagagccatctgttgaatcgggccggaag
gtggagatccggagggcctcgggcaaagaagccctgcagaacatcaacgaccaggtcgac
cctctatccaaggatgatgagttggaaacagataagcaggaggagaaagaagtctccctg
gcatga

>gi568815590r:26407664_26608755|GENSCAN_predicted_peptide_6|333_aa
MRYHYTPVRMANIHPGFSCVVASHFILTSNGEFLSLYILTPNACEDVEQQELSFIAVHSR
VRAPMKIYWLADLTGGGAQAVMLAHLLLTSYCEAQFLIGHGRLISSSPPHGAVWVPSTGT
VNPQDPGALGSQQLPPTALQAPAKCAAPPRTPQGAESSAAPGRGPGEFSASNGGGGGPGS
PPGLRGAGRRCSRALAASEAIGSAQSPHAGLPRRPPRIGAQAGRDRGARRRSDRLLIKGG
KIVNDDQSFYADIYMEDGLIKQIGENLIVPGGVKTIEAHSRMVIPGGIDVHTRFQMPDQG
MTSADDFFQGTKAALAGGTTMISKKLKNHHCSA

>gi568815590r:26407664_26608755|GENSCAN_predicted_CDS_6|1002_bp
atgagataccactacacacctgttaggatggccaacattcatcctggattttcatgtgta
gtggcatctcatttcatcctcaccagcaatggagagttcctgtcgctctacatcctaaca
ccaaatgcttgtgaggatgtggagcaacaggaactctcattcattgctgttcacagtagg
gttcgagctcctatgaagatctactggctggctgatctgacaggaggtggagctcaggcg
gtaatgctggctcacttgctgctcacctcctactgtgaggcccagttcctaataggccat
ggacggctcatctcctcctctccgccccacggtgcggtgtgggtcccatccactggcact
gtaaaccctcaggaccccggggcgctgggatcgcagcagctgcccccgacagcgctgcag
gcaccagcgaaatgcgctgcgcccccacgcacccctcagggagccgagtcctcagccgcg
ccagggcgggggccaggcgagttcagcgccagcaacggcgggggaggagggcccgggagc
cctcccgggctgcgcggcgccggccggcggtgcagtcgcgcgctcgccgccagcgaagcc
attggctcggcgcagtcaccccacgcggggctgccgcggcggcctccgcggattggcgcg
caggcgggcagggaccggggcgcgcgcaggcgaagcgatcgtcttctgatcaaaggaggt
aaaattgttaatgatgaccagtcgttctatgcagacatatacatggaagatgggttgatc
aagcaaataggagaaaatctgattgtgccaggaggagtgaagaccatcgaggcccactcc
cggatggtgatccccggaggaattgacgtccacactcgtttccagatgcctgatcaggga
atgacgtctgctgatgatttcttccaaggaaccaaggcggccctggctgggggaaccact
atgatcagtaagaagcttaaaaatcatcattgtagtgcttag

>gi568815590r:26407664_26608755|GENSCAN_predicted_peptide_7|268_aa
MLIQVHLTIQKTRTKPWGTEPELISSRVDPGTKVSQLHEDASSRRMDRSPVPEGKLKFCW
VHTIPWFDTTKEEGATSHTRLERRELCPLPGLSHAALLGMREPGVQVSRNDIQSSGSQAL
EIQSVQIRHAPLKVMLSKSSPSTRFEYRVTDPTSRFATLTRGNCSQLLPRNSEILNSNDH
EGPGKSVGFSPSSNAAAPATPQPVWMPPAGCCTQRGMSLVPKQKEPIWMNGKKATASEGE
DEREIRGGGKKVCMKSPPDIPGKWHPLG

>gi568815590r:26407664_26608755|GENSCAN_predicted_CDS_7|807_bp
atgctgatccaggtgcatctgacgattcagaaaaccaggaccaagccgtggggcaccgag
cctgagctaataagcagcagagtcgaccctggcacgaaggtctcccagctccatgaagat
gcatcatcaagaaggatggataggagccctgtacctgagggcaaactcaagttctgctgg
gtccatactatcccttggtttgacactacaaaagaggaaggagccaccagccacacccgg
ctggagagaagagagctctgccctctgccaggcctctcccacgctgcactgctgggcatg
agagagcctggagttcaggtgagcagaaatgacattcagagcagtggttcccaagccctg
gagatccagagtgtgcagatccggcatgcacccctgaaagtcatgctttcaaaaagctcc
ccaagcaccagattcgaatacagagtcacagaccccaccagcagatttgctactttaacc
agaggtaactgttcccagctgcttcctagaaactctgagatcctgaactcaaatgaccat
gaaggccctgggaagtccgtagggttttccccaagcagcaatgcggccgcccccgcaacc
cctcagccggtctggatgccgcctgcagggtgctgcactcagagggggatgtccctcgtc
cctaaacagaaggaacccatctggatgaatggaaaaaaggccacagcctctgagggggaa
gacgaaagagaaattcggggtggtggcaagaaggtgtgcatgaaatctccccccgacatc
cctggaaaatggcacccactaggttaa

>gi568815590r:26407664_26608755|GENSCAN_predicted_peptide_8|46_aa
XYFLFVLKNDVGLPNRQEVFCKEDLICTSNAQISTCQMANKCQLNG

>gi568815590r:26407664_26608755|GENSCAN_predicted_CDS_8|141_bp
nngtattttctcttcgtcctcaagaatgatgtgggtcttcccaatagacaagaagttttt
tgcaaagaggaccttatctgtacttctaatgcccagatcagcacctgccagatggccaat
aaatgtcagctgaatggatga