GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:54:50 Sequence gi568815574r:18671938_18873618 : 201681 bp : 37.88% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1055 1338 284 1 2 84 -7 198 0.377 6.46 1.02 Intr + 1658 1805 148 2 1 58 51 60 0.282 -1.38 1.03 Intr + 2280 2378 99 2 0 105 86 73 0.948 8.19 1.04 Term + 3433 3576 144 0 0 80 49 90 0.851 1.23 1.05 PlyA + 3988 3993 6 1.05 2.00 Prom + 24348 24387 40 -5.05 2.01 Init + 26938 27083 146 0 2 87 82 120 0.912 10.94 2.02 Intr + 36751 36909 159 1 0 80 59 103 0.134 4.78 2.03 Intr + 50563 50610 48 1 0 62 127 71 0.353 5.28 2.04 Intr + 53716 53866 151 1 1 64 36 86 0.329 0.24 2.05 Intr + 60866 61054 189 1 0 42 108 186 0.396 14.86 2.06 Intr + 63730 63760 31 0 1 27 80 40 0.335 -6.11 2.07 Term + 63930 64405 476 1 2 109 43 194 0.836 11.06 2.08 PlyA + 64445 64450 6 -0.45 3.00 Prom + 67080 67119 40 -4.95 3.01 Init + 73204 73413 210 0 0 82 49 242 0.898 18.53 3.02 Intr + 74545 74744 200 0 2 57 69 110 0.735 3.23 3.03 Term + 77401 77878 478 1 1 -21 38 266 0.101 4.03 3.04 PlyA + 78383 78388 6 1.05 4.00 Prom + 85199 85238 40 -5.95 4.01 Init + 89284 89420 137 0 2 67 80 94 0.533 6.16 4.02 Intr + 90678 90893 216 0 0 34 69 176 0.699 7.00 4.03 Term + 97607 97799 193 2 1 18 37 174 0.052 1.21 4.04 PlyA + 98434 98439 6 1.05 5.03 PlyA - 98989 98984 6 1.05 5.02 Term - 100690 99998 693 1 0 88 39 363 0.975 23.98 5.01 Init - 101681 101169 513 2 0 50 63 264 0.986 15.05 5.00 Prom - 106570 106531 40 -5.15 6.00 Prom + 108595 108634 40 -3.65 6.01 Init + 120700 120860 161 0 2 62 79 167 0.056 10.63 6.02 Intr + 130967 131095 129 2 0 40 67 133 0.011 5.29 6.03 Intr + 156403 156522 120 1 0 103 101 18 0.209 3.29 6.04 Intr + 168141 168285 145 0 1 106 37 84 0.184 4.46 6.05 Intr + 169670 169853 184 1 1 63 60 98 0.090 2.94 6.06 Intr + 177770 177792 23 0 2 104 87 17 0.016 -0.46 6.07 Intr + 184462 184548 87 0 0 54 84 61 0.003 1.55 6.08 Term + 197531 197851 321 1 0 81 33 156 0.401 3.24 6.09 PlyA + 198223 198228 6 1.05 7.02 PlyA - 198307 198302 6 1.05 7.01 Term - 200891 200751 141 2 0 44 54 126 0.655 1.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 53368 53531 164 0 2 57 85 109 0.871 6.75 S.002 Intr - 126766 126473 294 2 0 60 76 120 0.857 4.06 S.003 Init - 127737 127590 148 0 1 90 109 74 0.974 10.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815574r:18671938_18873618|GENSCAN_predicted_peptide_1|224_aa MRKCSPSLTTHKGWYSKTFTSVGLNPDPPCFQENIRQCLETFLVVTIGVEDATDKIATKD TAKSTMHTTASLKNLLAPKGKSGAVKIPRISGMTHSPNTLQTCTCEMWVELPACAQQPST FALSLQIWKAVLSRSPGGARWDVRSNMLPLSDVFSQCELRKKLHKTLKSRGVLDTFKTQL QNQLIHVLMHPVLNGELHPQFMSVEGSYLLRGTSNSSVADHSQR >gi568815574r:18671938_18873618|GENSCAN_predicted_CDS_1|675_bp atgaggaaatgctcaccttccctcacaacacacaaggggtggtactcaaaaacgtttact tctgtgggtctcaatccagatccaccttgcttccaagaaaacatacggcaatgtttggag acatttttggttgtcaccattggggtcgaagatgctaccgacaaaatagcgaccaaggat acagctaaatctacaatgcacacgacagcctccctaaagaatcttctggctccaaaaggc aaaagtggcgccgttaaaataccacggattagcggaatgacccattctcccaataccctg cagacgtgcacttgcgagatgtgggtggagcttccggcgtgtgctcagcaaccaagtaca tttgccttgtctctgcagatctggaaggctgtacttagtaggtctccaggaggggcacga tgggatgtgaggtccaacatgcttcctctgtctgacgtgttcagtcaatgtgagctacgc aaaaagctacacaagaccttgaagagtcggggtgtactggacacattcaagacacaactt caaaaccagctaattcatgtgttgatgcaccctgtattgaatggagaactgcatcctcag tttatgtcagtggaagggagctacctcctacgaggcacctctaactcttcagtggctgat cactcacagagatag >gi568815574r:18671938_18873618|GENSCAN_predicted_peptide_2|399_aa MEMLKDWSFAEPNLFDVRMAAHLTADSGLILHSEGKQRRGQVTATPGGIEAGKMEMLKDW SFAEPNLLDVRMAAYLTADSGLILHGEGKQRRGQVTATPWRNFLTAARPQAMQGYKLRLR NGNFIQSKCKQQIRHAGAMKKPFSQLLNTTPGVWAKDTPPGLAVNHAQEVIAKERLMGKP FVPHADAGNVPGHLEMDEFSSGDMSHVYAYLEICLLWVLLQLLCPSHINTQETVRTVDVR AGENRLEHLYESKSYKTALYRNRDQWLSQDPKVRRGIIIKGRGRNVGFALKKGKEIRVKD KNLRHGGRICHLESVMMRVTTSHPGMSTTHLSQVKWDAVARSGIGGLKSMNMEQRLEVGH VSNKSQAESPRVLARTPADWSHIHLNNLIRAATLYFLNC >gi568815574r:18671938_18873618|GENSCAN_predicted_CDS_2|1200_bp atggagatgttgaaggactggagctttgcagagcctaacttatttgatgtgagaatggca gcccatttaactgcagattctgggcttatcttgcatagtgaagggaagcagaggagaggg caagtgactgctacaccaggaggaattgaggctggaaagatggagatgttgaaggactgg agctttgcagagcctaacttacttgatgtgagaatggcagcctatttaactgcagattct gggcttatcttgcatggtgaagggaagcagaggagagggcaagtaactgctacaccatgg agaaattttttaacagctgcaaggccacaagctatgcaaggctacaagttacggctaaga aatggaaactttatacaaagcaagtgcaagcagcagattagacatgcaggtgcaatgaaa aaaccattctctcagttacttaataccactcctggagtatgggctaaagacaccccacct gggttagctgtaaatcatgcacaggaggttattgctaaggaacgactcatgggaaaacca tttgtgcctcatgcagatgcaggaaacgtgcccgggcatcttgaaatggatgagttctct tctggagatatgtctcatgtgtacgcatatctggagatatgcctcttgtgggtgctgctg cagctgctgtgcccatctcatatcaatacccaagaaactgtcagaactgtggatgtgagg gcaggggagaacagacttgagcacctttatgagagtaagagttataaaactgctctgtac agaaatagggatcagtggctcagtcaggatcctaaagttaggagagggattataatcaaa ggcagaggcagaaatgtaggctttgctttgaaaaaggggaaggagatccgtgtaaaggac aagaacttgcgacatggagggaggatatgtcacttggagtcggtcatgatgagggtgact acaagtcaccctgggatgagtacaactcatctcagccaagtgaagtgggatgcagtggca aggagcgggattgggggcttgaagtcgatgaacatggaacagcgacttgaagtaggtcac gtgtcaaacaaaagccaagcagagtctccccgggtcttggcaaggactcctgcagactgg tctcacattcatttgaacaatctgatacgggcagctactctttattttctgaactgttag >gi568815574r:18671938_18873618|GENSCAN_predicted_peptide_3|295_aa MEPKGLQNNSTDSVLLNHGKVRGPRKDPSHKDGMGAGSPPRVYLVIRPRTQDLLANSNET YTPTYEEELRTHNPTWPDSKQLLLLLFNTKEHRWMAQSTLHWLEANVPFQAYAQFQFPEE DPHWDSHALTQFQHLQRLSGRSSATAHQNVSPKGRANSPKETILLAKGEMVNVYTDSKYT FATLHDNGAIYKEKGLLMAVGKKIKYKEKILQLLDAVCASKKMAVMHCRGHQKAGTLEAK KNKKIDREARGAAMTTLQFKKKAIAMPLLPKPLLSEVSSYLQMRSPALPKSLENI >gi568815574r:18671938_18873618|GENSCAN_predicted_CDS_3|888_bp atggaacctaaggggttacagaacaattccacggacagcgtgctgctgaaccatgggaag gttcggggcccaaggaaggacccatcccataaagatggaatgggagccggatcacctccc agggtgtacctagttatccgacccaggacgcaagatttgctcgctaattcaaatgaaacc tacaccccaacctatgaagaggaactgaggacacataatccaacttggccagatagtaaa cagcttctgttgctgctgttcaacactaaagagcaccgatggatggctcagtcaaccctc cactggctagaagccaatgtgccattccaggcatacgctcagttccagttcccagaggaa gacccccactgggactcacatgctttgacccagtttcagcacctgcagagactcagtggt agaagctcagccactgcccaccagaatgtcagcccaaaaggcagagctaatagccctaag gagaccattttgctagcaaaaggcgaaatggtcaatgtttatacagattccaaatatacg tttgccacgttgcatgacaatggggctatatataaagaaaaaggactcttaatggctgta ggcaaaaaaataaagtacaaagaaaaaattctgcaactcttagatgctgtatgtgcttcg aagaagatggctgttatgcactgcagggggcaccaaaaggcaggaacactggaggccaaa aagaacaaaaagatagacagagaggcaagaggggcagcaatgactacccttcagtttaag aagaaagccatagctatgcctctacttccaaagcctctcctctcggaggtttcaagttac ctccaaatgagaagccctgctttgcccaagagtctggaaaatatataa >gi568815574r:18671938_18873618|GENSCAN_predicted_peptide_4|181_aa MTDVASASRDHTGESNPGQRVTKVRVGECGTTLKDTWLLLRNPHGETSRSGKNESLPERA VAKAKGGIKDYGIPHRKLNVALLTLNFLSLSRGQMLSAADQHLQKPAAKTEAEKLVWCEC CFRKVVQLDPQCIPFHHSNQESPLQIKEGVVRIQLVLSDKDLLLKAEDEKEFLEEAGENH N >gi568815574r:18671938_18873618|GENSCAN_predicted_CDS_4|546_bp atgactgatgtggcttcggcaagtcgtgaccatacgggggaatcaaacccaggtcaaaga gtcaccaaagtgagggtgggagaatgtggaactacactgaaggacacctggctactctta agaaatccccatggtgagacaagccgtagcggaaagaatgaatctctccctgaaagagca gttgcaaaagccaaagggggaatcaaggactacgggataccacataggaaattgaatgta gcattattgactctaaattttttgagcctgtctagaggccagatgctatcagcagctgat cagcatctgcagaaaccagctgcaaagacagaagcagaaaaactggtttggtgtgaatgc tgcttccgaaaagttgtgcagctggatccccagtgtatcccatttcatcacagcaaccag gaatctccactgcaaataaaggaaggagtggtaaggatacagctagttctttcagacaag gacctgctactcaaagcagaagatgaaaaagaatttttagaagaagcaggggaaaatcat aactag >gi568815574r:18671938_18873618|GENSCAN_predicted_peptide_5|401_aa MAHVSSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESP SYTVCVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETK APYRIFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKLKFYYNPNF KRGYPQLLVRVKRRIGVKNASPISTLFNEDFNKKHFRAGANMENHNSALAAEASEESLFS ASKNLNMPLTRESSVRQIIANSSVPIRSGFPPPSPSTSVGPSEQIATDQHAILNQLTTIH MHSHSTYMQARGHIVNFITTTTSQYHIISPLQNGYFGLTVEPSAVPTRYPLVSVNEAPYR NMLPAGNPWLQMPTIADRSAAPHSRLALQPSPLDKYHPNYN >gi568815574r:18671938_18873618|GENSCAN_predicted_CDS_5|1206_bp atggcacatgtttcttcagaaactcaagatgtttcccccaaagatgaattaactgcttca gaagcctccactaggtctccattgtgtgaacacaccttccctggggactcagacttacgg tcaatgattgaagaacatgcttttcaggttttgtcacaaggatccttgttagaaagtcca agttacacagtttgtgtctctgagccagataaagatgatgattttctttctctgaacttt cccaggaaactttggaaaatagtggaaagtgaccaattcaagtctatttcatgggatgag aatggaacttgcatagtgattaatgaagaactcttcaagaaagaaattttggaaacaaag gctccttacagaatatttcaaactgatgctatcaaaagttttgttcgacagctcaacctt tatggatttagtaaaattcaacagaattttcaaagatctgcctttctagccacctttctg tcagaagagaaagaatcgtctgtcttaagcaagttaaagttctattataatccaaatttc aagcgtggctatccccaacttttagtaagagtgaagagaagaattggtgttaaaaatgct tcacctatatctactttattcaacgaagatttcaacaagaagcattttagagcaggggct aacatggagaatcataattctgccttagctgctgaagctagtgaagaaagtttattttca gcctctaaaaatttaaatatgcctctaacaagggaatcttctgtcagacagataattgca aattcatctgtccccattagaagtggtttccctcctccttcaccttcaacctcagttgga ccatcagaacaaattgcaacagatcaacatgctattttaaatcagttgaccactattcat atgcactctcatagtacctacatgcaagcaaggggccacattgtgaattttattacaacc acaacttctcaataccacatcatatctcccttacaaaatggttattttgggctgacagtg gaaccatctgctgttcccacacgatatcctctggtatcagtcaatgaggctccatatcgt aacatgctaccagcaggcaacccgtggttgcaaatgcctacgatcgctgatagatcagct gcccctcattccaggctagctcttcaaccatcaccactggacaaatatcaccctaattac aactga >gi568815574r:18671938_18873618|GENSCAN_predicted_peptide_6|389_aa MENRQAPQTLLAESLLLLLKAETPTCPMSSRSGSKDHPHLPDLRPSFRAQGPPRNILDTC FSNYVHQLLDLLLVESIGVTGSKSICVFSNVSSCLLRRSKLYHPCIAVRVALLLAESSYR IFTNTWLTATIGRKIGMCTKIRKLEADMGDMPATLESCMKTDTQHSLSDNHNKETQRQSN PLISTYKQSHWEVTAAQQSCCSQTAYLDSSSLVRASLKEWQQPQLGAYRQNFQLPETEPL GERVAVEIGLAKEKNTKFESLETKLNEYKREIEKQLLAEMCQKGIFETNDSSPSQGLTNK NPISLGQSTWGKRKPWMQLQQTSTSPPDGSEGAAHLPLQCTGSANGQSAFSSGSLTSVYP DWETPPGRGRQTPCIGEHWLASDRCPLRI >gi568815574r:18671938_18873618|GENSCAN_predicted_CDS_6|1170_bp atggaaaacaggcaggcaccccaaacactgctggctgaaagcctgctgcttctgctgaag gctgagactccaacctgtcccatgagtagccgcagtggctccaaagaccaccctcaccta cccgacctccgcccttccttcagggcccaggggcccccaagaaacattctggacacatgc ttcagcaactatgtccaccagctcctagatctgttactggtggaaagtatcggagttact ggcagcaaatccatctgcgtctttagcaatgtcagttcttgcctcctcagacgcagtaaa ttgtatcatccatgcatcgcagtcagggtagctttacttttagcagaatcttcatatagg atattcaccaatacttggttaacagccactattggaagaaaaattggaatgtgcactaag atacggaagctagaagcagacatgggggacatgcctgcaactctagaaagctgtatgaaa acagacacacaacactctctgtcagataaccacaacaaagagacacagaggcagtccaac cctctgataagcacttataaacaaagccactgggaagtcaccgcagctcagcaaagctgc tgtagccagactgcctacctagattcctcctctttggtcagagcatctctgaaagaatgg cagcagccccagttaggggcttatagacaaaatttccaactccctgaaacagagcccctg ggggaaagggtggctgtagaaattggtttagcaaaagaaaagaatacaaagtttgagtct ttagaaacaaagctaaatgaatataagagagaaatagaaaagcaacttctggcagaaatg tgtcaaaagggtattttcgaaacaaatgatagcagccccagtcaggggcttacaaataaa aaccccatctccctgggacagagcacctgggggaaaaggaagccatggatgcagcttcag caaacttcaacatccccgcctgatggctctgaaggagcagcacacctcccattacagtgc actggctctgctaatggtcagagtgccttctcaagtggatccctgacctctgtgtatcct gactgggagacacctcctggcaggggccgacagacaccttgtattggagagcactggcta gcatctgacagatgccccttgaggatatag >gi568815574r:18671938_18873618|GENSCAN_predicted_peptide_7|46_aa VEPLGGDEVMRVEPHEWDEHSYKKDSRELACSFHSVRTQREGSVYE >gi568815574r:18671938_18873618|GENSCAN_predicted_CDS_7|141_bp gtggagcctctgggaggtgatgaggtcatgagagtggagcctcatgaatgggatgagcac tcctacaaaaaggattccagagagctcgcttgctccttccacagtgtgaggacacagagg gaaggctctgtctatgaatga