GENSCAN 1.0	Date run: 4-Nov-116	Time: 07:06:46

Sequence gi568815591f:120851164_121074841 : 223678 bp : 37.06% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6375   6673  299  2  2   59    6   303 0.425  14.19
 1.02 Term +   8235   8349  115  0  1   60   42   111 0.220   0.76
 1.03 PlyA +   8483   8488    6                               1.05

 2.04 PlyA -   9067   9062    6                               1.05
 2.03 Term -  17015  16857  159  2  0   58   45   141 0.364   3.76
 2.02 Intr -  38615  38425  191  0  2  110   75    13 0.181   0.58
 2.01 Init -  42269  41867  403  2  1   53   57   227 0.329  12.94
 2.00 Prom -  42740  42701   40                              -9.05

 3.00 Prom +  43428  43467   40                              -9.85
 3.01 Init +  43746  43881  136  2  1   53   57   128 0.431   6.55
 3.02 Term +  50400  50884  485  1  2   82   48   341 0.974  23.42
 3.03 PlyA +  50960  50965    6                               1.05

 4.00 Prom +  51994  52033   40                              -6.95
 4.01 Sngl +  52830  53309  480  2  0   15   41   263 0.566  10.13
 4.02 PlyA +  53396  53401    6                               1.05

 5.00 Prom +  56453  56492   40                              -3.15
 5.01 Init +  84295  84418  124  0  1   68   67    80 0.414   4.28
 5.02 Intr + 100001 100072   72  0  0  101   89   214 0.994  21.26
 5.03 Intr + 102141 102241  101  1  2   91   53    67 0.989   2.41
 5.04 Intr + 104396 104461   66  1  0   48  111    97 0.979   6.18
 5.05 Intr + 113579 113675   97  1  1   66   88    65 0.888   2.96
 5.06 Intr + 115463 115534   72  0  0  104  106    44 0.987   6.26
 5.07 Intr + 116366 116485  120  0  0   48   97    96 0.976   6.05
 5.08 Intr + 116771 116928  158  0  2   86   98   147 0.991  14.31
 5.09 Intr + 117848 118041  194  1  2   26  108   239 0.989  17.07
 5.10 Intr + 119525 119717  193  2  1   55   98   230 0.994  19.27
 5.11 Intr + 138435 138707  273  2  0   51  116   148 0.573  10.71
 5.12 Term + 139746 139760   15  2  0  103   53    16 0.720  -3.14
 5.13 PlyA + 139914 139919    6                               1.05

 6.03 PlyA - 140398 140393    6                               1.05
 6.02 Term - 144835 144697  139  0  1  -30   42   467 0.911  26.75
 6.01 Init - 147268 147228   41  1  2   42  110   -12 0.370  -3.89
 6.00 Prom - 147629 147590   40                              -1.35

 7.00 Prom + 150835 150874   40                              -6.15
 7.01 Init + 158604 158672   69  2  0   71   30    74 0.535   1.00
 7.02 Intr + 164502 164685  184  2  1   85   77   134 0.511  10.44
 7.03 Intr + 195724 195830  107  2  2  101   94    19 0.088   2.81
 7.04 Term + 199745 200044  300  1  0   73   37   473 0.998  34.94
 7.05 PlyA + 200818 200823    6                               1.05

 8.04 PlyA - 201568 201563    6                               1.05
 8.03 Term - 209755 209547  209  2  2   56   43   176 0.088   6.42
 8.02 Intr - 209857 209835   23  0  2   43  101    42 0.018  -2.73
 8.01 Init - 217028 216610  419  2  2   71   48   174 0.043   7.65
 8.00 Prom - 217082 217043   40                              -6.35

 9.03 PlyA - 217623 217618    6                               1.05
 9.02 Term - 222951 222764  188  1  2   48   32   172 0.760   4.27
 9.01 Init - 223645 223561   85  1  1   60  100    50 0.596   4.43

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 217028 216603  426  2  0   71   44   172 0.845   7.14

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:120851164_121074841|GENSCAN_predicted_peptide_1|137_aa
MQLCRRRVSHSNPASPARDAPATPRSKPPAEHKASAATDPAIASARPRQGARGSAGGREP
QSQPLSSLAHSAEGSRRRAGGTAGRKGDRRVATYRRRWARVHAYFKQRTAETTSHPNAVG
AIHSPERYKHPRIERSP

>gi568815591f:120851164_121074841|GENSCAN_predicted_CDS_1|414_bp
atgcagctgtgcagacgccgggttagtcattcaaaccccgcttcccccgcccgggacgct
cccgcgacccctcgctcaaagccgcccgcggagcacaaagcgagcgcggccaccgaccca
gccatcgcgtccgcgcgcccccggcagggcgcgaggggaagcgccggcggcagggagccc
cagagccagcccttaagtagtttagcacacagcgcggagggcagccgccggcgggctggc
gggacagcgggcaggaaaggcgacagacgcgtggccacttaccgtcggcgctgggcccgg
gtgcatgcctattttaagcaaaggactgctgaaacaaccagccatccaaatgctgttggt
gccatacattccccggagaggtataaacatccccgcattgaaaggagtccatga

>gi568815591f:120851164_121074841|GENSCAN_predicted_peptide_2|250_aa
MSAGTKIMHFLKNTFLSCIENAALMWVQDFYRKGIPIDSNMIREKAKSLYDHLKQKEGER
SKAGEFSASKGWFDAFNKNVKITEAASADQEAIHEFPDTIKKMIKGKIYLPEEVLMKMKV
CPSLEKNMSLRTFINPMPSVTESFQSSFSTDPSDLSPPPQTAPRQAELGPNSSSASAPPY
NLFITFPPHTWSSLQFHSHESFGQSHALGKGDWGKGDWARVTRTQYSPCTTLRDWMSLKE
EEKGDKMSLA

>gi568815591f:120851164_121074841|GENSCAN_predicted_CDS_2|753_bp
atgtcagcaggcacgaaaatcatgcattttttgaaaaatacctttttatcttgcattgaa
aatgcagctcttatgtgggtccaggatttctataggaaaggcatacctatagactctaat
atgattcgagaaaaagcaaagtcattatatgaccacttgaagcaaaaagaaggtgaaaga
tctaaagctggagaatttagtgccagcaaaggatggtttgatgcttttaataaaaatgtc
aagataacagaagcagcttctgctgaccaagaggcaatacatgagtttccagacactatt
aagaaaatgattaaggggaaaatatatctgcctgaagaggttttaatgaagatgaaagtg
tgccctagtctagaaaaaaacatgtcactaaggacgtttattaacccaatgccaagcgtc
actgagtctttccaatcttccttttctacagacccatctgacctctcccctcctccccag
actgctcctcgccaggcagagctaggtcccaattcttcctcagcctccgctccaccctat
aatctttttatcaccttccctcctcacacctggtccagtttacagtttcattcccatgaa
tcctttggtcaatctcatgcactgggcaagggtgactggggcaaaggtgactgggcaagg
gtgactaggacccagtattcaccctgtaccacactcagggactggatgagtctgaaagag
gaagagaaaggtgataagatgtcactggcatga

>gi568815591f:120851164_121074841|GENSCAN_predicted_peptide_3|206_aa
MWEILELPRDLLNGFAQNADSNMDNKVQAEVALDGNGELVGYYSKASAGDTQANRVWSGP
PANSNRPAAEGPERNTININKKDINTKTPSVGHRHQRPKVDKTTKTRRNKSRKAENSKNQ
SASYLPKDCSSSSAMKQSWMENDFDELTEGAFRRSVITNFSELKEDVQTHHKEAKNLEKR
LDKWLTRINSVEKTLNDLMENGMGNT

>gi568815591f:120851164_121074841|GENSCAN_predicted_CDS_3|621_bp
atgtgggaaattttggaacttcctagggacttgttgaatggctttgcccaaaatgctgat
agcaatatggacaataaagtccaggctgaggtggccttggatggaaatggggaacttgtt
gggtactacagcaaagcctccgctggtgatacccaggcaaacagggtctggagtggacct
ccagcaaactccaacagacctgcagctgagggacctgaaaggaataccatcaacatcaac
aaaaaggacatcaacaccaaaaccccatctgtaggtcaccgacatcaaagaccaaaggta
gataaaaccacaaagacaaggagaaacaagagcagaaaagctgaaaattctaaaaaccag
agtgcctcttatcttccaaaggactgcagctcttcgtcagcgatgaaacaaagctggatg
gagaatgactttgacgagttgacagaaggagccttcagaaggtcggtaataacaaacttc
tctgagctaaaggaggatgttcaaacccatcacaaggaagctaaaaaccttgaaaaaaga
ttagataaatggctaactagaataaacagtgtagagaagaccttaaatgacctgatggag
aatggcatgggaaatacgtga

>gi568815591f:120851164_121074841|GENSCAN_predicted_peptide_4|159_aa
MQFKNDKGAITTDPTKIQTTTREYYKHLYANKLENLEEMDKFLDTYTIPRLNQEEAESLN
RPKTGSEIEAIIESLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFY
EASIILIPKPGRDTTKKENFRPISLMNIDAKILNKIPAN

>gi568815591f:120851164_121074841|GENSCAN_predicted_CDS_4|480_bp
atgcaatttaaaaatgataaaggggctatcaccactgatcccacaaaaatacaaactacc
accagagaatactataaacacctctatgcaaataaactagaaaatctagaagaaatggat
aaattcctggacacatacaccatcccaagactaaaccaggaagaagctgaatctctgaat
agaccaaaaacaggttctgaaattgaggcaataattgagagcctaccaaccaaaaaaagt
ccaggaccagatggcttcacagccgaattctaccagaggtacaaggaggaactggtacca
ttccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattttat
gaggccagcatcatcctgataccaaagcctggcagagacacaacaaagaaagagaatttt
agaccaatatccctcatgaacatcgatgcaaaaatcctcaataaaataccagcaaactga

>gi568815591f:120851164_121074841|GENSCAN_predicted_peptide_5|494_aa
MHLLSDFTAEESVRNHDGVVHDGAYGLFEIILFGNKIQESTVIEQLPMDLRDRFTEMREM
DLQVQNAMDQLEQRVSEFFMNAKKNKPEWREEQMASIKKDYYKALEDADEKVQLANQIYD
LVDRHLRKLDQELAKFKMELEADNAGITEILERRSLELDTPSQPVNNHHAHSHTPVEKRK
YNPTSHHTTTDHIPEKKFKSEALLSTLTSDASKENTLGCRNNNSTASSNNAYNVNSSQPL
GSYNIGSLSSGTGAGAITMAAAQAVQATAQMKEGRRTSSLKASYEAFKNNDFQLGKEFSM
ARETVGYSSSSALMTTLTQNASSSAADSRSGRKSKNNNKSSSQQSSSSSSSSSLSSCSSS
STVVQEISQQTTVVPESDSNSQVDWTYDPNEPRYCICNQGPSEAELVMVCRPVFPCRRRF
CPRPFLVGLVVAICLFYQTLTLRGSRKLTAAAPGAVPHTSTETQASRCKKGFSQDKQCFL
LSGNAQETRKTTAL

>gi568815591f:120851164_121074841|GENSCAN_predicted_CDS_5|1485_bp
atgcaccttttatctgactttactgcagaagaaagtgtcaggaatcatgatggagtagtc
catgacggagcctatggcttatttgaaattatactattcggcaataaaatccaagagagt
acagtgattgagcagcttcctatggatctgcgggaccgcttcacggaaatgcgcgagatg
gacctgcaggtgcagaatgcaatggatcaactagaacaaagagtcagtgaattctttatg
aatgcaaagaaaaataaacctgagtggagggaagagcaaatggcatccatcaaaaaagac
tactataaagctttggaagatgcagatgagaaggttcagttggcaaaccagatatatgac
ttggtagatcgacacttgagaaagctggatcaggaactggctaagtttaaaatggagctg
gaagctgataatgctggaattacagaaatattagagaggcgatctttggaattagacact
ccttcacagccagtgaacaatcaccatgctcattcacatactccagtggaaaaaaggaaa
tataatccaacttctcaccatacgacaacagatcatattcctgaaaagaaatttaaatct
gaagctcttctatccacccttacgtcagatgcctctaaggaaaatacactaggttgtcga
aataataattccacagcctcttctaacaatgcctacaatgtgaattcctcccaacctctg
ggatcctataacattggctcgttatcttcaggaactggtgcaggggcaattaccatggca
gctgctcaagcagttcaggctacagctcagatgaaggagggacgaagaacatcaagttta
aaagccagttatgaagcatttaagaataatgactttcagttgggaaaagaattttcaatg
gccagggaaacagttggctattcatcatcttcggcacttatgacaacattaacacagaat
gccagttcatcagcagccgactcacggagtggtcgaaagagcaaaaacaacaacaagtct
tcaagccagcagtcatcatcttcctcctcctcttcttccttatcatcgtgttcttcatca
tcaactgttgtacaagaaatctctcaacaaacaactgtagtgccagaatctgattcaaat
agtcaggttgattggacttacgacccaaatgaacctcgatactgcatttgtaatcagggg
ccaagtgaagctgaactggtcatggtctgtcgcccagtgttcccttgtcgtcggcgattt
tgcccccgacccttcttggtgggcttagtggtggcaatctgtctcttctaccagactctg
accctccgagggtcgaggaagctcacagccgctgcccctggggctgtcccacacacatcc
actgaaacccaggcaagcagatgcaagaaaggattctctcaggacaaacagtgcttcctt
ctctctggtaatgcccaggaaaccagaaagactactgccctgtga

>gi568815591f:120851164_121074841|GENSCAN_predicted_peptide_6|59_aa
MLAWGVGDNISIFRVIEEEEEEEGGGGGGGGEEGGGEGGGEGEEEEEEEEEEEEEEEEE

>gi568815591f:120851164_121074841|GENSCAN_predicted_CDS_6|180_bp
atgttagcctggggagttggtgataatatatccatattcagagttattgaagaagaagaa
gaagaagaaggaggaggaggaggaggaggaggagaagaaggaggaggagaaggaggagga
gaaggagaagaagaagaagaggaagaagaagaagaagaagaagaagaggaagaagaataa

>gi568815591f:120851164_121074841|GENSCAN_predicted_peptide_7|219_aa
MSKIWPLEQVFETKWRRVTGDEEVKESMETHFGSHGRRAILYRPPFYSKTELQLHQHILT
QHGYTVVIAEERLNAGLGPGLLEQGDLGSWDLLICLSSKKAEGTPCISKEVMCQLGLHQK
DAQEEGHEGTIKEEPKRRLAQLSAKPAPAKVEAKLKKAAAKDKSSGKNVQTKGKRRAKGK
QAKVANQETKEDLPAENGEMKTEESPASDEAGEKETKSD

>gi568815591f:120851164_121074841|GENSCAN_predicted_CDS_7|660_bp
atgtctaagatatggccattggagcaggtgtttgaaacgaagtggaggagagtcactgga
gatgaagaggtcaaagaatcaatggagacacactttggcagccatggccgaagggccata
ctctacaggcctcctttctacagcaaaacagagcttcagctacaccagcacattctgact
caacatggctatacggttgtcatcgctgaagaaaggctcaatgctggcctagggccgggg
ctactagaacaaggtgatctgggctcttgggatctgctcatttgcctgtcttctaagaaa
gcagaaggaacaccctgtatatccaaggaagtcatgtgccagttaggtttacatcaaaag
gatgcccaagaagaaggtcatgaagggaccatcaaggaagagcccaagaggagattggcg
cagttgtcagctaaacctgctcctgcaaaagtggaagcgaagctgaaaaaggcagcagcg
aaggataaatcttcaggcaaaaatgtgcaaacaaaagggaaaaggagagcaaagggaaaa
caggccaaagtggcaaaccaagaaactaaagaagatttacctgcagaaaacggggaaatg
aaaactgaggagagtccagcctctgatgaagcaggagagaaagaaaccaagtctgattaa

>gi568815591f:120851164_121074841|GENSCAN_predicted_peptide_8|216_aa
MGKDFLSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRRPTEWEKIFAIYLSDKRL
TSRIYKELKQIYKKKTNNPIEKWAKDMNRHFSKEDIYAAKTHMKKCSSSLAIREMQIKTT
MRYHLTPVRMAIIKKSGNNRSEAADLRGVKLQIFAVSVTAHKGSVDPKSEQQQDLLQSAK
EQSFHNVKRDRSGLPLLAGTACFYSLIWPHPHPADW

>gi568815591f:120851164_121074841|GENSCAN_predicted_CDS_8|651_bp
atgggcaaggacttcttgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgtacagcaaaagaaactaccatcagagtg
aacaggcgacctacagaatgggagaaaatttttgcaatctacttatctgacaaaaggcta
acatccagaatctacaaagaactcaaacaaatttacaagaaaaaaacaaacaaccccatc
gaaaagtgggcaaaggatatgaacagacacttctcaaaagaagacatttatgcagccaaa
acacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca
atgagataccatctcacaccagttagaatggccatcattaaaaagtcaggaaacaacagg
agtgaagctgcagaccttcgcggagtgaagctgcagatcttcgcagtgagtgttacagct
cataaaggcagtgtggacccaaagagtgagcagcagcaagatttattgcaaagcgcaaaa
gaacaaagcttccacaacgtgaaaagggaccggagcgggttgccactgctggctgggaca
gcctgcttttattctcttatctggccccacccacatcctgctgattggtag

>gi568815591f:120851164_121074841|GENSCAN_predicted_peptide_9|90_aa
MPTGSNVIQEKAKSLYDNLKQKKGKGSKASKSGGNRIFKAHYTQYSLERIVRAMEESSNR
ENIMKIWKDYTTEDATVAIEKAMKTISPKE

>gi568815591f:120851164_121074841|GENSCAN_predicted_CDS_9|273_bp
atgcctacaggctctaatgtgattcaagagaaagcaaagtcattatatgacaacttaaag
caaaagaaaggtaaaggatctaaagcctctaaatcagggggtaataggatctttaaggct
cattatacacagtactctttggaaaggattgtcagagctatggaagagagctccaacaga
gagaacatcatgaaaatctggaaggattacaccactgaagatgccactgttgctatagaa
aaagccatgaaaaccatcagcccaaaagaataa