GENSCAN 1.0    Date run: 5-Nov-116    Time: 22:09:16

Sequence gi568815591r:112667200_112884825 : 217626 bp : 37.20% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -    249    244    6                               1.05
 1.01 Sngl -  20049  19537  513  0  0   60   42   330 0.999  21.39
 1.00 Prom -  20561  20522   40                              -6.15

 2.03 PlyA -  20849  20844    6                               1.05
 2.02 Term -  21596  21059  538  1  1    1   48   254 0.631   5.43
 2.01 Init -  22095  21875  221  0  2   68   72   186 0.406  13.15
 2.00 Prom -  29315  29276   40                              -4.65

 3.00 Prom +  36235  36274   40                              -2.85
 3.01 Init +  40579  40662   84  0  0   36  106    61 0.351   3.57
 3.02 Intr +  41203  41430  228  0  0  -28   77   182 0.157   2.64
 3.03 Term +  45535  45663  129  0  0   21   40   150 0.300   0.70
 3.04 PlyA +  45704  45709    6                               1.05

 4.00 Prom +  47332  47371   40                              -3.95
 4.01 Init +  48394  48454   61  0  1   88  116    45 0.698   8.76
 4.02 Term +  48824  48990  167  0  2   44   54    86 0.634  -2.10
 4.03 PlyA +  49522  49527    6                               1.05

 5.02 PlyA -  50315  50310    6                               1.05
 5.01 Sngl -  51506  50694  813  2  0   43   37   252 0.878  11.12
 5.00 Prom -  64476  64437   40                              -2.75

 6.03 PlyA -  65241  65236    6                               1.05
 6.02 Term -  65990  65740  251  0  2   52   48   144 0.472   1.68
 6.01 Init -  68128  68062   67  1  1  123   75    34 0.433   6.89
 6.00 Prom -  69600  69561   40                              -3.95

 7.05 PlyA -  70365  70360    6                               1.05
 7.04 Term -  85664  85567   98  0  2   62   37   127 0.418   2.15
 7.03 Intr - 100545 100243  303  1  0  116   66   225 0.049  18.74
 7.02 Intr - 105856 105582  275  0  2   82   97   110 0.935   7.56
 7.01 Init - 108065 107977   89  2  2   64   92    50 0.486   3.17
 7.00 Prom - 111214 111175   40                              -5.65

 8.00 Prom + 114812 114851   40                              -2.85
 8.01 Init + 116700 116799  100  2  1   28   75    78 0.326   0.97
 8.02 Intr + 122673 122841  169  1  1   97   68   145 0.983  11.48
 8.03 Intr + 123005 123216  212  2  2  -30    0   208 0.520  -2.17
 8.04 Intr + 123294 123481  188  1  2   56    5   146 0.054   1.59
 8.05 Intr + 134234 134496  263  1  2   43   59   202 0.567   8.16
 8.06 Term + 135387 135579  193  0  1   39   43   162 0.053   2.71
 8.07 PlyA + 136095 136100    6                               1.05

 9.04 PlyA - 137085 137080    6                               1.05
 9.03 Term - 155174 154545  630  2  0  104   34   334 0.984  22.93
 9.02 Intr - 165476 165362  115  1  1   71   69   124 0.963   8.33
 9.01 Init - 170727 170633   95  0  2   63   95    25 0.667   0.60
 9.00 Prom - 193126 193087   40                              -3.65

10.02 PlyA - 194376 194371    6                               1.05
10.01 Sngl - 202176 201724  453  0  0   47   46   318 0.908  19.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 100545  99998  548  1  2  116   28   288 0.942  19.05
S.002 Sngl - 217574 217236  339  2  0   48   32   190 0.874   5.18

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:112667200_112884825|GENSCAN_predicted_peptide_1|170_aa
MCQKHWDTAKAVFSRKFIALNAHRNKRERSKIDTLTPQLKEQEKQEQTNSKASRKQEITK
IRAELKEIETRKTLQKISEFRSWFFEKIKKMDRPPARLMKKKKKKNQMDTVKNDKGDSTT
DATEIQTTIREYYKYLYANKLENLEEMDKFLDTYILPRLNQEKVKSLIDQ

>gi568815591r:112667200_112884825|GENSCAN_predicted_CDS_1|513_bp
atgtgccagaaacactgggacacagctaaagcagtatttagcaggaaatttatagcacta
aatgcccacaggaataagcgggaaagatctaaaattgacaccctaacaccacaattgaaa
gaacaagagaagcaagagcaaacaaattcaaaagctagcagaaaacaagaaataaccaag
atcagagcagaactgaaagagatagagacacgaaaaacccttcaaaaaatcagtgaattc
agaagctggttttttgagaagattaagaaaatggatagaccaccagccagattaatgaag
aagaaaaaaaagaagaatcaaatggacacggtaaaaaatgacaaaggggatagcaccact
gatgccacagaaatacaaactaccatcagagaatactataaatacctctatgcaaataaa
ctagaaaatcttgaagaaatggataaattcctggacacatacatcctcccaagactaaac
caggaaaaagtcaaatccctgatagaccaataa

>gi568815591r:112667200_112884825|GENSCAN_predicted_peptide_2|252_aa
MRKNQRKKDENSKNREASSPPKDHSSLPAREQNWTENEFDELAEVGFRRWVINSSELKEH
VLTQCKEAKNLEQSDRENGTKLENTLQDIIQENFPHLVRQVNIQIQETQRTPQRHSSRRG
TPRHRIIRFTKVEMNEKMLRAAREKGWVTHKGKSIRLTADFSAETLQARREWRPIFNILK
EKTFQPRISHPAKLSFISEGEIKSFTDKQMLTDFAITRPALQELLKEALDVKRKSQYQPL
QKHTKLESPSTL

>gi568815591r:112667200_112884825|GENSCAN_predicted_CDS_2|759_bp
atgaggaaaaaccagcgcaaaaaggatgaaaattccaaaaaccgggaggcttcttctcct
ccgaaggatcacagttccttaccagcaagggaacaaaactggacggagaatgagtttgat
gaattggcagaagtaggcttcagaagatgggtaataaactcttctgagctaaaggagcat
gttctaacccaatgcaaggaagctaagaaccttgaacaaagtgacagggagaatggaacc
aagttggaaaacactcttcaggatattatccaggagaatttcccccacctggtgagacaa
gtcaacattcaaattcaggaaacacagagaacaccacaaagacactcctcgagaagagga
accccaagacacagaatcatcagattcaccaaagttgaaatgaatgaaaaaatgttaagg
gcagccagagagaaaggttgggttacccacaaagggaagtccatcagactaacagcggat
ttctcagcagaaaccttacaagccagaagagagtggcggccaatattcaacattcttaaa
gaaaagacttttcaacccagaatttcacatccagccaaactaagcttcataagtgaagga
gaaataaaatcctttacagataagcaaatgctgacagattttgccatcaccaggcctgcc
ttacaagagctcctgaaggaagcactagatgtgaaaaggaaaagccagtaccagccactg
caaaaacataccaagttggaaagcccatcaacactatga

>gi568815591r:112667200_112884825|GENSCAN_predicted_peptide_3|146_aa
MALELHAENFIMKPENHRSINTAAYAYKVNHSGNLVAHKGQRSIQEAQKAHCWLIGGISA
FGIPQHKKISDEIENEPGKTLVLGLEYKDSATGTGAISRTSRHEPHGDISDCIEVVLVLK
QRPCFAVGKLAGLRWSYGTEFAVSVV

>gi568815591r:112667200_112884825|GENSCAN_predicted_CDS_3|441_bp
atggctcttgaattgcatgctgaaaacttcataatgaaaccagaaaatcacagaagcata
aacacagctgcttatgcctataaagtgaaccattcagggaatcttgtagcacacaaaggt
cagaggtcaattcaggaagctcagaaggcccactgctggctcattggtggaatttcagcc
tttgggattccacaacacaaaaagatcagtgatgaaattgaaaacgaaccaggaaagacc
cttgtgctgggactggagtacaaagattctgctactggaactggggccatttccaggaca
tccagacatgagcctcacggggatatttcagattgcattgaagtggtcctggtgctaaag
cagaggccatgctttgctgtgggaaagctggctggacttcggtggtcctatgggacagag
ttcgctgtttcagtggtgtag

>gi568815591r:112667200_112884825|GENSCAN_predicted_peptide_4|75_aa
MEDHKGEKGAGEAACDHVRKAMIPTLIIANADKQELCLSAAKAAIAAKMNLACYMPETIL
TAFQTLIHLILTTVL

>gi568815591r:112667200_112884825|GENSCAN_predicted_CDS_4|228_bp
atggaggatcataagggtgagaaaggagctggagaagcagcatgtgaccatgtcaggaaa
gccatgatccccactctgatcatcgcaaatgcagataagcaggaattatgtttgtcagca
gcaaaagcagcaatagcagccaaaatgaatttggcttgctatatgccagagactattctg
acagcatttcaaacattaattcatttaattctcaccacagtcttatga

>gi568815591r:112667200_112884825|GENSCAN_predicted_peptide_5|270_aa
MNIHAKILNEILANRIQQHIKKLIHHDQVGFIPGMQGWFNICKSLNIIQSINRTKDKNHM
IISIDAEKAFDKIQQLFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEASPLKT
GSRQGCPLSPILFNIVLEVLTRAIRQEKERKGIQLGKEEVKLSLFADDMIVNLENPIVSA
QSLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTNASKRIKYLGIQLTR
HVKDLFKENYKPLLNEIKEDTNKWKNIPCS

>gi568815591r:112667200_112884825|GENSCAN_predicted_CDS_5|813_bp
atgaacatccatgcaaaaattctcaatgaaatattggcaaaccgaatccagcagcacatc
aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaac
atatgcaaatcactaaatataatccagtctataaacagaaccaaagacaaaaaccacatg
attatctcaatagatgcagaaaaggcctttgacaaaattcaacaactcttcatgctaaaa
actctcaataaattaggtattgatgggacatatctcaaaataataagagctatttatgac
aaacccacagccaatatcatactgaatgggcaaaaactggaagcatcccctttgaaaact
ggctcaagacagggatgccctctctcaccaatcctattcaacatagtgctggaagttcta
accagggcaatcaggcaggagaaagaaagaaagggtattcaattaggaaaagaggaagtc
aaattgtctctgtttgcagatgatatgattgtaaatctagaaaaccccatcgtctcagcc
caaagtctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg
caaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagt
gaactcccattcacaaatgcttcaaagagaataaaatacctaggaatccaacttacaagg
catgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggat
acaaacaaatggaagaacattccatgctcatag

>gi568815591r:112667200_112884825|GENSCAN_predicted_peptide_6|105_aa
MECYCREMLAHSGGKKPMLMVEVLEVLAKTIRQEKEIKGIQIGKEEVKLSLFADDMIVYL
EKPIVSASNLLKLKSNFSKVSGYKINVQNHKHSYTSITDKQRAKS

>gi568815591r:112667200_112884825|GENSCAN_predicted_CDS_6|318_bp
atggagtgctattgcagggagatgctggcccacagtggaggcaagaagcccatgctcatg
gttgaagtattggaagttctagccaagacaatcaggcaagagaaagaaataaagggtatt
caaataggaaaagaggaagtcaaattgtctctgtttgcagatgacatgattgtataccta
gaaaaacccatcgtctcagcctcaaatctccttaagctgaaaagcaacttcagcaaagtc
tcaggatacaaaattaatgtgcaaaatcacaagcattcttatacatcaataacagacaaa
cagagagccaaatcatga

>gi568815591r:112667200_112884825|GENSCAN_predicted_peptide_7|254_aa
MAHGLFHELGNCLGGTSVGYAIVIPTNFCSPDGQPTLLPPEHVQELNLRSTGMLNAIQRF
FAYHMIETYGCDYSTSGLSFDTLHSKLKAFLELRTVDGPRHDTYILYYSGHTHGTGEWAL
AGGDTLRLDTLIEWWREKNGSFCSRLIIVLDSENSTPWVKEVRKINDQYIAVQGAELIKT
VDIEEADPPQLGDFTKDWVEYNCNSSNNICWTEKGRTVKAVYDIANKETPPYDTVRSCET
YTLSQEQHGKELPS

>gi568815591r:112667200_112884825|GENSCAN_predicted_CDS_7|765_bp
atggctcatgggctcttccatgaattgggtaactgtttaggaggaacatctgttggatat
gctattgtgattcccaccaacttctgcagtcctgatggtcagccaacactgcttccccca
gaacatgtacaggagttaaatttgaggtctactggcatgctcaatgctatccaaagattt
tttgcatatcatatgattgagacctatggatgtgactattccacaagtggactgtcattt
gatactctgcattccaaactaaaagctttcctcgaacttcggacagtggatggacccaga
catgatacgtatattttgtattacagtgggcacacccatggtacaggagagtgggctcta
gcaggtggagatacactacgccttgacacacttatagaatggtggagagaaaagaatggt
tccttttgttcccggcttattatcgtattagacagcgaaaattcaaccccttgggtgaaa
gaagtgaggaaaattaatgaccagtatattgcagtgcaaggagcagagttgataaaaaca
gtagatattgaagaagctgacccgccacagctaggtgactttacaaaagactgggtagaa
tataactgcaactccagtaataacatctgctggactgaaaagggacgcacagtgaaagca
gtatatgatattgccaacaaggaaactcccccttatgataccgtcagatcttgtgagact
tatacgctatcacaagaacagcatggaaaagagctgccctcatga

>gi568815591r:112667200_112884825|GENSCAN_predicted_peptide_8|374_aa
MTNHPENSKNGKARYYKIPEVSVSKFECGKNKKSRCGRSDKELGCSPPSSASENATTTWR
PVEGEPLRNTERRSHTLQSPTTTVNSSRGSQFQNQTKKEESRQGFSLRLRTGRRHALGNE
GPRPYALIQASVGVNFLRYRPGRDRHVSAAEIRDGTERRRGEQGAARVSTAPPKAPAFLK
PKSTPGTWNQQLRTDSGKTTKDQADSAHQASRPPPRAGPNGDRALPPWSGKTKASGTGLP
FPRPHNDVLPSGQVLWQGQSQVILSLRMGQRSIVPYLVHLDPESGITFLHRKDIGTGDEN
AMFIQGADNCWPVSSQILSVDSWMAAAVHVQGLPHSKLSVYQIWSRIGPCPSCTKEEICG
GSAASFKDCLSDVT

>gi568815591r:112667200_112884825|GENSCAN_predicted_CDS_8|1125_bp
atgacaaatcatccagaaaattccaaaaatggaaaagccaggtattacaaaataccagag
gtgagtgtctctaagtttgaatgcggaaagaataaaaaatcgagatgtgggaggtccgac
aaggagcttggctgttctccaccatcctctgcctcggagaacgcgaccacaacgtggcgc
ccagttgagggggaacccttgcgaaatacggagaggcgctcccacaccctccaatctcca
accaccactgtcaactcctccagaggcagtcagttccaaaaccaaaccaaaaaggaggag
agcagacaaggcttcagcctgcgactgagaacagggaggcggcacgccttggggaacgag
ggcccgaggccatacgcgctaatacaggccagtgtcggcgtaaactttctccgttaccgg
cccggccgcgaccgccatgtttccgccgccgaaatccgcgacggtacggaacgccgccgg
ggtgagcaaggcgctgcgcgagtgtcaactgccccacctaaggcgcctgccttcctcaag
ccaaagtcgactcctgggacctggaatcagcagctgagaacagacagcgggaagaccacc
aaagatcaggctgacagtgcccatcaagcatctaggccaccgcctcgggctggaccgaac
ggggacagggcccttcccccatggagtggaaaaaccaaagcatcggggactggcttgcca
tttccaagaccacataatgatgtgctccccagtggacaggtcctgtggcaaggacaatca
caggttatcttgagtctcaggatgggacaaaggagtatcgtcccttaccttgtccacctg
gatcctgagagtggtatcacgttcttgcatcggaaggacataggcactggtgatgagaac
gccatgttcattcagggtgcagacaactgctggcctgtatcaagccagatcctgtcagtg
gacagctggatggctgcagctgtccatgtgcaaggattgcctcatagcaagctctctgtg
taccagatctggtcaagaattgggccctgtccctcctgtacaaaggaggaaatctgtgga
ggttcagcggcctctttcaaagactgtctttcagatgttacatag

>gi568815591r:112667200_112884825|GENSCAN_predicted_peptide_9|279_aa
MSYSAVGHEFNVNASTIRIKYSAFKQKHINSRLLQASGKIRLLDVGSCFNPFLKFEEFLT
VGIDIVPAVESVYKCDFLNLQLQQPLQLAQDAIDAFLKQLKNPIDSLPGELFHVVVFSLL
LSYFPSPYQRWICCKKAHELLVLNGLLLIITPDSSHQNRHAMMMKSWKIAIESLGFKRFK
YSKFSHMHLMAFRKISLKTTSDLVSRNYPGMLYIPQDFNSIEDEEYSNPSCYVRSDIEDE
QLAYGFTELPDAPYDSDSGESQASSIPFYELEDPILLLS

>gi568815591r:112667200_112884825|GENSCAN_predicted_CDS_9|840_bp
atgagttatagtgctgttggccacgagttcaatgttaatgcatcaacaatacgtattaaa
tacagtgcttttaaacagaaacatataaattcaaggttgctacaagcctcaggaaaaatc
agattacttgatgttggcagctgctttaacccatttctgaagtttgaagaatttctaact
gttggcatagatattgtacctgctgtagagagtgtctataaatgtgatttcctgaactta
cagcttcagcaaccactccagcttgcacaggatgctatagatgcttttttgaagcagctg
aaaaaccctattgattctcttcctggagagcttttccatgtggttgttttctctctcctt
ctttcttattttccatctccttaccagcgatggatttgctgcaagaaagcccatgaactg
ttagtgttaaatggtttattactaatcatcacacctgattcctcccatcagaaccgtcat
gctatgatgatgaaaagctggaagattgctatagagtccctgggctttaaacgcttcaag
tactcaaaattttcacatatgcatctgatggcatttaggaaaatctctctaaaaaccaca
agtgacttggttagtaggaactacccaggaatgttatatattcctcaagatttcaacagt
atagaagatgaggaatattctaacccttcctgctatgttcgatcagatatagaagatgaa
caactagcatatggtttcacagaactccctgatgcgccatatgactcagattctggagaa
agtcaagccagctctattcctttctatgagctagaagaccccatattacttttaagttaa

>gi568815591r:112667200_112884825|GENSCAN_predicted_peptide_10|150_aa
MALMSKSQQEATEAKEASLSRQCHWSSAGVELLERPAQHLSQPTRSGPQHVELLPHSSQC
TNERGWGREITPTLRIHSLALLLPPSVACASLLFPLTQGRLWQATFLLLLGQSTLKVKSP
GDWRLLPSLSLDWWATVPQPTKEAARDTQQ

>gi568815591r:112667200_112884825|GENSCAN_predicted_CDS_10|453_bp
atggcgcttatgagtaagagccagcaggaggcgacagaggcaaaagaagcatccttatcc
cgccagtgccactggtcttcagcgggagtggagctgctggagaggcctgcacagcacctc
tctcagcccacacgctctggtccccagcatgtagagctgctgccgcattcgtcccagtgc
actaatgagcggggctgggggcgagagattacccccactctccgaattcattccttggct
ttgctgttgcccccttcagtggcttgtgcttcacttttgtttcctttgacccaaggaagg
ctttggcaggccacgttcctccttctgttggggcagtccacactgaaggttaaatctcca
ggggactggagactgcttccctccctgtctttggactggtgggctactgtcccgcaacca
accaaggaagcagcccgggacacccagcagtga