GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:50:25 Sequence gi568815595r:190208280_190422206 : 213927 bp : 37.40% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7376 7381 6 1 0 73 127 7 0.397 3.67 1.02 Intr + 14421 14528 108 2 0 69 67 88 0.234 4.36 1.03 Term + 27264 27371 108 2 0 139 40 29 0.282 0.73 1.04 PlyA + 28048 28053 6 1.05 2.02 PlyA - 29656 29651 6 1.05 2.01 Sngl - 37786 37448 339 1 0 96 49 156 0.620 8.29 2.00 Prom - 39977 39938 40 -5.85 3.00 Prom + 46867 46906 40 -6.25 3.01 Init + 47488 47692 205 0 1 75 102 154 0.687 14.61 3.02 Term + 48536 48558 23 0 2 115 37 32 0.945 -1.70 3.03 PlyA + 48863 48868 6 1.05 4.00 Prom + 50225 50264 40 -8.15 4.01 Init + 50391 50477 87 2 0 47 84 113 0.964 7.49 4.02 Intr + 50529 50654 126 2 0 -5 66 150 0.651 3.46 4.03 Term + 53670 53861 192 2 0 56 41 147 0.904 3.24 4.04 PlyA + 54790 54795 6 1.05 5.03 PlyA - 54823 54818 6 1.05 5.02 Term - 75388 75227 162 1 0 93 41 125 0.946 5.25 5.01 Init - 79914 78847 1068 0 0 50 42 371 0.593 23.07 5.00 Prom - 80184 80145 40 -8.95 6.02 PlyA - 81370 81365 6 1.05 6.01 Sngl - 84629 84180 450 2 0 59 47 317 0.436 20.66 6.00 Prom - 91085 91046 40 -7.15 7.07 PlyA - 91252 91247 6 1.05 7.06 Term - 91682 91576 107 0 2 83 48 99 0.927 2.99 7.05 Intr - 98446 98314 133 1 1 94 61 98 0.383 7.00 7.04 Intr - 100160 100004 157 1 1 107 -7 99 0.704 1.39 7.03 Intr - 101974 101890 85 2 1 82 97 53 0.970 3.56 7.02 Intr - 104757 104593 165 1 0 29 90 119 0.170 5.21 7.01 Init - 113927 113705 223 2 1 87 73 437 0.479 38.86 7.00 Prom - 114201 114162 40 -4.15 8.00 Prom + 116747 116786 40 -5.85 8.01 Init + 128063 128143 81 1 0 71 98 72 0.779 7.52 8.02 Intr + 132760 132825 66 0 0 81 86 46 0.410 1.88 8.03 Term + 133099 133224 126 0 0 90 44 81 0.498 1.20 8.04 PlyA + 134215 134220 6 1.05 9.02 PlyA - 136343 136338 6 1.05 9.01 Sngl - 138569 138351 219 2 0 89 54 146 0.841 6.23 9.00 Prom - 147960 147921 40 -4.65 10.00 Prom + 150053 150092 40 -6.85 10.01 Init + 151613 151661 49 1 1 69 63 90 0.310 5.76 10.02 Intr + 162614 162722 109 0 1 85 84 99 0.982 7.62 10.03 Term + 164677 164779 103 1 1 139 38 63 0.883 3.17 10.04 PlyA + 164936 164941 6 1.05 11.00 Prom + 171231 171270 40 -5.35 11.01 Init + 180051 180164 114 2 0 96 94 70 0.900 8.82 11.02 Intr + 190002 190103 102 2 0 67 11 121 0.615 1.75 11.03 Intr + 194058 194160 103 2 1 84 57 100 0.958 5.33 11.04 Intr + 196483 196647 165 2 0 94 86 138 0.901 13.21 11.05 Intr + 200035 200226 192 2 0 99 116 -21 0.420 0.34 11.06 Term + 201624 201757 134 1 2 70 38 97 0.802 0.17 11.07 PlyA + 202659 202664 6 1.05 12.02 PlyA - 203414 203409 6 1.05 12.01 Sngl - 209125 208448 678 1 0 59 54 266 0.972 16.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 42511 42477 35 2 2 94 103 1 0.820 -0.78 S.002 Init - 43547 43463 85 2 1 102 81 71 0.920 8.83 S.003 Sngl - 126435 126142 294 0 0 66 39 171 0.925 5.45 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:190208280_190422206|GENSCAN_predicted_peptide_1|73_aa MTDLEVSLVKAVKQQRWQPVPPSASFVPGKHGPVADPENFHNYDTIRCSRLISHTDTVLA LKSAMSLSSSGLI >gi568815595r:190208280_190422206|GENSCAN_predicted_CDS_1|222_bp atgacggacctggaggtgtcactagtgaaggctgtgaaacagcaaagatggcagcctgtt cctccttctgcaagctttgtcccagggaaacacggacctgttgctgacccagagaacttt cataattatgatacaataagatgttccagactcatctctcacactgatactgtcctagcc ctgaaatcagccatgtctcttagtagctctggtttaatttag >gi568815595r:190208280_190422206|GENSCAN_predicted_peptide_2|112_aa MGLSWSTSARAMWKENMGLEPPHRVPTGALPSGAVRRGPLSTRPQNDRSTKSLHHVPRKA TDTQCQPVKTAGREAVLCKVMEVELCKTMGTHLLHQCDLDVRPEVKGGHFGA >gi568815595r:190208280_190422206|GENSCAN_predicted_CDS_2|339_bp atggggctgtcatggagtacctctgctagggcaatgtggaaggaaaatatggggttggag cccccacacagagtccctactggggcactgcctagtggagctgtgagaagagggccacta tctaccagaccccagaatgatagatccaccaagagtttgcatcatgtgcctcgaaaagcc acagacactcaatgccagcctgtgaaaacagctgggagggaggctgtactctgcaaagtc atggaggtggagctgtgcaagactatgggaacccatctcttgcatcagtgtgatctggat gtgagacctgaagtcaaaggaggtcattttggagcttga >gi568815595r:190208280_190422206|GENSCAN_predicted_peptide_3|75_aa MELGFELRSLAPEPALLCRIITQQSRVAVYSRRRNRNYKAFSAERNLVHLKDQKKDNVAF HGGSGSRKGIRVHST >gi568815595r:190208280_190422206|GENSCAN_predicted_CDS_3|228_bp atggagcttggatttgaactgagaagcctagctccagaacctgccctgttgtgccgtatc atcactcaacaatctagggtagcagtgtattccagaagaagaaatagaaattacaaagcc tttagtgcagaaaggaacctggtgcatttgaaagaccaaaagaaggataatgtggccttt catggagggagtggaagcaggaaaggtattcgagttcatagcacataa >gi568815595r:190208280_190422206|GENSCAN_predicted_peptide_4|134_aa MQALCALLAPKCSTPIAELNANQRLQLCRFRVVIQQFSDTEKVMDFKENAECRDEGLRRM AQLMITLHNFTVCECVCVHECADLGLASSAVVLEGKTTALNSPGYHQPVFQIYNGGGLGN HSRPPTKKINENNI >gi568815595r:190208280_190422206|GENSCAN_predicted_CDS_4|405_bp atgcaggccttatgtgccttactggctcccaagtgctccactccaattgctgaactgaat gctaatcagcgactccagctgtgcaggttcagagtagttattcagcagttcagtgacaca gagaaggtcatggacttcaaagagaatgctgaatgcagagatgaaggcctgcgcagaatg gcccagttaatgatcacacttcataatttcacggtgtgtgagtgtgtatgtgtgcatgag tgtgctgacctgggcttggctagctcagctgttgttttggaaggcaaaaccacagctcta aattccccagggtatcatcagccagtgttccagatctacaatggtggtggtcttgggaat cacagcagaccacccacaaagaaaataaatgaaaacaacatctag >gi568815595r:190208280_190422206|GENSCAN_predicted_peptide_5|409_aa MSELPFTIASKGIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGIMLP DFKLYYKATVTKTAWYWHQNRHIDQWNRTEPSEIMLLIYNYLIFDKPEKNKQWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNIRPKTIKTLEENLGITIQDIGMGKD FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFATYSSDKGLISRI YNELKHIYKKKTNNPIKKWVKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQTKTTTKKK REKNQINKKGKGNITTDTIVIQRINRDNYEQIYANKIESLEKMDERLEA >gi568815595r:190208280_190422206|GENSCAN_predicted_CDS_5|1230_bp atgagtgaactcccattcacaattgcttcaaagggaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaacaaatggaagaacattccatgctcatgggtgggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatctggaaccaaaaaaga gcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctccct gacttcaaactatactacaaggctacagtaactaaaacagcatggtactggcaccaaaac agacatatagatcaatggaacagaacagagccctcagaaataatgctgcttatctacaac tatctgatcttcgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggattaaagacttaaacattagacctaaaacc ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acagaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatc tacaatgaactcaaacacatttacaagaaaaaaacaaacaaccccatcaaaaagtgggtg aaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaa aaatgctcatcatcactggccatcagagaaatgcaaaccaaaaccacaactaagaaaaaa agggagaagaaccaaataaataaaaagggaaaaggaaacattactactgacaccatagta atacagagaatcaatagagacaattatgaacaaatatatgccaacaaaattgaaagccta gaaaaaatggatgaacgtctggaagcataa >gi568815595r:190208280_190422206|GENSCAN_predicted_peptide_6|149_aa MWESLELPRDLLNGFDQNADSDVDNEVQAEVVSDGDEELTGNWNEDHFCYALAKRLVAFC LFPRDMWNFKLERDDLKLEVMFKREAEHKSLENLQPDDAIEKEKPISGEKFKPAAEICIS NKEPNINSQDNGENISRACQRSLRQPFPS >gi568815595r:190208280_190422206|GENSCAN_predicted_CDS_6|450_bp atgtgggaaagtttggaacttcctagagacttgttgaatggttttgaccaaaatgctgat agtgatgtggacaatgaagtccaagctgaggtggtctcagatggagatgaggaacttact gggaactggaatgaagatcacttttgctatgctttagcaaagagactggtggcattttgc ctcttccctagagatatgtggaactttaaacttgagagagatgatctgaaattggaagtt atgtttaaaagggaagcagagcataaaagtttggaaaatttgcagcctgatgatgcaata gaaaaagaaaaacccatttctggggagaaattcaagccagctgcagaaatttgcataagt aataaggagccaaacattaatagccaagacaatggggaaaatatctccagggcatgtcag agatctttgaggcagccctttccatcatag >gi568815595r:190208280_190422206|GENSCAN_predicted_peptide_7|289_aa MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTG QIQCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAV IGGAIFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGA LLCCSCPRKTTSYPTPRPYPKPAPSSGKDYSVQNAISLEQDDVMERVLALVSGDLDLSLG AINHRLCLSKAFGCLYGFRSHGYFGSRKVTLPSLLPPYASQEDEDAKEG >gi568815595r:190208280_190422206|GENSCAN_predicted_CDS_7|870_bp atggccaacgcggggctgcagctgttgggcttcattctcgccttcctgggatggatcggc gccatcgtcagcactgccctgccccagtggaggatttactcctatgccggcgacaacatc gtgaccgcccaggccatgtacgaggggctgtggatgtcctgcgtgtcgcagagcaccggg cagatccagtgcaaagtctttgactccttgctgaatctgagcagcacattgcaagcaacc cgtgccttgatggtggttggcatcctcctgggagtgatagcaatctttgtggccaccgtt ggcatgaagtgtatgaagtgcttggaagacgatgaggtgcagaagatgaggatggctgtc attgggggtgcgatatttcttcttgcaggtctggctattttagttgccacagcatggtat ggcaatagaatcgttcaagaattctatgaccctatgaccccagtcaatgccaggtacgaa tttggtcaggctctcttcactggctgggctgctgcttctctctgccttctgggaggtgcc ctactttgctgttcctgtccccgaaaaacaacctcttacccaacaccaaggccctatcca aaacctgcaccttccagcgggaaagactactctgtacagaatgctatttcacttgagcaa gatgatgtaatggaaagggtgttggcattggtgtctggagacctggatttgagtcttggt gctatcaatcaccgtctgtgtttgagcaaggcatttggctgcttgtatggcttccgttct catggttattttggttccaggaaggttactctaccctcactcttaccaccttatgcaagt caagaagatgaagacgcaaaagaaggataa >gi568815595r:190208280_190422206|GENSCAN_predicted_peptide_8|90_aa MQIQWLATYTMLLGIQWSGAYSNIPSKGRAFPPAAFMGWHDIECLQLFQAQHQVEAAKVW ELHPLKSWPELYVGPFQPWLEQLGCRVPSP >gi568815595r:190208280_190422206|GENSCAN_predicted_CDS_8|273_bp atgcagattcagtggcttgccacatatacgatgcttttagggatccagtggtctggtgca tactcgaacatcccttccaagggtagagcttttcctccagctgctttcatgggctggcat gacattgagtgtctgcagcttttccaggctcaacaccaggtggaagctgccaaggtttgg gaattgcaccctctgaagtcatggcctgagctctatgttggcccctttcagccatggctg gagcagctgggatgcagggtaccaagtccctag >gi568815595r:190208280_190422206|GENSCAN_predicted_peptide_9|72_aa MGRGKAMGDTGENVMWKGRGWGDGSASQATPGIVSNHYKLGEGREGYFPRSPSDKAYDLT GILISDVWAPEK >gi568815595r:190208280_190422206|GENSCAN_predicted_CDS_9|219_bp atgggcagagggaaggcaatgggagacacaggggagaatgtcatgtggaaaggcagaggc tggggtgatggatctgcaagccaagcaacaccagggattgtgagtaaccactacaagcta ggagaaggcagggaaggatacttcccaagatccccttcagacaaagcctatgaccttact ggtatcctgatctcagacgtctgggctccagaaaaatga >gi568815595r:190208280_190422206|GENSCAN_predicted_peptide_10|86_aa MRLDVKKHGSRVLALKDDSCPQTSDSKFLSFGTQTGFLAPQLADELLWDLVIIISACVGI LTVHADNLPDSSKGSPFVCSAFQSLT >gi568815595r:190208280_190422206|GENSCAN_predicted_CDS_10|261_bp atgaggctcgatgtgaagaaacatggatctcgagttttggcactgaaagatgattcctgc cctcaaacatcggactccaagttcttgagttttgggactcagactggctttcttgctcct cagcttgcagatgaactactgtgggatcttgtgatcatcatttctgcctgtgtcggaatc cttactgtacatgctgataatctgcctgactcttcaaagggcagcccctttgtttgttct gccttccagtccctgacatag >gi568815595r:190208280_190422206|GENSCAN_predicted_peptide_11|269_aa MRDLLQYIACFFAFFSAGFLIVATWTDCWMVNADDSLEFWRLEAYNQEVGKAALPLKVLE ESVLSLFQLLVAVSTKCRGLWWECVTNAFDGIRTCDEYDSILAEHPLKLVVTRALMITAD ILAGFGFLTLLLGLDCVKFLPDEPYIKVRICFVAGATLLIAGTPGIIGSVWYAVDVYVER STLVLHNIFLGIQYKFGWSCWLGMAGSLGCFLAGAVLTCCLYLFKDVGPERNYPYSLRKA YSAAGVSMAKSYSAPRTETAKMYAVDTRV >gi568815595r:190208280_190422206|GENSCAN_predicted_CDS_11|810_bp atgagggatcttcttcaatacatcgcttgcttctttgcctttttctctgctgggtttttg attgtggccacctggactgactgttggatggtgaatgctgatgactctctggagttctgg aggctggaagcttataatcaagaagttggcaaggctgcgctgcccctgaaagttctggaa gaatccgttcttagcctcttccagcttctggtggctgtgagcacaaaatgccgaggcctc tggtgggaatgcgtcacaaatgcttttgatgggattcgcacctgtgatgagtacgattcc atacttgcggagcatcccttgaagctggtggtaactcgagcgttgatgattactgcagat attctagctgggtttggatttctcaccctgctccttggtcttgactgcgtgaaattcctc cctgatgagccgtacattaaagtccgcatctgctttgttgctggagccacgttactaata gcaggtaccccaggaatcattggctctgtgtggtatgctgttgatgtgtatgtggaacgt tctactttggttttgcacaatatatttcttggtatccaatataaatttggttggtcctgt tggctcggaatggctgggtctctgggttgctttttggctggagctgttctcacctgctgc ttatatctttttaaagatgttggacctgagagaaactatccttattccttgaggaaagcc tattcagccgcgggtgtttccatggccaagtcatactcagcccctcgcacagagacggcc aaaatgtatgctgtagacacaagggtgtaa >gi568815595r:190208280_190422206|GENSCAN_predicted_peptide_12|225_aa MWESLEPPRDLLNGLHQNADSDVDSEVQAEVVSNGDKELSRSWNEDYSCCALAKRLVAFC LCPIYLWNFELQRDYLKLDLMFKKEIEHKSLENLWPDNVIEKKNPFSGEKFKLAVEISLS NKKPNINSQDNWENISRACQRLLRQPLPSQAWKPRRKRWFRGPGPWLCCFVQPWDLVSCV PAAPAPVGTTRGQGTAQAIVLEAANPWQLPHGVGPVGAQNSRIKV >gi568815595r:190208280_190422206|GENSCAN_predicted_CDS_12|678_bp atgtgggaaagtttggaacctcctagagatttgttgaatggtttgcaccaaaatgctgat agtgatgtggacagtgaagttcaagccgaggtggtctcaaatggagataaggaacttagt aggagctggaatgaagattactcttgctgtgctttagcaaagagactggtggcattttgc ctctgccctatatatctgtggaactttgaacttcaaagagattatctgaaattggatctt atgtttaaaaaggaaatagagcataaaagtttggaaaatttgtggcctgacaatgtgata gaaaagaaaaacccattttctggggagaaattcaagctggctgtagaaattagcctgagt aataagaagccaaacattaatagccaagacaattgggaaaatatctccagggcatgtcag agacttttgaggcagcccctcccatcacaggcctggaagcctaggaggaaaagatggttt cgtgggccaggcccctggctctgctgctttgtgcagccttgggatttggtgtcctgtgtc ccagcagctccagctccagttggcactacaaggggccaaggaacagctcaggccatagtt ttagaagctgccaatccttggcagcttccacatggtgttgggcctgtgggtgcacagaat tcaagaattaaggtttga