GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:58:22 Sequence gi568815597f:62703787_62964156 : 260370 bp : 38.81% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 494 17 478 1 1 23 -35 430 0.440 16.32 1.01 Init - 8892 8831 62 0 2 104 92 27 0.829 4.57 1.00 Prom - 17044 17005 40 -4.45 2.00 Prom + 22511 22550 40 -2.15 2.01 Sngl + 36626 37174 549 1 0 90 36 252 0.859 16.06 2.02 PlyA + 37207 37212 6 1.05 3.11 PlyA - 40697 40692 6 1.05 3.10 Term - 41240 41164 77 0 2 74 48 37 0.053 -4.68 3.09 Intr - 50106 49933 174 1 0 37 35 225 0.856 11.09 3.08 Intr - 50864 50739 126 0 0 39 101 120 0.699 8.13 3.07 Intr - 51375 51156 220 0 1 79 49 93 0.051 1.45 3.06 Intr - 52158 52054 105 0 0 55 58 109 0.003 4.09 3.05 Intr - 57536 57428 109 1 1 75 44 51 0.000 -1.23 3.04 Intr - 59792 59634 159 1 0 117 61 26 0.022 0.98 3.03 Intr - 70270 70144 127 2 1 15 96 142 0.573 6.52 3.02 Intr - 74645 74554 92 1 2 102 81 25 0.562 1.92 3.01 Init - 77094 77045 50 0 2 72 75 21 0.216 -0.32 3.00 Prom - 78435 78396 40 -3.95 4.00 Prom + 82425 82464 40 -5.85 4.01 Init + 83084 83216 133 1 1 75 100 95 0.627 9.75 4.02 Term + 108816 109471 656 1 2 59 41 248 0.882 10.37 4.03 PlyA + 109579 109584 6 1.05 5.00 Prom + 110337 110376 40 -3.65 5.01 Init + 110514 110658 145 2 1 83 20 68 0.747 -0.17 5.02 Intr + 112789 112947 159 2 0 82 24 118 0.341 3.84 5.03 Intr + 115258 115549 292 2 1 25 93 149 0.500 4.37 5.04 Intr + 117353 117423 71 2 2 42 110 64 0.924 1.91 5.05 Intr + 125254 125390 137 2 2 54 86 132 0.947 8.97 5.06 Intr + 130252 130330 79 0 1 104 96 28 0.865 3.41 5.07 Intr + 130990 131066 77 2 2 98 69 22 0.363 -0.38 5.08 Intr + 137669 137761 93 1 0 50 65 108 0.167 3.94 5.09 Term + 144024 144167 144 2 0 49 32 123 0.209 -0.27 5.10 PlyA + 144887 144892 6 1.05 6.11 PlyA - 145313 145308 6 1.05 6.10 Term - 153370 153170 201 1 0 79 49 97 0.146 1.31 6.09 Intr - 155775 155709 67 2 1 61 86 7 0.131 -4.21 6.08 Intr - 156530 156336 195 1 0 53 57 143 0.067 5.31 6.07 Intr - 200234 200158 77 2 2 50 98 65 0.177 1.09 6.06 Intr - 202413 202247 167 1 2 75 115 119 0.821 12.06 6.05 Intr - 220185 220023 163 0 1 24 60 107 0.002 0.13 6.04 Intr - 224161 223954 208 0 1 62 71 90 0.006 2.86 6.03 Intr - 244800 244670 131 0 2 70 23 126 0.051 2.87 6.02 Intr - 248517 248429 89 1 2 39 103 43 0.062 -0.33 6.01 Intr - 255082 254889 194 0 2 27 10 189 0.052 3.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 32224 32087 138 1 0 57 76 123 0.806 8.19 S.002 Term - 224292 224129 164 1 2 100 49 123 0.917 6.72 S.003 Term - 255082 254878 205 0 1 27 49 194 0.821 5.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:62703787_62964156|GENSCAN_predicted_peptide_1|180_aa MKAEGRQLLVASFSLHYSLNIITNAEKSLKDLMELKTTARELRDECTSLSSRCDQLEERV SVMEDEMNEMKREEKIREKRIKRKEQSLQEIWDYVKRPNLHLIGVPESDGENGTKLENTL QDITQENFPNLARQAIQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKSW >gi568815597f:62703787_62964156|GENSCAN_predicted_CDS_1|540_bp atgaaggcagaagggagacaattgctggtagccagcttcagcctgcactattctctcaat ataataaccaatgcagagaagtccttaaaggacctgatggagctgaaaaccacagcacga gaactacgcgacgaatgcacaagcctcagtagccgatgtgatcaactggaagaaagggta tcagtgatggaagacgaaatgaatgaaatgaagcgagaagagaagattagagaaaaaaga ataaaaagaaaggaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatcta catctgattggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctg caggatattacccaggagaacttccccaatctagcaaggcaggccattcaaattcaggaa atacagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtc agattcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaagttgg >gi568815597f:62703787_62964156|GENSCAN_predicted_peptide_2|182_aa MDLTDIYRTFHPTTTEYTFYSTAHRTFSKIDHMIGVNTFKKIEIISSTLSDHSGIKLEIN SKRNLLYHANTWKLNNLLLNEHWVKNEINMKIKKFFKLNNNNDTTYQNLWDPAKVVLRGK FIALNAYIKKSERAQTDGLRSHLKEEEKQKQTKPKPSRKKEIAKIGAELNEIETTTTTKT KR >gi568815597f:62703787_62964156|GENSCAN_predicted_CDS_2|549_bp atggacttaacagacatatacagaacatttcatccaacaaccacagaatacacattctat tcaacagcacatagaactttctcaaagatagaccacatgataggcgtcaatacatttaag aaaattgaaattatatcaagcactctctcagaccacagtggaataaaactggagatcaac tccaaaaggaatcttctataccatgcaaatacgtggaaattaaataacctgctcctgaat gagcattgggtcaaaaatgaaatcaatatgaaaattaaaaaattcttcaaactaaacaac aataatgatacaacctatcaaaacctctgggatccagctaaggtggtgttaagaggaaag ttcatagccctaaatgcctacatcaaaaaatctgaaagagcacaaacagacggtctaagg tcacacctcaaggaagaagagaaacaaaaacaaaccaaacccaaacccagcagaaaaaag gaaatagccaagatcggagcagaactaaatgaaattgaaacaacaacaacaacaaaaacc aaaagataa >gi568815597f:62703787_62964156|GENSCAN_predicted_peptide_3|412_aa MTEAPKLPFHEYCVVLMSGLLLECGSYDLLSVEVGQRIPFRPALTLKGTSLATVGKSTKW AFGVPDSRPWLVDSISGPTLGQKVARCPEGTYRLVAFYIIYLFLCLSILYLTSLKDKLYD KRNYLSYIFTDVTQACKTVPKTLTPQLLLGIWPMTALATSCWIVAMTALATSCWIVAKKN LQRSLDPADQNNCALRCSSRNTSFSPRPQGGPRKGQQTCCWDPGAEWLSSLSTLGSAQKY GKSGSWFQANQPSHSEESGVVREPFPRKPDTRVFSLAAMLVAFNWPTGVKLQTFAVSVTA LKGGASGVVRPSPGVHDLTGFRSEAADLHGGQLASPRGARAGAVGGAVHQSNAVRQQSSA LGRSMGSGTAEQGVAPIGELLAAQEPQERDSTMSHQQPTLPKPEGNQCFDSK >gi568815597f:62703787_62964156|GENSCAN_predicted_CDS_3|1239_bp atgaccgaggccccgaaactgccttttcatgaatactgcgtagttttaatgtcaggactc cttctggaatgtgggtcttatgatttactttcagtcgaggtaggtcagagaattcctttc aggccagctcttacactgaaaggtaccagtttggccacagtggggaagagcaccaagtgg gcatttggggtccctgattccaggccttggctcgtggacagcatttctggacctaccctg ggccagaaagtggcccgttgtcctgaagggacttatcgccttgtagcattctacataatt tacttatttctctgtttgtctattctctatctcacttcactaaaagataagctctatgac aaaagaaattacttgagttacatcttcactgatgtaactcaagcctgtaaaacagtgcct aagacattgacaccccaactgctgttgggaatttggccaatgactgctctagctacttcc tgctggatagtggcaatgactgctctagctacttcctgctggatagtggcaaagaagaac ctgcaacggtccctggaccctgctgatcagaataattgcgctctccgatgcagcagcaga aacactagtttttcccccagaccacaaggaggaccgaggaagggtcaacaaacttgttgt tgggaccctggagctgaatggctttcctctctgtcaaccctcggctcagcccagaagtac gggaaaagcggaagctggttccaagcaaaccaaccctcccactccgaagagtcgggggtt gttagagagccctttcccagaaagcctgacacccgtgtctttagtctggcagccatgcta gttgcttttaactggccgacaggagtgaagctgcagaccttcgcagtgagtgttacagct cttaaaggtggcgcgtctggagttgttcgtccctccccaggggtccatgatctcactggc ttcaggagtgaagctgcagaccttcatggaggccagctggcttcacctagaggagcccgt gccggggccgtgggtggagctgtccaccagtccaatgctgtgcgccagcagtcctcagcc cttgggcggtcaatgggatcaggcaccgcagagcagggggtggcacccattggggagctt ctggccgcgcaggaaccacaggaaagagactcaactatgagccatcagcagccaacactc ccaaaacctgaaggaaaccaatgcttcgattctaaataa >gi568815597f:62703787_62964156|GENSCAN_predicted_peptide_4|262_aa MGRGQLEDCWSGPIERWYFGQGGGGKMDSRGVQEVELGDGLDMEVLEVLAREIRQEKAIK GIQIGREEVKLSLFADDMIVYLENPVVSAQNLLMLISNFSKVSEYKINVQKSQAFLYTSD RQTESQIMSELPFTIATKRIKYLGIQLTRDVKDLFKENYKPLLKGIKEDTNKWKNIPCSW IGRINIMKMAILPKVIYRFNAIPIKLPFTFFTELEKTTLKFIWNQKRARVAKTILSKKNK AGGITLPDFKLYYRATVTKTDI >gi568815597f:62703787_62964156|GENSCAN_predicted_CDS_4|789_bp atggggagaggccagttagaagattgttggagtggcccaattgagagatggtattttgga caaggtggtggtggtaagatggattccagaggtgttcaggaggtggaactaggcgatgga ttagatatggaagtattggaagttctggccagggaaatcaggcaagagaaagcaataaag ggtattcaaataggaagagaggaagtcaaattatctctttttgcagatgacatgattgta tatttagaaaaccctgttgtctcagcccaaaatctccttatgctgataagcaatttcagc aaagtctcagaatacaaaatcaatgtgcaaaaatcgcaagcattcctatataccagtgat agacaaacagagagccaaatcatgagtgaactcccattcacaattgccacgaagagaata aaatacttaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaa ccgctgctcaagggaataaaagaggacacaaacaaatggaaaaacattccatgctcatgg ataggaagaatcaatatcatgaaaatggccatactgcccaaagtaatttatagattcaat gctatccccatcaagctaccattcactttcttcacagaattggaaaaaactaccttaaag ttcatatggaaccaaaaaagagctcgcgtagccaagacaatcctaagcaaaaagaacaaa gctggaggcatcacactacctgacttcaaactatactacagggctacagtaaccaaaaca gatatatag >gi568815597f:62703787_62964156|GENSCAN_predicted_peptide_5|398_aa MDEAGNHPSQQTNTGTENQTLHVLTHKWELNNENTWRLGGEHHTPGPVNEDKTLPAESGC TIEDHVIAGNVEEFRKDFISRIWLTYREEFPQIEGSALTTDYSESWTSHTVKKFTASFEA SLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLHQLIEYGKK SGKKAGDWYGPAVVAHILRKAVEEARHPDLQGITIYVAQDCTVYNSDVIDKQSASMTSDN ADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIY MDPHYCQSFVDVSIKDFPLEMSFRKMDPSCTIGFYCRNVQDFKRASEEITKGGKWSGRVE SGKLEGQMSMMKSGLLEVPPKQPLFRSLPSSDFSISME >gi568815597f:62703787_62964156|GENSCAN_predicted_CDS_5|1197_bp atggatgaagctggaaaccatccttctcagcagactaacacaggaacagaaaaccaaaca ctgcatgttctcactcataagtgggagttgaacaatgagaatacatggagactgggaggg gaacatcacacaccagggcctgtcaatgaagataaaacgttacctgcagagtcgggatgt acaatagaggatcacgtaattgcaggaaatgtagaagaatttcgtaaagatttcatttct agaatatggctgacctacagggaagaattccctcaaatagaaggctcagctttgacaaca gactactctgaatcatggacttcccacactgtcaaaaaatttactgcatcatttgaagca tcactttcaggggaaagagaattcaaaaccccaacaatttctctgaaggaaacaattggg aaatattctgatgatcatgaaatgcgaaatgaagtttatcataggaaaatcatctcttgg tttggtgattcccccttggctctttttggcttacatcaactaatagaatatggaaagaag tctgggaaaaaagcaggagattggtatggaccagctgtggttgctcacattttaagaaaa gcagttgaagaagcaaggcatcctgatttacaaggaataactatttatgttgcacaagat tgtacagtttacaattctgatgtaattgataaacagagtgcttccatgacttctgataat gcagatgacaaagctgttattattctagttcctgttagacttggtggagaaagaaccaac accgactacttagaatttgtgaagggtattttaagcctggaatattgtgtgggtattatt ggtggcaaacctaaacagtcatattactttgctggatttcaagatgacagtttgatttac atggatcctcattactgccaatcttttgtagatgtcagcataaaggatttccctcttgag atgtcttttcgaaaaatggatcccagctgtacaataggattttactgtcgaaatgttcag gacttcaaacgagcttctgaagaaatcaccaagggaggcaaatggtccgggagagttgag agtggaaaacttgaaggtcagatgagtatgatgaagtcaggtttactggaagtgcctcca aaacaaccactcttccgttctctgccttcttcagacttcagcatctctatggaatag >gi568815597f:62703787_62964156|GENSCAN_predicted_peptide_6|497_aa XCYQKLSTLKPKRNETGQRGGKGGEEKEKALGLKFVKPEMESIVTLIGHGVAPSLETLSP PKSPEGDLLFQPVEGSFSLVFSARIKSSKIVPNQQDSRNKNALEDSEATKGKGSESLNEC MEESCLLTRNTRTYYMNKKTVGKRESPSYKENTIFWGVILGRKEDRDICSSLGKDTRRKA SFSQSPSHVQVRVKQVKHCHSWKRKSSQASNGCFTEGATRERKWGPSQILMRANAGHSSE DVRTFCPFVVTQPCGVVYIYFIRSKRKRRRIQTARRQTELPEEESIAHPETKPRTLTACQ VWAQLHSSSFNSRLGASALVKFMSNNKIIDNQMNYAGHSGDKERVYSFYFFKKKLPVEQP QAGALGGTAEEGTVIIGDDSCMCVTIPGDLLVGQDVGVEDSGIDDPDPGLTNSPRATSSP AGAIHGKFPTQATDWYESMVQGLGAPVNDNKEPDIKVLGEEHSRQGNQSVKVLGCQQTLH VQRTESKPAWLAGRGKQ >gi568815597f:62703787_62964156|GENSCAN_predicted_CDS_6|1494_bp nnatgctatcaaaaattaagtacactgaagccaaaaagaaatgaaacaggacaaagagga ggaaaaggaggagaagaaaaggagaaggcccttggactaaaatttgtaaaaccagagatg gaatctatagtcactctaattggccatggtgtggctccaagcttggagaccttgagtcct ccaaagtccccagagggagacctgttgttccaaccagtagaaggaagtttttcccttgtc ttctctgctcgaataaaatcaagtaaaattgttccaaatcagcaggattcacgtaacaag aatgccttagaggatagtgaagccacaaaagggaaggggtctgaatccctgaatgagtgc atggaagagagctgcctgctgaccaggaataccaggacctattacatgaacaagaagact gtgggaaagagggagtctccaagctataaggagaatacgattttctggggggtgatctta ggcaggaaggaggacagagacatctgcagctccctaggaaaggacacaaggagaaaagcc tccttttctcagagtccaagccatgtccaggtgcgggtgaagcaggtcaagcactgtcac tcttggaaacgcaaaagctcccaggcgagcaatggctgcttcacagagggggccactcgt gagagaaagtggggaccatctcaaatactcatgagagccaatgctggccactcttctgag gatgtacgcacattctgtcctttcgtggtcacacagccctgtggagtggtttatatttac ttcataaggtcaaaaagaaaaaggagacgaattcaaactgcacgacggcaaacagagctt cccgaggaggaatccattgctcatcctgaaaccaagcccaggacactcactgcttgccag gtatgggcccagttacattcgagctccttcaactcacgcctgggagcatcagcgcttgta aaatttatgtccaataacaagataatagataatcagatgaattatgccggtcattcaggg gacaaagaaagagtgtactcattttacttttttaaaaagaaattacctgtagaacagcct caggcaggtgctttaggaggcactgcagaagaaggcactgttatcataggagatgacagc tgcatgtgtgttaccatccctggagaccttctagtgggacaagatgtaggggtagaagat agtggtattgatgatcctgaccctggactcactaattcacccagagcaacctccagtcct gcaggtgccattcatggtaaattccctacacaggccacagactggtacgagtccatggtc caggggctgggggcccctgtgaatgacaataaggagccagacataaaagttctgggcgaa gagcactctaggcagggaaatcagagtgtaaaggtcctaggatgccagcaaactttgcat gttcaaagaacagaaagcaagccagcgtggcttgcaggtagaggaaaacagtaa