GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:49:36 Sequence gi568815584r:34416313_34639628 : 223316 bp : 44.82% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 8777 8816 40 -2.46 1.01 Sngl + 16112 16447 336 1 0 56 38 232 0.883 11.03 1.02 PlyA + 16486 16491 6 1.05 2.06 PlyA - 16635 16630 6 1.05 2.05 Term - 45432 45371 62 1 2 69 42 70 0.469 -1.53 2.04 Intr - 46424 46404 21 0 0 75 116 18 0.211 0.82 2.03 Intr - 46762 46678 85 1 1 89 111 19 0.372 3.69 2.02 Intr - 53182 53093 90 1 0 95 115 54 0.961 8.99 2.01 Init - 54174 54112 63 0 0 82 102 12 0.944 1.38 2.00 Prom - 58105 58066 40 -3.46 3.00 Prom + 60983 61022 40 -6.06 3.01 Sngl + 69667 70140 474 0 0 91 47 491 0.997 41.41 3.02 PlyA + 72388 72393 6 1.05 4.22 PlyA - 74369 74364 6 1.05 4.21 Term - 100274 99998 277 1 1 90 41 141 0.325 4.53 4.20 Intr - 113163 113046 118 1 1 5 78 94 0.447 -0.38 4.19 Intr - 117227 117132 96 0 0 45 111 82 0.900 6.18 4.18 Intr - 119963 119782 182 1 2 21 57 175 0.944 7.21 4.17 Intr - 123332 123243 90 1 0 7 80 206 0.028 10.81 4.16 Intr - 125275 125104 172 2 1 23 29 136 0.006 0.20 4.15 Intr - 130570 130410 161 0 2 64 111 44 0.027 3.93 4.14 Intr - 140454 140302 153 2 0 127 9 64 0.016 1.59 4.13 Intr - 141276 141092 185 0 2 32 14 163 0.089 1.89 4.12 Intr - 148183 148052 132 1 0 59 52 108 0.341 5.14 4.11 Intr - 151459 151374 86 2 2 98 67 28 0.231 1.24 4.10 Intr - 151701 151542 160 0 1 57 79 54 0.147 0.96 4.09 Intr - 152837 152404 434 0 2 67 38 385 0.001 24.77 4.08 Intr - 176838 176733 106 0 1 75 69 75 0.930 4.09 4.07 Intr - 181333 181238 96 1 0 62 116 27 0.858 3.11 4.06 Intr - 187159 187036 124 0 1 62 76 60 0.980 2.79 4.05 Intr - 189405 189284 122 0 2 75 94 107 0.996 9.29 4.04 Intr - 191828 191718 111 2 0 64 97 75 0.989 6.58 4.03 Intr - 193430 193326 105 2 0 70 94 83 0.982 7.51 4.02 Intr - 213642 213595 48 0 0 58 83 118 0.803 7.18 4.01 Init - 213840 213799 42 0 0 64 64 83 0.580 2.02 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 123316 123243 74 1 2 110 80 161 0.965 18.04 S.002 Intr - 159530 159444 87 2 0 123 86 60 0.871 9.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:34416313_34639628|GENSCAN_predicted_peptide_1|111_aa MAYTPGEGAMNIVEITMKDLGYFINLVDKAAAGFERIASNFGGSSIVGKRLSNSLHATEK SFMNEELINAANFIVVLFLKIAIATPTFRNHDPDQSAAINIKARPSTNKKL >gi568815584r:34416313_34639628|GENSCAN_predicted_CDS_1|336_bp atggcatatacacctggtgaaggtgctatgaacattgttgaaattacaatgaaggactta ggatattttataaacttagttgataaagcagcagcagggtttgaaaggattgcttccaat tttggaggaagttccattgtgggtaaaaggctatcaaatagcttacatgctacagagaaa tctttcatgaatgaagagttgatcaatgcagcaaactttattgttgtcttatttttaaaa attgccatagccaccccaaccttccgcaaccatgacccagatcagtcagcagccatcaac atcaaggcaagaccctccaccaacaaaaaattatga >gi568815584r:34416313_34639628|GENSCAN_predicted_peptide_2|106_aa MNSSRAGSVLPALSLPQPLAKVGNTFSIEIFLDLKVRDDPRLINRDSQRQQNPLDQRASG LDPLGGYLQPLRGGQIDVFARPPAAQGGGDHVTVEHIAPGLDAFDL >gi568815584r:34416313_34639628|GENSCAN_predicted_CDS_2|321_bp atgaactcctcaagggctggctctgtgctacctgccctatcactcccacagcccctggca aaggttggaaatacttttagtatagaaattttcttggatctgaaagtacgagatgatcca cggttaataaacagggatagccagcggcagcagaacccactagatcagcgcgcttctggc cttgacccacttggtggttacctacagccactaagaggtgggcagatagatgtttttgct aggccgccagctgcacaaggaggcggggatcatgtaacagtggagcacatcgctcccggc ttggacgcctttgacctttaa >gi568815584r:34416313_34639628|GENSCAN_predicted_peptide_3|157_aa MALKAKKEALAPPKAEAKAKALKAKKAVMKDVHIHKKRRPARHPPSRGPGHCDSRPPKYP RKSAPRRNKLDRYAIIKFLWTTESAMKKIEDNTTYVFRVDVKANKQQIKQAVKKLPDNDA AKVNTLIRPDGEKKAYVQLAPDYDVLDVAKKKWDHLN >gi568815584r:34416313_34639628|GENSCAN_predicted_CDS_3|474_bp atggcgttgaaagcaaagaaggaagcgcttgcccctcctaaagccgaagccaaagcgaag gctttaaaggccaagaaggcagtgatgaaagatgtccacatccacaaaaaaagaagaccc gcacgtcacccaccttccagaggcccaggacactgcgactcgaggccacccaaatatcct cggaagagcgcccccaggagaaacaagcttgaccgttatgccatcatcaagtttctgtgg accactgagtccgccatgaagaagatagaagacaacaccacatatgtgttcagggtggat gttaaagccaacaagcagcagatcaaacaggctgtgaagaagctccctgacaatgatgcg gccaaggtcaacaccctgattcggcccgatggagagaagaaggcatatgttcaactggct ccagattacgatgttttggatgttgccaaaaaaaaatgggatcatctaaactga >gi568815584r:34416313_34639628|GENSCAN_predicted_peptide_4|999_aa MRACAGPRLGAAMMEGLDDGPDFLSEEDRGLKAINVDLQSDAALQVDISDALSERDKVKF TVHTKSSLPNFKQNEFSVVRQHEEFIWLHDSFVENEDYAGYIIPPAPPRPDFDASREKLQ KLGEGEGSMTKEEFTKMKQELEAEYLAIFKKTVAMHEVFLCRVAAHPILRRDLNFHVFLE YNQDLSVRGKNKKEKLEDFFKNMVKSADGVIVSGVKDVDDFFEHERTFLLEYHNRVKDAS AKSDRMTRSHKMHRPGRMMPGVTVKDVNQQEFIRALAAFLRKSGKLKVPEWVDTVKLLAK HKELAPYDENWLYTRAASTAWHLYLWGGAGVGSMTKIYGGCQRSGGMPGHFSRGSKNVAH RVLQALEGLKMVEKDQDGGRKLTRQGQRDLDRIAGQDLLYRRSRSLVDYENANKALDKAR AKNKDVLQAETSQQLCCQKFEKISESAKQELIDFKTRRVAAFRKNLVELAELELKHAKIS DNCSINSSTQPEVILPPRGHFAISGDICGYHNWVFYGYYQYMPGQQERNSVSHKKKKKKK KKKKKKKKKKKKNTIIIIVYLQIFPSVQKWFVVQRYTLNSTGARPPFQVPDSRPRAHLRP TSPGDTQACPTPSVGPRSGSGRDRSAGDDPGKGVAKAGESANAVPHYHKLCSRVSHIWGN RRGQHIRSAMDKPRPGKTTFVIMVSPLPEQQMKGLDLDPEARALGPAFIHWWTSHGLIFT SEGYVKDKTHSASNGNSTVQTALGNMPLQATMNRLPDDYDPYAVEEPSDEEPALSSSEDE VDVLLHGTPDQKRKLIRECLTGESESSSEDEFEKEMEAELNSTMKTMEDKLSSLGTGSSS GNGKVATAPTRYYDDIYFDSDSEDEDRAVQVTKKKKKKQHKIPTNDELLYDPEKDNRDQA WVDAQRRGHESYKTQYRAMFVMNCSINKEEVLRYKASENRKKRRVHKKMRSNREDAAEKA ETDVEEIYHPVMCTECSTEVAVYDKDEVFHFFNVLASHS >gi568815584r:34416313_34639628|GENSCAN_predicted_CDS_4|3000_bp atgcgcgcctgcgccggccctcgcctcggagcagccatgatggaaggcctggacgacggc ccggacttcctctcagaagaggaccgcggacttaaagcaataaatgtagatcttcaaagt gatgctgctctgcaggtggacatttctgatgctcttagtgagcgggataaagtaaaattc actgttcacacaaagagttcattgccaaattttaaacaaaacgagttttcagttgttcgg caacatgaggaatttatctggcttcatgattcctttgttgaaaatgaagactatgcaggt tatatcattccaccagcaccaccaagacctgattttgatgcttcaagggaaaaactacag aagcttggtgaaggagaagggtcaatgacgaaggaagaattcacaaagatgaaacaggaa ctggaagctgaatatttggcaatattcaagaagacagttgcgatgcatgaagtgttcctg tgtcgtgtggcagcacatcctattttgagaagagatttaaatttccatgtcttcttggaa tataatcaagatttgagtgtgcgaggaaaaaataaaaaagagaaacttgaagacttcttt aaaaacatggttaaatcagcagatggagtaatcgtttcaggagtaaaggatgtagatgat ttctttgagcacgaacgaacatttcttttggagtatcataaccgagttaaggatgcatct gctaaatctgatagaatgacaagatcccacaaaatgcacaggccgggccgcatgatgcct ggagttactgtaaaagatgtgaaccagcaggagttcatcagagctctggcagccttcctc agaaagtctgggaagctgaaagtccccgaatgggtggacaccgtcaagctgctggccaag cacaaagagcttgctccctacgatgagaactggttgtacacgcgagctgcttccacagcg tggcacctgtacctctggggtggcgctggggttggctccatgaccaagatctatggggga tgtcagagaagtggcggcatgcctggccactttagccgaggctccaagaatgtggcccac cgggtcctccaagccctggaggggctaaaaatggtggaaaaggaccaagatgggggccgc aaactgacacgtcagggacagagagatctggacagaatcgctggacaggatctcctgtat cgaaggtctaggtcactagtggattatgaaaatgctaataaagcactggataaagcaaga gcaaaaaataaagatgttctacaggccgaaacttcccaacaattatgttgtcagaaattt gaaaaaatatctgagtctgcaaaacaagaacttatagattttaagacaagaagagttgct gcattcagaaaaaatttagtggaactggcagagttagaactgaagcatgcaaagatttct gataactgttctattaacagtagtactcaaccagaggtgattctgccccccagaggacat tttgcaatatctggagacatttgtggttatcacaactgggtgttctatggttactaccag tacatgcctggacaacaagagcgaaactccgtctcacataagaagaagaagaagaagaag aagaagaagaagaagaagaagaagaagaagaagaagaatacaataataataatagtgtat ttacagattttcccaagtgttcaaaaatggtttgttgtccagcgttatacacttaactcg accggtgcacggcctccctttcaggtcccagactcccggccgcgcgcccacttgcgccca accagccccggagacacccaggcctgtcccacgccgtcggtaggtccccggtccgggagc gggagagaccggagcgccggggacgaccccggcaagggcgtggctaaggcaggggaaagc gcgaacgcagtcccccactaccacaaattatgcagtcgagtttcccacatttggggaaat cgcaggggtcagcacatccggagtgcaatggataagcctcgccctgggaaaaccaccttc gtgatcatggtatctcccctgccagagcagcagatgaagggcctagacctggatccagaa gctagggctctcggtccagcattcatccactggtggacatcacatgggcttatttttacc agcgaaggttacgtgaaggacaaaacgcactcagccagcaacggaaactcaacagttcaa acagcactggggaacatgccgctgcaggccaccatgaaccggcttccggatgactacgac ccctacgcggttgaagagcctagcgacgaggagccggctttgagcagctctgaggatgaa gtggatgtgcttttacatggaactcctgaccaaaaacgaaaactcatcagagaatgtctt accggagaaagtgaatcatctagtgaagatgaatttgaaaaggagatggaagctgaatta aattctaccatgaaaacaatggaggacaagttatcctctctgggaactggatcttcctca ggaaatggaaaagttgcaacagctccgacaaggtactacgatgatatatattttgattct gattccgaggatgaagacagagcagtacaggtgaccaagaaaaaaaagaagaaacaacac aagattccaacaaatgacgaattactgtatgatcctgaaaaagataacagagatcaggcc tgggttgatgcacagagaagggggcatgaatcatacaaaactcaatatagagcaatgttt gtaatgaattgttctattaacaaagaggaggttctaagatataaagcctcagagaacagg aagaaaaggcgggtccataagaagatgaggtctaaccgggaagatgctgccgagaaggca gagacagatgtggaagaaatctatcacccagtcatgtgcactgaatgttccactgaagtg gcagtctacgacaaggatgaagtctttcattttttcaatgttttagcaagccattcctaa