GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:09:22 Sequence gi568815588r:13178304_13400042 : 221739 bp : 45.67% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2139 2304 166 0 1 75 58 114 0.537 7.06 1.02 Intr + 4630 4797 168 0 0 48 94 158 0.996 12.54 1.03 Intr + 7861 7977 117 0 0 119 8 85 0.725 4.26 1.04 Intr + 10578 10777 200 2 2 126 64 194 0.867 18.95 1.05 Intr + 14148 14265 118 0 1 88 95 70 0.775 8.17 1.06 Intr + 16748 16966 219 1 0 36 88 122 0.560 5.50 1.07 Intr + 19320 19464 145 2 1 90 79 78 0.991 6.96 1.08 Intr + 20386 20504 119 2 2 92 94 56 0.796 6.78 1.09 Intr + 23118 23231 114 2 0 69 76 94 0.659 6.94 1.10 Intr + 25916 26061 146 1 2 60 80 124 0.518 7.88 1.11 Intr + 38982 39737 756 0 0 70 53 188 0.338 4.08 1.12 Term + 40085 40121 37 2 1 93 37 27 0.445 -5.09 1.13 PlyA + 40391 40396 6 1.05 2.06 PlyA - 43482 43477 6 1.05 2.05 Term - 43897 43800 98 2 2 115 42 93 0.978 5.53 2.04 Intr - 51406 51308 99 2 0 101 95 181 0.993 20.18 2.03 Intr - 55330 55235 96 2 0 78 91 88 0.913 8.08 2.02 Intr - 55497 55432 66 1 0 116 113 27 0.967 6.88 2.01 Init - 55955 55898 58 2 1 90 75 83 0.959 6.74 2.00 Prom - 63639 63600 40 -6.36 3.00 Prom + 76137 76176 40 -2.66 3.01 Init + 81569 81746 178 1 1 97 40 143 0.078 9.99 3.02 Intr + 85277 85423 147 0 0 77 85 16 0.090 0.41 3.03 Intr + 94405 94530 126 2 0 48 100 58 0.220 3.65 3.04 Term + 96907 96944 38 2 2 116 44 34 0.343 -0.70 3.05 PlyA + 97151 97156 6 1.05 4.07 PlyA - 98570 98565 6 1.05 4.06 Term - 100051 99998 54 1 0 85 44 53 0.776 -1.84 4.05 Intr - 102807 102673 135 0 0 55 95 181 0.971 16.26 4.04 Intr - 105536 105387 150 2 0 82 121 187 0.997 21.76 4.03 Intr - 110238 110057 182 1 2 59 92 166 0.438 13.69 4.02 Intr - 113609 113522 88 2 1 84 59 21 0.133 -1.66 4.01 Init - 116238 116125 114 0 0 47 55 138 0.133 6.61 4.00 Prom - 124758 124719 40 -1.96 5.02 PlyA - 125446 125441 6 1.05 5.01 Sngl - 130198 129842 357 1 0 63 41 243 0.927 13.16 5.00 Prom - 138896 138857 40 -3.16 6.09 PlyA - 139158 139153 6 1.05 6.08 Term - 141053 140839 215 0 2 91 33 78 0.810 -0.01 6.07 Intr - 144744 144532 213 1 0 80 92 286 0.580 26.89 6.06 Intr - 150147 150048 100 0 1 68 80 128 0.654 9.68 6.05 Intr - 151485 151395 91 2 1 90 78 84 0.995 7.60 6.04 Intr - 155668 155514 155 1 2 96 103 55 0.998 6.67 6.03 Intr - 158047 157940 108 1 0 122 110 116 0.999 17.68 6.02 Intr - 159572 159478 95 0 2 65 115 36 0.694 3.68 6.01 Init - 169177 169063 115 1 1 79 32 204 0.121 12.27 6.00 Prom - 205492 205453 40 -0.66 7.03 PlyA - 205549 205544 6 -0.45 7.02 Term - 208215 207993 223 2 1 108 44 66 0.488 0.59 7.01 Intr - 214076 213955 122 2 2 108 116 13 0.840 5.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 169800 169929 130 2 1 69 -18 264 0.839 12.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:13178304_13400042|GENSCAN_predicted_peptide_1|768_aa XRPRVSSTEMNKKMTGRKLIRLSQIKEKMAREKLEEIDWVTFGVILKKVTPQSVNSGKTF SIWKLNDLRDLTQCVSLFLFGEVHKALWKTEQGTVVGILNANPMKPKDGSEEVCLSIDHP QKVLIMGEALDLGTCKAKKKNGEPCTQTVNLRDCEYCQYHVQAQYKKLSAKRADLQSTFS GGRIPKKFARRGTSLKERLCQDGFYYGGVSSASYAASMDYGEPKTSHQVHLGLSTLEATE AADVGDEEKEIRRNTEASSSEVESPAVPSSSRQPPAQPPRTGSEFPRLEGAPATMTPKLG RGVLEGDDVLFYDESPPPRPKLSALAEAKKLAAITKLRAKGQVLTKTNPNSIKKKQKDPQ DILEVKERVEKNTMFSSQAEDELEPARKKRREQLAYLESEEFQKILKAKSKHTGILKEAE AEMQERYFEPLVKKEQMEEKMRNIREVKCRVVTCKTCAYTHFKLLETCVSEQHEYHWHDG VKRFFKCPCGNRSISLDRLPNKHCSTYTKIGTIQRRLAWPLRKDDTQIREAFHIFMPHIY NYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPK TIKTLEENLGITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQ PTTWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHM KKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRDMDEIGNHHSQ >gi568815588r:13178304_13400042|GENSCAN_predicted_CDS_1|2307_bp nngcggcctcgagtatcctccacagaaatgaacaagaaaatgaccggccgaaaactgatc agactgtctcagatcaaggaaaagatggccagagagaagctggaagaaatagattgggtg acatttggggttatattgaagaaggttacgccacagagtgtgaatagtggaaaaaccttc agcatatggaaactgaatgatcttcgtgacctgacacaatgtgtgtccttgttcttattt ggagaagttcacaaagcgctctggaagacggagcaggggactgtcgtagggatcctcaat gccaaccccatgaagcccaaggatggttcagaggaggtgtgtttatctatcgatcatcct cagaaggtcttaattatgggtgaagctcttgacctgggaacctgtaaagccaagaagaag aatggagagccgtgcacgcagactgtgaatttgcgtgactgtgagtactgtcagtaccat gtccaggctcagtacaagaagctcagcgcaaagcgtgcggatctgcagtccaccttctct ggaggacgaattccaaagaagtttgcccgcagaggcaccagcctcaaagaacggctgtgc caagatggcttttactacggaggggtttcttctgcctcgtatgcagcttcaatggattat ggggagcccaaaaccagccatcaagtccatctcggcctcagcactcttgaagcaacagaa gcagcggatgttggagatgaggagaaggaaatcagaagaaatacagaagcgagctcaagt gaagttgagagcccagctgtgccatcttcatcaagacagccccctgctcagcctccacgg acaggatccgagttccccaggctggagggagccccggccacaatgacgcccaagctgggg cgaggtgtcttggaaggagatgatgttctcttttatgatgagtcaccaccaccaagacca aaactgagtgctttagcagaagccaaaaagttagctgctatcaccaaattaagggcaaaa ggccaggttcttacaaaaacaaacccaaacagcattaagaagaaacaaaaggaccctcag gacatcctggaggtgaaggaacgtgtagaaaaaaacaccatgttttcttctcaagctgag gatgaattggagcctgccaggaaaaaaaggagagaacaacttgcctatctggaatctgag gaatttcagaaaatcctaaaagcaaaatcaaaacacacaggcatcctgaaagaggccgag gctgagatgcaggagcgctactttgagccactggtgaaaaaagaacaaatggaagaaaag atgagaaacatcagagaagtgaagtgccgtgtcgtgacatgcaagacgtgcgcctatacc cacttcaagctgctggagacctgcgtcagtgagcagcatgaataccactggcatgatggt gtgaagaggtttttcaaatgtccctgtggaaacagaagcatctccttggacagactcccg aacaagcactgcagcacatatactaaaattggaacgatacagagaagattagcatggccc ctgcgcaaggatgacacgcaaattcgtgaagcgttccatatttttatgccgcatatctac aactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctattt aataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttc cttacaccttatacaaaaatcaattcaagatggattaaagatttaaacgttagacctaaa accataaaaaccctagaagaaaacctaggcattaccattcaggacataggcgtgggcaag gacttcatgtccaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggat ctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaa cctacaacatgggagaaaattttcacaacctactcatctgacaaagggctaatatccaga atctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgg gcgaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatg aaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccactatgagatat catctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacagggacatggat gaaattggaaaccatcattctcagtaa >gi568815588r:13178304_13400042|GENSCAN_predicted_peptide_2|138_aa MTWRQAVLLSCFSAVVLLSMLREGTSVSVGTMQMAGEEASEDAKQKIFMQESDASNFLKR RGKRSPKSRDEVNVENRQKLRVDELRREYYEEQRNEFENFVEEQNDEQEERSREAVEQWR QWHYDGLHPSYLYNRHHT >gi568815588r:13178304_13400042|GENSCAN_predicted_CDS_2|417_bp atgacttggagacaggccgtcctgctgtcttgcttctccgccgtggtgctcctgtctatg ctgagagagggaaccagtgtatctgtgggcaccatgcagatggcgggagaagaggcgagt gaagatgcaaaacagaagattttcatgcaggaatcagatgcctcgaatttcctcaagagg cgcggcaagcggtcccccaagtccagagatgaggtcaatgtggaaaacaggcagaagctt cgggttgatgagctgcggagagaatattacgaggaacaaaggaatgaatttgagaacttc gtggaggaacaaaacgatgagcaggaagagaggagccgggaggctgtggagcagtggcgc cagtggcactatgacggcctgcacccatcctatctctacaaccgccaccacacctga >gi568815588r:13178304_13400042|GENSCAN_predicted_peptide_3|162_aa MPEPPTHSMGSCVARASPTSTTPCSTVPSPIDHPRAEECECTAQDWQAAPPAAPVPDPLG VVIVPGTSWSLSKYLLKGIDVEGLVESWEKWEGWPGNCEGTRAGVPVRASKRVVNFTCFK SKLNQCKKSCFFASPTNTVFQEMPSLKKNEGHLTKAFNGLQT >gi568815588r:13178304_13400042|GENSCAN_predicted_CDS_3|489_bp atgcctgagcctcccacccactccatgggctcctgtgtggcccgagcctccccgacgagc accaccccctgctccacggtgcccagtcccatcgaccacccaagggctgaggaatgcgag tgcacggcgcaggactggcaggcagctccacctgcagccccagtgccggatccactaggg gttgtgatagtgcctggaacatcatggtcactcagtaaatatttactaaagggtattgac gtggagggcttggttgaaagctgggagaagtgggaagggtggccaggcaactgtgagggg acaagggctggagtgccggtgcgagcttctaaacgagtagtaaatttcacgtgcttcaaa tcaaaattaaatcaatgcaaaaagtcctgtttctttgcctccccaacaaacaccgtgttc caagaaatgccaagcctgaagaagaatgaaggacacctaacaaaggccttcaatgggctt cagacttga >gi568815588r:13178304_13400042|GENSCAN_predicted_peptide_4|240_aa MRDVTISKSEYAPSEKMITKVQDFQEDKELFRYCTLPEILKYVECFTGPNIMAMHTMLIN KPPDSGNCKKTSRHPLHQDLHYFPFRPSDLIVCAWTAMEHISRNNGCLVVLPGTHKGSLK PHDYPKWEGGVNKMFHGIQDYEENKARVHLVMEKGDTVFFHPLLIHGSGQNKTQGFRKAI SCHFASADCHYIDVKGTSQENIEKEVVGIAHKFFGAENSVNLKDIWMFRARLVKGERTNL >gi568815588r:13178304_13400042|GENSCAN_predicted_CDS_4|723_bp atgagagatgtgaccatttcgaaatccgaatatgctccaagtgagaagatgatcacgaag gtccaggatttccaggaagataaggagctcttcagatactgcactctccccgagattctg aaatatgtggagtgcttcactggacctaatattatggccatgcacacaatgttgataaac aaacctccagattctggtaattgcaagaagacgtcccgtcaccccctgcaccaggacctg cactatttccccttcaggcccagcgatctcatcgtttgcgcctggacggcgatggagcac atcagccggaacaacggctgtctggttgtgctcccaggcacacacaagggctccctgaag ccccacgattaccccaagtgggaggggggagttaacaaaatgttccacgggatccaggac tacgaggaaaacaaggcccgggtgcacctggtgatggagaagggcgacactgttttcttc catcctttgctcatccacggatctggtcagaataaaacccagggattccggaaggcaatt tcctgccatttcgccagtgccgattgccactacattgacgtgaagggcaccagtcaagaa aacatcgagaaggaagttgtaggaatagcacataaattctttggagctgaaaatagcgtg aacttgaaggatatttggatgtttcgagctcgacttgtgaaaggagaaagaaccaatctt tga >gi568815588r:13178304_13400042|GENSCAN_predicted_peptide_5|118_aa MRRHLEFKKLHASIEFLTMAKNKFRGQKSRNVFRIASQKSFKAKNRVKPITANLKKIHIM NDEKVSRVNKAFGSVQEELPHFSKGLSLDLLQKELIPQQRHQSKPVNVDEATKLMAQL >gi568815588r:13178304_13400042|GENSCAN_predicted_CDS_5|357_bp atgcggcgtcacctggagtttaagaagctgcacgcaagtattgaattcctgacaatggcc aagaacaaatttagagggcagaagtccaggaacgtatttcgcatagctagccaaaaaagc tttaaggctaaaaacagagtaaaaccaattaccgctaatcttaagaagatacacattatg aatgatgaaaaagtcagcagagtaaacaaagcttttggaagtgtacaagaggaacttcca catttctcaaaaggcctttctcttgatcttctgcagaaagagctgattcctcagcagcgt catcagagcaaaccagttaatgttgatgaagctacaaaactaatggctcagctgtaa >gi568815588r:13178304_13400042|GENSCAN_predicted_peptide_6|363_aa MARPGPRAAASAPPPARSPGLLGLPRRRRGPEPEPEPEGTSDKYQIIGNLYFFRTSVRFL LWSNTLETGQGRIACANVLSDLYAMGVTECDNMLMLLGVSNKMTDRERDKVMPLIIQGFK DAAEEAGTSVTGGQTVLNPWIVLGGVATTVCQPNEFIMPDNAVPGDVLVLTKPLGTQVAV AVHQWLDIPEKWNKIKLVVTQEDVELAYQEAMMNMARLNRTAAGLMHTFNAHAATDITGF GILGHAQNLAKQQRNEVSFVIHNLPVLAKMAAVSKACGNMFGLMHGTCPETSGGLLICLP REQAARFCAEIKSPKYGEGHQAWIIGIVEKGNRTARIIDKPRIIEVAPQVATQNVNPTPG ATS >gi568815588r:13178304_13400042|GENSCAN_predicted_CDS_6|1092_bp atggcccggccggggccccgagctgccgcctccgccccgccgccggcccgcagcccgggc ctgctgggccttccgaggcggcggcgagggcctgagcctgagcctgagcccgaaggtacc agtgacaaataccagattattggaaacctatatttcttccgcacatctgtccgttttctc ctctggagcaatactcttgagactgggcagggcaggatagcgtgtgccaatgtcctcagt gacctctatgcaatgggggtcacggaatgtgacaatatgctgatgctccttggagtcagt aataaaatgaccgacagggaaagggataaagtgatgcctctgattatccaaggttttaaa gacgcagctgaggaagcaggaacatctgtaacaggcggccaaacagtactaaacccctgg attgtcctgggaggagtggctaccactgtctgccaacccaatgaatttatcatgccagac aatgcagtgccaggggacgtgctggtgctgacaaaacccctggggacacaggtggcagtg gctgtgcaccagtggctggatatccctgagaaatggaataagattaaactagtggtcacc caagaagatgtagagctggcctaccaggaggcgatgatgaacatggcgaggctcaacagg acagctgcaggactcatgcacacgttcaatgcccacgccgccactgacatcacgggcttc gggattttgggccatgcgcagaacctggccaagcagcagaggaacgaggtgtcgtttgta attcacaacctcccggtgctggccaagatggctgcggtgagcaaggcctgcggaaacatg ttcggcctcatgcacgggacctgcccggagacttcaggcggccttctgatctgtttacca cgtgagcaagcagctcggttctgtgcagagataaagtcccccaaatatggtgaaggccac caagcatggattattgggattgtagagaagggcaaccgcacagccagaatcatagacaaa ccccggatcatcgaggtcgcaccacaagtggccactcaaaatgtgaatcccacacccggg gccacctcttaa >gi568815588r:13178304_13400042|GENSCAN_predicted_peptide_7|114_aa DVLTLCPPSHSKHNHEFQSLQGAVTGAGKRKVIQLPIGDERFPNSHYLDPTHFPQKAAEN NNLSQKAIRNYSLIGGLPALFTSLFQNHWEHQQILPTCEMGRTLAVLPAETALG >gi568815588r:13178304_13400042|GENSCAN_predicted_CDS_7|345_bp gatgtcctcaccctctgcccaccttcccactcaaagcacaaccatgagttccaatcatta cagggtgctgtgacaggtgctgggaaaagaaaggtcatacaactgccaataggggatgaa aggttccccaattcccattacttagatcctactcatttcccacagaaggctgcagagaac aataacctcagccagaaagcaatcaggaattattcactgattggtgggttgcctgcactt ttcacatctcttttccagaaccactgggagcaccagcagatcctgcccacctgtgagatg ggcagaaccctggccgtcctgcctgcggagacagctttaggatga