GENSCAN 1.0 Date run: 6-Nov-116 Time: 08:47:59 Sequence gi568815588f:18559638_18775213 : 215576 bp : 39.96% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 74 721 648 1 0 40 38 566 0.846 42.72 1.02 PlyA + 1310 1315 6 1.05 2.02 PlyA - 6451 6446 6 1.05 2.01 Sngl - 11171 10731 441 2 0 53 39 447 0.681 32.30 2.00 Prom - 12462 12423 40 -16.48 3.11 PlyA - 12943 12938 6 1.05 3.10 Term - 13938 13598 341 1 2 24 44 345 0.229 17.41 3.09 Intr - 17984 17877 108 0 0 106 83 6 0.239 1.24 3.08 Intr - 19336 19117 220 1 1 42 68 121 0.051 2.45 3.07 Intr - 21497 21391 107 0 2 63 77 43 0.012 -0.29 3.06 Intr - 26456 26312 145 2 1 102 105 94 0.015 11.43 3.05 Intr - 36690 36571 120 0 0 98 94 30 0.030 4.37 3.04 Intr - 50289 50208 82 2 1 85 108 -9 0.005 -0.48 3.03 Intr - 54976 54811 166 2 1 96 49 163 0.052 11.30 3.02 Intr - 56656 56547 110 0 2 81 101 111 0.999 10.71 3.01 Init - 58907 58900 8 2 2 110 26 2 0.129 -3.39 3.00 Prom - 59471 59432 40 -5.95 4.00 Prom + 61815 61854 40 -3.25 4.01 Sngl + 68661 69227 567 2 0 88 43 450 0.922 36.40 4.02 PlyA + 69538 69543 6 1.05 5.00 Prom + 70072 70111 40 -6.15 5.01 Init + 71846 73122 1277 1 2 58 53 318 0.114 17.02 5.02 Intr + 99502 99640 139 1 1 -21 97 134 0.571 2.95 5.03 Intr + 99926 100046 121 1 1 48 100 67 0.018 3.05 5.04 Intr + 106938 106998 61 1 1 100 87 14 0.089 -0.63 5.05 Intr + 108803 109040 238 2 1 39 69 141 0.134 3.99 5.06 Intr + 112985 113068 84 1 0 82 84 78 0.948 5.90 5.07 Intr + 114347 114498 152 1 2 46 110 81 0.799 4.14 5.08 Intr + 131008 131069 62 1 2 63 84 48 0.003 -0.64 5.09 Intr + 150802 150951 150 2 0 32 63 154 0.617 6.51 5.10 Intr + 168080 168172 93 0 0 39 93 82 0.047 2.82 5.11 Term + 176342 176625 284 0 2 30 39 260 0.505 9.90 5.12 PlyA + 176835 176840 6 1.05 6.00 Prom + 177335 177374 40 -7.45 6.01 Init + 178261 178441 181 0 1 48 34 129 0.330 2.89 6.02 Intr + 178485 178760 276 1 0 23 80 213 0.144 10.57 6.03 Term + 187052 187068 17 0 2 122 38 7 0.055 -3.48 6.04 PlyA + 187571 187576 6 1.05 7.04 PlyA - 188023 188018 6 1.05 7.03 Term - 193252 193089 164 2 2 62 31 134 0.481 2.22 7.02 Intr - 194177 194033 145 2 1 59 75 52 0.206 -0.07 7.01 Intr - 215118 214981 138 0 0 55 87 49 0.069 1.24 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 6771 6406 366 0 0 38 40 301 0.927 16.14 S.002 Init + 26665 26818 154 0 1 60 73 148 0.899 10.69 S.003 Term - 54976 54771 206 2 2 96 48 194 0.918 12.75 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:18559638_18775213|GENSCAN_predicted_peptide_1|215_aa MKWNGQWNGMDNGVENGREWNAEWNGSEWRIEWKREWNGMEKGMGNGMQWGFECGMECNG KGNGMEWRMDWNGMENGLEWNREWFGMEWRIEWNREWNAMENGMEWRLSWNGEWNEMENG MEWNGEWNGERKGREWNGIENAEWIIEWNGEWNGMENELEWKMEWNGEWYGMERRMAENG MDNGRQWNREWYGMENGIEWIMEWRMEWYEKCNVM >gi568815588f:18559638_18775213|GENSCAN_predicted_CDS_1|648_bp atgaaatggaatggacaatggaatggaatggacaatggagtggagaatggaagggaatgg aatgcagaatggaatggaagcgaatggagaatagaatggaaaagagaatggaatggaatg gagaaaggaatggggaatggaatgcagtggggatttgaatgcggaatggaatgcaatgga aaggggaatggaatggaatggagaatggattggaatggaatggagaatggactggaatgg aatagagaatggtttggaatggaatggagaatcgaatggaatagagaatggaatgcaatg gagaatggaatggaatggagattgagctggaatggagaatggaatgaaatggaaaatgga atggaatggaatggagaatggaatggagaaaggaagggaagggaatggaatggaatcgag aatgcagaatggataatagaatggaatggggaatggaatggaatggagaatgaactggaa tggaaaatggaatggaatggagaatggtatggaatggaacggagaatggcagagaatgga atggataatggacggcaatggaatagagaatggtatggaatggagaatggaatagaatgg ataatggaatggagaatggaatggtatgagaaatgtaatgtaatgtaa >gi568815588f:18559638_18775213|GENSCAN_predicted_peptide_2|146_aa MEDGMENGMEGSMEWNGEWNGEWNEMEWRMEWNREWNGMEHEMEWRIEWNGMDDGMENGM EWRMEWNGQWNGMENGMDTGLEWRMEKSGMEWRMEWRMEWKGGWNGMESGTKWIGEWNGM ENGMEWRMEWRMEGNGMEYGMEWRIK >gi568815588f:18559638_18775213|GENSCAN_predicted_CDS_2|441_bp atggaggatggtatggagaatggaatggaagggagcatggaatggaatggagaatggaat ggagaatggaatgaaatggagtggagaatggaatggaatagagaatggaatggaatggag catgaaatggaatggagaatagaatggaatggaatggatgatggtatggagaatggaatg gaatggagaatggaatggaatggacagtggaatggaatggagaatggaatggacactgga ttggaatggagaatggaaaagagcggaatggaatggaggatggaatggagaatggaatgg aaaggaggatggaatggaatggaaagtggaacgaaatggattggagaatggaatggaatg gagaatggaatggaatggagaatggaatggagaatggaagggaatggaatggaatatggt atggaatggaggattaaatag >gi568815588f:18559638_18775213|GENSCAN_predicted_peptide_3|468_aa MPGKNIKKQQCEAIVGAQCGNAVLRGAHVYAPGIVSASQFMKAGDVISVYSDIKGKCKKG AKEFDGTKVFLGNGISELSRKEIFSGLPELKYVIRGMGIRMTEPVYLSPSFDSVLPRYLF LQNLPSALVSHVLNPQPGEKILDLCAAPGGKTTHIAALMHDQGEVIALDKIFNKVEKIKQ NALLLGLNSIRAFCFDGTKAVKLDMVEDTEVRAGHCPCWQPTPREESGEVTPDPRSMSAY KTPSQKYFSENIRSGECRDDTLPLPLTGSPFTDVSYVRHMIEGECAEMQPIVSSSWYKII DTELGHLIPFLKQVACGLSALSLQTSLTCLGLVVLLDFPKGETSLDKVEIVKGDFKLVVK FGQKDCGRREWRMENGMEWNVEWYGECNGMENGMEYGMEWRMEWKGKWNRWNGRENGMEW RMERNWKLNEEWKGMGNGMEWRMEWNGEWNGMEYAEWNAMDNGKWNGE >gi568815588f:18559638_18775213|GENSCAN_predicted_CDS_3|1407_bp atgcctggaaagaatattaaaaaacaacagtgtgaagccattgttggagcccagtgtggc aatgcagttttaagaggagcccatgtctatgccccaggaattgtgtcagcatcacaattt atgaaagctggagatgttatttctgtatactctgatattaaaggaaaatgtaagaaagga gccaaagaatttgatggaacaaaagtatttcttggaaatgggatttctgaactaagccgc aaagaaatcttcagtggattacctgaactgaagtatgtcatcagaggcatgggcataaga atgacagaaccagtatatctcagcccttcatttgacagtgtactgccccgttacttattt ttacaaaatttgccatctgccttagtaagtcatgtactaaatcctcaacctggagagaag attctagacttgtgtgcagcacctggagggaaaacaacacacattgcagcactaatgcat gatcagggagaagttatagcactggataaaatcttcaacaaagtagaaaaaatcaaacag aatgccttattgttagggctgaattccatcagggcattttgttttgatggaacaaaggcg gttaaacttgatatggtggaggacacagaagtgcgagcaggccactgtccatgctggcag cccaccccaagggaagaatcaggagaagtaacgccagaccccagaagtatgtcagcatac aaaaccccaagtcagaagtacttcagtgagaatattagaagtggtgaatgcagagatgac acattaccattaccgttgacaggttcaccatttacagatgtgtcctatgtaagacatatg attgagggtgagtgcgctgaaatgcagcccatagtttcttcgtcatggtacaagatcatt gatacggagcttgggcaccttattccatttttaaaacaggtagcttgtgggctttcagct ctttctcttcaaacttctttaacatgtcttggcttagttgttctgcttgattttccaaaa ggagaaacttctctggataaggtggagatagttaagggagattttaaacttgttgtaaaa tttggccaaaaagactgtgggagaagggaatggagaatggagaacggaatggaatggaac gtagaatggtatggagaatgcaatggaatggagaatggaatggaatatggaatggaatgg agaatggaatggaaaggaaaatggaaccgatggaatggaagggagaatggaatggaatgg agaatggaaaggaattggaaattgaatgaggaatggaaaggaatgggtaatggaatggaa tggagaatggaatggaatggagaatggaatgggatggagtatgcagaatggaatgcaatg gacaatggaaaatggaatggagaatga >gi568815588f:18559638_18775213|GENSCAN_predicted_peptide_4|188_aa MGKKQSRKTGNSKKQSTSPPPKEYSSSPAMEQSWMENDFEELREEGFRRPNYSELREEIQ TKGIEVENFEKSLEESITRITNTEKCLKELRELKTKARELREECRSLRSQCDQLEERVSA MEDELNEMKQEGKFREKRIKRNEESLQEIWDYVKRPNLRRLVYQKVTGRMESSWKTLCRI LSRRTSPL >gi568815588f:18559638_18775213|GENSCAN_predicted_CDS_4|567_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcacctctcctcct ccaaaggaatacagttcctcaccagcaatggaacaaagctggatggagaatgactttgag gagctgagagaagaaggcttcagacgaccaaattactccgagctacgggaggaaattcaa accaaaggcattgaagttgaaaactttgaaaaaagtttagaagaatctataactagaata accaatacagaaaagtgcttaaaggagctgagggagctgaaaaccaaggctcgagaacta cgtgaagaatgtagaagcctcaggagccaatgtgatcaactggaagaaagggtatcagcg atggaagatgaactgaatgaaatgaagcaagaagggaagtttagagaaaaaagaataaaa agaaatgaggaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtcga ttggtgtaccagaaagtgacggggagaatggaatcaagttggaaaacactctgcaggata ttatccaggagaacttccccactctag >gi568815588f:18559638_18775213|GENSCAN_predicted_peptide_5|886_aa MIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS KRIKCLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIC RFNAIPIKLPMTFFTEVEKTTLKFIWNQKRAHITKSILSEKNKAGGITLPDFKLYYKATV TKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLVNKWCWQNWLAI CRKLKLDPFLTPYTKIISRWIKDLNVRPKTIKTLEENLGITIQDLGMGKDFMSKTPKAMA TKAKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFTTYSSDKGLISRIYNELKHIYKK KTNNPIKKWAKDMNRHFSKEDIYAAKRHTKKCSPSLAIREMQIKTTMRCHLTPVRMAIIK KSGNNRLTSEGARTKGNFPPPRDTANSQPLAVWDSPTGSETEENATSKAWVAPAPRWGTR RSGTCCRGTPRPAPVLVMGLIFAKLWSLFCNQEHKVIIVGLDNAGKTTILYQLRLIDLKN TEIKVASDSQDLQELLRCLFFNSLMNEVVHTSPTIGSNVEEIVVKNTHFLMWDIGGQESL RSSWNTYYSNTEFIILVVDSIDRERLAITKEELYRMLAHEDLRKAAVLIFANKQDMKGCM TAAEISKYLTLSSIKDHPWHIQSCCALTGEGDEVTKTDFGILDLPVLRSEVCGFGGVVAE EGGTGIPGPTATPDVALLREGTPGKLLLRLAGVPCSRARGKEPRLKPEESTVGLCCVMLS WRTSLHENILEAAGAGAYACKTIKDNQKRYNERMLGLGLTSEEKQKRATLSALDGEPVPQ GSLPNHVPFLLTGGRTAVFAAAEATWVRDPGTGVLIMSEDPELPYM >gi568815588f:18559638_18775213|GENSCAN_predicted_CDS_5|2661_bp atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac accaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatgcctaggaatccaacttacaagggatgtgaaggacctcttcaaggag aactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacattcca tgctcatgggtaggaagaatcaatatcgtaaaaatggccatactgcccaaggtaatttgt agattcaatgccatccccatcaagctaccaatgactttcttcacagaagtggaaaaaact actttaaagttcatatggaaccaaaagagagcccacatcaccaagtcaatcctaagcgaa aagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagag ccctcagaaataatgccgcatatctacaactatctgatctttgacaaacctgagaaaaac aagcaatggggaaaggattccctagttaataaatggtgctggcaaaactggctagccata tgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcatttcaagatgg attaaagacttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcatt accattcaggacttaggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggca accaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaa gaaactaccatcagagtgaacaggcaacctacaaaatgggagaaaattttcacaacctac tcatctgacaaagggctaatatccagaatctacaatgaactcaaacacatttacaagaaa aaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctcaaaagaa gacatttatgcagccaaaagacacacgaaaaaatgctcaccatcactggccatcagagaa atgcaaatcaaaaccacaatgagatgccatctcacacccgttagaatggcaatcattaaa aagtcaggaaacaacaggctgacatcagaaggagcacgcaccaagggaaatttcccaccc ccccgcgataccgcaaattcccagccgctggccgtttgggactctcctaccggaagtgaa accgaagaaaacgccacgtcgaaggcgtgggttgcccccgcgccgcggtgggggacccgg cgcagcggcacctgctgccgagggaccccgcggcccgccccggtgctcgtgatggggctg atcttcgccaaactgtggagcctcttctgtaaccaagaacacaaagtaattatagtggga ctggataatgcagggaaaaccaccattctttaccaattaaggttgattgatcttaaaaat acagagataaaagttgcttcagattctcaagatttacaagagcttttacggtgtctgttt ttcaacagcttaatgaatgaagtggttcatacttctccaaccataggaagcaatgttgaa gaaatagttgtgaagaacactcattttcttatgtgggatattggtggtcaggagtctctg cgatcatcctggaacacatattactcaaatacagagttcatcattcttgttgttgatagc attgacagggaacgactagctattacaaaagaagaattatacagaatgttggctcatgag gatttacggaaggctgcagtccttatctttgcaaataaacaggatatgaaagggtgtatg acagcagctgaaatctcgaaatacctcacccttagttcaattaaggatcatccatggcac attcaatcctgctgtgctctcacaggagaaggggatgaagtaacaaagactgatttcgga attttagatttacccgtattgagatctgaggtctgcgggtttggaggggtggtcgcggag gaagggggcacaggtatacctggccccacagcgaccccagacgttgccctcctcagggag gggacccccgggaagctcctgctgcggctggcgggtgtcccatgcagtagggcccggggc aaggagccaagattaaagcccgaagaaagcaccgtgggcttgtgttgtgtaatgctttca tggcgtacctcattgcatgaaaatatccttgaagctgcaggagctggtgcatatgcctgc aagactataaaggataaccaaaaaagatataatgaaagaatgctagggttagggctgaca tcagaagagaagcaaaaaagggccacattatctgctctagatggagagccagttcctcaa ggcagcttgccaaatcatgtccctttcctgctaactggcggaagaactgctgtttttgct gcagccgaagccacctgggttcgggatcctgggaccggggtcctcattatgtctgaagat cctgagctgccatacatgtga >gi568815588f:18559638_18775213|GENSCAN_predicted_peptide_6|157_aa MIPIAVVQPVGVSSGKLLITLKDGGKVETDHIAAAVSLKPSAELAENWWAGNRPRFWWLL GDAACFYDINLGRRRLEHHDQAFVSRRLAGENMTGAAKPYWHQSVCWNDLSPDAGYEAIG PQLVFLQKQLCKTTQNLPKSNQELLSAQRVKHGAVHN >gi568815588f:18559638_18775213|GENSCAN_predicted_CDS_6|474_bp atgatacccattgctgttgtgcaaccggttggagtcagcagtggcaagttactcatcacg ttgaaagatggtgggaaggtagaaactgaccacatagcagcagccgtgagcctgaagccc agtgctgagttggccgagaactggtgggctggaaataggcccagattttggtggcttctg ggagatgctgcatgcttctatgatataaatctggggaggaggcggttagagcaccatgat caagcttttgtgagcagaagattggctggagaaaatatgactggagctgctaagccgtat tggcatcagtcagtgtgctggaatgatttgagccctgatgctggctatgaagctattggc ccacagttggtgtttttgcaaaagcagctgtgcaagacaacccaaaatctgccaaagagc aatcaggagctgttatctgctcagagagtgaaacacggcgctgtgcacaattaa >gi568815588f:18559638_18775213|GENSCAN_predicted_peptide_7|148_aa RQKDLAVLTVQTVSSVCLIGSRLCHMPRKSYTTAHCLSFPIYKMGEQHRIMRSVSYNFRL PHRKRESHQSLPLMLFGYVPTHGEVASLCAVNTRARQGTSIEGYALTDGATVSAHLDKGG ERVLIPDARGPCCCVIPLLARVRPHRLN >gi568815588f:18559638_18775213|GENSCAN_predicted_CDS_7|447_bp aggcaaaaggatttagcagttttaacagtacagactgtgtcatcagtttgcctgattgga tcgaggctgtgccacatgccacggaaaagttacacaactgctcattgcctcagttttccg atctataaaatgggtgagcagcatcgcataatgaggagtgtgtcctataacttcaggctc ccccacagaaagagagagagccatcagtctcttcctttgatgttgtttggctatgtcccc acccatggtgaagtggcgtcattgtgtgcggtaaatacccgagcaaggcaaggtacttct atagaagggtacgcccttacagatggagcaacggtgagcgcacacttggacaagggaggg gaaagggttcttatccctgatgcacgtggcccctgctgctgtgtcattcccctattggct agagttagaccacacaggctaaactaa