GENSCAN 1.0 Date run: 7-Nov-116 Time: 14:28:30 Sequence gi568815595f:127823062_128028118 : 205057 bp : 42.36% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 8621 8746 126 0 0 10 80 158 0.069 6.93 1.02 Intr + 17859 17941 83 1 2 58 86 66 0.040 1.84 1.03 Intr + 28669 28781 113 0 2 93 67 45 0.112 1.16 1.04 Intr + 30877 31100 224 1 2 12 76 211 0.163 9.15 1.05 Intr + 32060 32198 139 0 1 24 92 68 0.616 -0.60 1.06 Intr + 33767 33864 98 2 2 37 119 64 0.685 3.13 1.07 Term + 34643 34818 176 0 2 73 38 109 0.535 1.34 1.08 PlyA + 35142 35147 6 1.05 2.03 PlyA - 36129 36124 6 1.05 2.02 Term - 43231 43089 143 2 2 15 41 131 0.673 -1.79 2.01 Init - 46383 46215 169 0 1 56 75 104 0.711 5.65 2.00 Prom - 47619 47580 40 -3.25 3.08 PlyA - 47675 47670 6 1.05 3.07 Term - 48409 48284 126 1 0 72 42 132 0.533 4.30 3.06 Intr - 65582 65486 97 1 1 91 103 71 0.135 7.99 3.05 Intr - 66720 66662 59 0 2 114 81 17 0.135 0.36 3.04 Intr - 70182 70045 138 0 0 84 98 85 0.110 8.84 3.03 Intr - 70521 70436 86 1 2 80 54 90 0.065 3.42 3.02 Intr - 79544 79490 55 2 1 71 87 33 0.049 -0.87 3.01 Init - 81075 80974 102 0 0 52 88 65 0.141 3.19 3.00 Prom - 87605 87566 40 -5.55 4.00 Prom + 90080 90119 40 -3.35 4.01 Init + 91315 91526 212 0 2 90 44 124 0.985 6.50 4.02 Term + 92095 92533 439 1 1 126 50 277 0.782 21.46 4.03 PlyA + 92930 92935 6 1.05 5.00 Prom + 96722 96761 40 -7.25 5.01 Init + 100001 101070 1070 1 2 72 58 560 0.884 45.25 5.02 Intr + 104703 104973 271 0 1 134 42 193 0.800 15.92 5.03 Intr + 107072 107222 151 1 1 55 86 167 0.674 12.01 5.04 Intr + 135331 135369 39 2 0 80 83 56 0.177 1.58 5.05 Intr + 140128 140325 198 2 0 107 80 181 0.919 17.60 5.06 Term + 151299 151453 155 1 2 105 48 118 0.595 6.60 5.07 PlyA + 151914 151919 6 1.05 6.00 Prom + 156299 156338 40 -6.05 6.01 Init + 156992 157047 56 1 2 64 75 35 0.142 0.61 6.02 Intr + 159023 159112 90 2 0 126 67 28 0.125 2.69 6.03 Intr + 167039 167324 286 2 1 73 67 190 0.607 11.82 6.04 Intr + 167447 167619 173 1 2 82 52 155 0.608 9.12 6.05 Intr + 167755 167852 98 1 2 68 34 54 0.360 -3.27 6.06 Intr + 168041 168601 561 0 0 107 69 200 0.072 11.86 6.07 Intr + 171256 171382 127 2 1 54 89 44 0.045 -0.08 6.08 Intr + 176650 176811 162 1 0 34 15 165 0.344 1.97 6.09 Intr + 178632 178766 135 0 0 105 80 107 0.630 10.36 6.10 Intr + 178889 178948 60 2 0 104 88 38 0.498 2.33 6.11 Term + 178998 179148 151 0 1 5 48 135 0.526 -2.40 6.12 PlyA + 179427 179432 6 1.05 7.08 PlyA - 179822 179817 6 -0.45 7.07 Term - 180205 180023 183 1 0 57 54 128 0.214 2.86 7.06 Intr - 183146 182942 205 1 1 65 11 207 0.963 8.98 7.05 Intr - 183873 183686 188 0 2 62 76 92 0.496 2.97 7.04 Intr - 184193 183918 276 2 0 28 60 222 0.866 10.19 7.03 Intr - 185737 185699 39 1 0 120 79 3 0.000 0.10 7.02 Intr - 199229 199086 144 2 0 101 30 62 0.290 1.26 7.01 Intr - 199967 199851 117 2 0 87 62 70 0.517 4.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 184929 184744 186 0 0 51 30 161 0.981 5.71 S.002 Term + 189934 190129 196 1 1 53 43 223 0.911 10.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:127823062_128028118|GENSCAN_predicted_peptide_1|319_aa XADGFEDLELKKKVRPETEMWVLAGNGPCSCGGECADPGEWAAHTGPEGQHILCGWMSMC PERPSPWEWKDSVENKTFSCISTFKKEKSKGPLLTQAQDCGWALDEARRGSTAAQSWKLE RRGSMEMTHLAAPGRPSPELAMEKDEAQPDLPHRIPGKPRTSRHSMSLAVEVTRMVTWRL QEGRWRQTPHKNAETEKEKIKNQNDRMHLCSFPIAAIKKELHILWLKTTLGTFLANLGRL GRGSQVDEGLELYDGSMGSATGPRESLVNGLCNLPEDPKSQSFPKPDQPSWYLLSKQHLI YLSLSVTSLEGNLAVSNKI >gi568815595f:127823062_128028118|GENSCAN_predicted_CDS_1|960_bp ngggcagatggatttgaggatctagagctgaagaagaaggtcagaccggagacagagatg tgggtgttggccggaaatggcccttgcagctgtgggggagagtgtgctgacccaggagag tgggcagctcacacaggtccagagggtcagcacattctttgtggttggatgtccatgtgt cctgaacgtccatctccgtgggagtggaaggacagtgtagaaaataaaacatttagttgc atatccacatttaaaaaagaaaaaagcaagggtcccttgctcactcaggcccaggactgt ggatgggccttggatgaagccaggagaggaagcacggcagcacagtcctggaagctggaa cgcagagggagcatggaaatgacacacctggcagccccaggaagaccaagccctgagctg gcaatggagaaagatgaagctcagccagacttaccccacagaatccccggaaagcccagg acttcaaggcactcgatgtctctggcagtagaggtgacaaggatggtgacctggaggctg caggaaggacgctggagacagacgccccacaagaatgcggagacagagaaagagaagatc aagaaccagaatgacagaatgcatttgtgttcatttcccattgctgctattaaaaaagaa ctacacattttgtggcttaaaacaacgttaggaactttcttagcaaacctagggcgtttg ggtagaggatcccaagtggatgaaggcctagagttatatgatggctccatggggtcagca actggtccacgagagagccttgttaatggactctgcaaccttccagaggaccctaagtct cagagctttcccaagcctgaccagccatcctggtacctgctgtctaagcagcacctcata tacctcagcttatctgtcacctctctggagggcaatttggcagtgtctaacaaaatttaa >gi568815595f:127823062_128028118|GENSCAN_predicted_peptide_2|103_aa MCDKLAADRLRHRGQLGHCFSGSGTSQGAPEPRQWQSVGSNRTALMDKEERRWSSAGEWV DSVNNAALGRKKESFANGKSHVSPPGRRRGQKDFLGFFAMWSE >gi568815595f:127823062_128028118|GENSCAN_predicted_CDS_2|312_bp atgtgcgataaattggcagctgacagactgaggcacagaggtcagttaggacattgtttc agtggttcaggaacaagccaaggggcacctgagcccaggcagtggcagtcagttggaagc aacaggacagccttgatggacaaagaagaaagacgttggagtagtgcaggagaatgggtg gacagtgtgaataatgcagctctggggagaaagaaggaaagctttgcgaatggaaaatct catgtatcaccaccaggtcgtagaagaggccagaaggactttctaggattctttgctatg tggagcgaatga >gi568815595f:127823062_128028118|GENSCAN_predicted_peptide_3|220_aa MKYSTAEEQQTETTWLQIPSEEEVVLKRVRLCKEEDVSLRMENGDLQDGTALETKSSSEE ETTLDRGKQTEERAAPELSKKRGLVDEGLAPPDGGWGPLKKGSAPLQFLVPWKLYLEVPG FGAICWKGAVTIKRLPTTGVVTLGPGRSVGQLGCLRSSCLLSSLDLARNALMGMAEAKMH GRLSTAGFPTWHLRPSATGDLRTSASALQYPMDRKRKGNL >gi568815595f:127823062_128028118|GENSCAN_predicted_CDS_3|663_bp atgaaatattctacagctgaagaacaacagacagagacaacatggttgcagatacccagc gaggaggaagtggtgttgaagagggtcagattatgtaaagaggaagatgtgagtctcagg atggaaaatggtgacctccaggatggcacagctcttgagactaaaagctcctctgaagaa gagacaacattagacagggggaaacaaacagaggagcgagcagcccctgaattgagtaaa aagcggggcctggtagacgagggccttgcccctccagatggaggctggggtcctttaaag aagggttcggcccctttacagtttctggttccctggaaactctatctagaagtacctggg tttggagccatctgttggaagggtgctgtaaccataaaaaggttaccaaccactggggta gtcactttagggccaggaaggtccgtgggtcagctggggtgtctccgctcctcgtgtctc ttatcctccttggacctagccaggaatgctctcatgggaatggcagaggcaaagatgcat ggccgcctcagcacagccggattcccgacatggcacctgaggccttcagccacgggtgac ctgcgaacctctgcgtctgctcttcagtatcccatggatcggaagaggaaaggaaacttg tga >gi568815595f:127823062_128028118|GENSCAN_predicted_peptide_4|216_aa METTSPLKGTNCRHNACEKDSALFSGFEAGGRSQEQKNEDSFQKLQKGRDRFTPGSPRNK HSPADTLISIQPSVALARGPAREHSAQAHSVPRGTLSRVRDARGETLCSDTGAVVAVWNQ RGNRCAGAQWLEGSGGVVSGHFQAEGSTCPLGGAASALPPPPQGAGPLFISPSQHRPILG SGLSQARWPPCHTVASKPGRGPASGDAPRTRQHLRK >gi568815595f:127823062_128028118|GENSCAN_predicted_CDS_4|651_bp atggagaccacaagcccactgaaaggaaccaactgcagacacaatgcatgtgagaaagac tcagccctattttctggctttgaagctggaggaaggagccaggagcaaaagaatgaagac agtttccagaagctgcaaaagggcagggacagattcactccaggaagccccagaaacaaa cacagtcctgctgacaccttgatttcaatccagccttcagtcgctctggcacgcgggcca gcgcgggaacactctgcgcaggcgcacagcgttccgcgggggacgctcagtagggtacgg gacgcgcgcggggagaccctgtgcagcgacaccggggctgtagtcgccgtctggaaccag cgcgggaataggtgcgcaggcgcacagtggctggagggcagcggcggcgtggtctcgggc cacttccaagcagagggcagcacctgtcccctcggaggcgctgccagcgccctccctcct cctccgcagggcgcgggacctctatttatatcgcccagccagcacaggcccatattgggc agtggcctcagtcaggcccgctggccgccctgccacacggtagcgtccaagccaggtcgt ggcccagcgagtggggacgcgccccgaaccaggcagcaccttcgtaagtag >gi568815595f:127823062_128028118|GENSCAN_predicted_peptide_5|627_aa MECKIEGKEKYQHSLNLLNKIQNMKELAEMIDVVLTAEGEKFPCHRLVLAAFSPYFKAMF TCGLLECNQREVILYDITAESVSVLLNYMYNAALEINNANVQTVAMAAYFMQMEEVFSVC QKYMMDHMDASNCLGIYYFAKQIGAEDLSDRSKKYLYQHFAEVSLHEEILEIEVHQFLTL IKSDDLNISREESILDLVLRWVNHNKELRTVHLVELLKQVRLELVNPSFLRQALRRNTML LCDADCVDIIQNAFKAIKTPQQHSLNLRYGMETTSLLLCIGNNSSGIRSRHRSYGDASFC YDPVSRKTYFISSPKYGEGLGTVCTGVVMENNTIIVAGEASASKLSRQKNKNVEIYRYHD RGNQFWEKLCTAEFRELYALGSIHNDLYVIGGQMKIKNQYLITNCVDKYSVERDNWKRVS PLPLQLACHAVVTVNNKLYVIGGWTPQMDLPDEEPDRLSNKLLQYDPSQDQWSVRAPMKY SKYRFSTAVVNSEIYVLGFRPAGTPAGQTGGGIGCVGQDKGQVRKCLDVVEIYNPDGDFW REGPPMPSPLLSLRTNSTNAGAVDGKLYVCGGFHGAGASMIRKEGSTAISFILYRHSSLR ALLSLDARAADCEQVSVWAIMKGMSFN >gi568815595f:127823062_128028118|GENSCAN_predicted_CDS_5|1884_bp atggagtgcaagattgagggaaaagaaaaataccaacatagcttgaatttactgaataaa attcagaacatgaaagaattagcagaaatgattgatgtggtactcacagcagaaggagag aaatttccttgccacagactggtcctggctgcatttagcccttatttcaaagctatgttc acctgtggactacttgaatgtaatcaaagggaagtcatactttatgacatcacagcagaa agtgtgtcggtgttattaaattacatgtacaatgcagctttggagatcaataatgccaat gtacagactgtagctatggctgcctattttatgcagatggaagaagtcttcagtgtgtgt caaaaatatatgatggaccacatggatgcctccaactgtttaggtatctattattttgca aagcagattggagctgaagatttatctgatcgatcaaagaaatatttatatcagcacttt gccgaggtgagcttacatgaagaaatactagaaatcgaagtgcaccaatttttgacactt attaaatcagatgatcttaacatatccagagaagagagcattctggacttagttctgaga tgggtaaatcataacaaagaattgcgtacagtgcatcttgttgagcttttgaagcaagtc agattggaacttgtaaatccttcttttttaagacaagccctaagaaggaacacaatgctt ctgtgtgatgcagattgtgttgacataattcaaaatgcattcaaagccatcaagacaccc caacagcactctctaaatctgcgctatggtatggagactaccagtcttctgctttgcatt ggcaacaattcttcaggaatcagatcaagacataggagctatggggatgccagtttttgt tatgatcctgtatcacggaaaacctatttcatctcatctcccaagtacggagagggttta ggaactgtgtgtactggtgttgtcatggaaaataatactataattgtggctggagaagca agtgcctctaaactctctagacaaaagaacaagaatgttgaaatttataggtatcatgat agaggaaaccagttttgggaaaagttatgcacagctgaatttcgagaactctatgctctg ggcagtattcataatgacctttatgttataggaggacagatgaaaattaaaaaccagtat cttattacaaactgtgttgataagtactctgtagaacgggacaattggaaaagggtgtct ccccttccactgcaattggcatgtcatgctgtagtgacagtgaataataaactttatgta attggaggctggacccctcagatggatcttcctgatgaagaacctgatcgattaagcaac aaactgttgcagtatgaccccagccaagatcaatggagtgtgcgggcacccatgaagtac tctaagtaccgattcagtacagctgtagtcaacagtgagatttatgttttgggctttaga cctgctgggacccctgcaggacagactggtggtggcattggctgtgtaggtcaagacaag ggccaggttcgaaaatgccttgacgtggtggagatctacaacccagatggggacttttgg cgagagggccctcccatgccaagtcccctcctctcactccgcaccaattccaccaatgca ggggcagtggatgggaaactctatgtctgcgggggattccatggagcaggggcctccatg atacggaaagagggctccacagccatcagtttcatcctgtaccgtcactccagcctgagg gcattgctctccctagatgccagagcagcagattgtgaacaggtttctgtatgggccatc atgaagggaatgtccttcaactga >gi568815595f:127823062_128028118|GENSCAN_predicted_peptide_6|632_aa MEKFELHQKSSKKRYSEEMLASIDSIKGSFPLWLLDGSVSRRKEQEIIREEPREGVDSWH KEENRRRPASLGSHTGPAALRHSGAHDPPPAVKQQRNSWSETATKASSSVARRGWRQRED PQLACSLASALFQPRACRAPSILKLCRACAGADQDSRNKTGGHPTMEDGLGEDSHGDDLK APVTHSAFSESFLAPAPHWAGRTHDRKFRGLKQIHVLAFPYFRSPKWISVSQNQGPWSYT GPVWKTQDALSDQQPQPPSCVMPPTWEVLGSADLLGEPLFCLPQPPNPHKVRSSQMCMCG VYRSSHTRQGWKPQRGGKEEHIHRCLVLRTQCPGTSMTSRGVLRGSGRFFSPWRSWRKHA GLGLAAALFQPQVKLTPWKENSKARTQATDHITVPLNEARPEACCTSGTPRFQANTSHSL VVSSTGTPWLSNMVGLTATPWLFNLVSSTGSLWPSNMADTQVQDVGGRSPSTARTGTGES DELEFARQTTEGETQNGQTWSCPFSTGVRLGLLLNAKLESHWLKATPSQTAQVTEQAGNV KIWPFQLSTEHADGRPCSGVPNREASLRQLAKAHPNEMAENARQGDNETASARKSRGSTN LYPATKEDKIETFPNKPKLEILPAADLQYGKC >gi568815595f:127823062_128028118|GENSCAN_predicted_CDS_6|1899_bp atggagaaatttgaactgcaccagaaatcttctaaaaagaggtactcagaagaaatgctg gcctctatagacagcattaaaggttcctttcctctctggcttctggatggttcagttagc aggaggaaagagcaggaaattatcagagaggagcccagggagggagtggacagctggcac aaagaggagaaccgacgcaggccggcgtctcttggaagccacactggccctgcagcactg agacactcaggagcccatgatcctccaccagccgtgaagcagcagagaaactcatggtcc gaaaccgcaaccaaagcctccagttccgtggccagacgtgggtggaggcaaagggaagac ccacagctggcctgcagcctcgcgtctgccctcttccagccgcgggcctgcagggcgccc agcatcctcaagctctgcagagcatgtgctggtgctgatcaggactcccggaacaagaca ggaggccaccccaccatggaggatggtctgggggaggacagtcatggcgatgacctcaag gcaccggtcacacactcagcattttccgagtcctttttggcaccagcaccgcactgggca gggagaacacatgaccgcaaatttagaggcttaaaacagattcacgtacttgcatttccg tacttcagaagcccaaaatggatctcagtgagtcaaaatcaaggaccctggagttacact gggcccgtctggaaaacccaggatgctctctctgaccagcagccccagcccccttcttgt gtgatgccacccacttgggaggttctgggttctgcggaccttttaggggaaccattattt tgcctaccacaacccccaaatccccataaagtacggtcatcacagatgtgcatgtgcggc gtttataggagttcacatacacgccaaggttggaaaccccagcgaggagggaaagaggag cacatacacagatgcctggtccttagaacacagtgccctgggacctccatgaccagcagg ggcgtgctgagaggttctggaagattcttttctccgtggaggagctggaggaagcacgca ggcctggggctggcggcagccctcttccagccgcaggtgaagctcacaccgtggaaggaa aacagcaaagcaagaactcaggccactgaccacatcactgtgcccctgaatgaagccagg cccgaagcctgctgtacctctggcactcccaggtttcaagccaacacatcacattccctt gtggtgagctcaactggaaccccctggctttccaacatggtgggcttaactgctacccca tggctctttaacctggtgagctcaactgggtccctgtggccctccaatatggcagataca caggtccaagatgtaggtggtagaagcccaagcacagcaaggactggcaccggggaaagt gatgagctggagtttgcaagacagaccacagaaggggagacacagaatgggcagacatgg agctgcccattcagcacaggggtccgcctgggtctgttgctgaacgctaagctagagagc cactggctaaaggccacaccttcccagacagcccaggtgactgagcaggcagggaatgtg aagatctggccatttcagctcagcacagaacatgctgatggaaggccttgctctggagtc cccaacagggaagccagcctgagacaactagcaaaggcacatcctaatgaaatggctgag aatgcgagacaaggagacaatgaaacagcatctgcaagaaaaagccggggatctacaaat ctatatcctgcaaccaaagaagacaaaatagagacttttccaaacaaaccaaagctagag attttgccagcggcagacctgcaatacgggaaatgttga >gi568815595f:127823062_128028118|GENSCAN_predicted_peptide_7|383_aa CWDYRHEHLAPVSDLTCKDFKAAIVNMVKELKDTMKQRKVAGSHLTYSCCLQSWTEAGTW HGGARQVETPDSVAWLGVALSRQPGVERRRLRHRQNQGHTEQPEIATGHSGRQSLCGQGV DPATEKNTLGGGRGPAALNPERMGGEGPREGGKTWERCRNWYWDVWGKKVLEEEQLSRTW RRRTNQRNPESGGPRPLGAGALGPAAPSKIPSGCSAGNQGRWEASAMVQATNKGGRGLAL NGLCGCSTWRPLQGRVKPGAKGPGRRSLWSQHLALAAGSNLGGLLLGGSPGGAPPLSAPG NHSDSVGPLPLNPIPTARGQKAQHMVYFDDGSMCAENGHSAIVKREMVKLADRVRVFCVL IGRGVWESCMRIVDLSVFPFNSA >gi568815595f:127823062_128028118|GENSCAN_predicted_CDS_7|1152_bp tgctgggattacaggcatgagcacctggccccagtgtcagatttaacttgcaaagatttc aaagcagccattgtaaatatggtcaaagaactaaaggacaccatgaagcaaaggaaggtg gctggctcccatctcacctacagctgctgcctgcagagctggactgaggctggcacctgg catgggggagccaggcaagtcgaaacaccggacagtgtggcctggttgggagtggctctg agcagacagcctggggtggagaggaggagactgaggcacaggcagaaccaaggtcacaca gaacagcctgaaatagctaccgggcactctggaaggcagtcactgtgtgggcagggggtg gaccctgctacggagaagaatacactagggggagggagaggacctgcagcgttgaatcct gagcgcatgggtggggaagggccccgagaagggggaaagacttgggagaggtgcaggaac tggtattgggatgtctgggggaagaaggttctagaggaagaacagctaagccgcacatgg cggcggaggaccaaccagaggaatccggagagtgggggacctcggcctctgggagccgga gccctgggacctgctgcaccgtccaagattccctctggctgcagtgcgggaaaccaggga aggtgggaggcctctgccatggtccaggcaacaaacaaaggtggccggggcctggctctg aacgggctttgcggatgctccacctggcgtcctcttcaaggaagggtaaagcccggtgca aaaggccccggtcgccggtccctctggtcccagcatttggcccttgcagcaggcagcaat ttgggtgggctgcttcttggagggtcgccagggggtgcccctcccctcagtgcgcctggg aatcacagtgacagcgttggtcctctgcctctcaaccccatccctactgccagagggcag aaagcccagcatatggtttatttcgatgatggctctatgtgcgctgagaatggacattct gccattgttaagcgtgagatggttaaattggctgaccgtgttcgagtcttctgtgtcctt attggaagaggagtgtgggaatcatgcatgagaattgtggatttgtctgtttttcctttc aattctgcgtga