GENSCAN 1.0 Date run: 4-Nov-116 Time: 01:50:12 Sequence gi568815597f:7684878_7942752 : 257875 bp : 44.01% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 66 252 187 0 1 59 54 125 0.433 5.89 1.02 Intr + 2807 2929 123 1 0 47 99 36 0.588 1.38 1.03 Intr + 5424 5509 86 2 2 50 65 96 0.810 2.12 1.04 Term + 5722 5815 94 1 1 51 41 113 0.772 0.20 1.05 PlyA + 6922 6927 6 1.05 2.08 PlyA - 7359 7354 6 1.05 2.07 Term - 13954 13779 176 2 2 19 41 145 0.455 0.82 2.06 Intr - 16397 16349 49 2 1 69 90 33 0.431 -0.15 2.05 Intr - 18353 18234 120 2 0 78 79 56 0.826 4.39 2.04 Intr - 19536 19230 307 2 1 24 -9 183 0.654 -1.35 2.03 Intr - 19899 19800 100 1 1 62 89 69 0.321 3.57 2.02 Intr - 29082 28912 171 1 0 74 68 86 0.854 5.11 2.01 Init - 30111 30054 58 0 1 81 73 13 0.313 0.57 2.00 Prom - 41627 41588 40 -3.36 3.00 Prom + 43694 43733 40 -8.46 3.01 Init + 44976 45027 52 2 1 38 91 24 0.578 -0.93 3.02 Intr + 47571 47722 152 1 2 109 96 126 0.927 15.38 3.03 Intr + 51467 51663 197 1 2 57 91 159 0.974 11.31 3.04 Intr + 52054 52132 79 1 1 58 95 112 0.983 8.45 3.05 Intr + 52378 52693 316 0 1 95 110 149 0.957 13.44 3.06 Intr + 53082 53605 524 1 2 92 98 225 0.603 16.57 3.07 Intr + 59958 60145 188 2 2 83 117 98 0.443 10.69 3.08 Intr + 60968 61214 247 2 1 103 92 87 0.660 8.06 3.09 Intr + 62833 62904 72 0 0 77 97 41 0.910 3.50 3.10 Intr + 66322 66515 194 0 2 107 94 158 0.940 16.59 3.11 Intr + 86394 86645 252 0 0 77 35 146 0.130 4.75 3.12 Intr + 88565 88634 70 2 1 94 57 74 0.183 4.08 3.13 Intr + 92283 92441 159 2 0 35 86 122 0.199 6.88 3.14 Intr + 92494 92537 44 0 2 105 90 45 0.256 3.44 3.15 Intr + 99317 99499 183 2 0 52 109 53 0.085 2.70 3.16 Intr + 100564 100709 146 1 2 85 92 136 0.983 13.63 3.17 Intr + 101844 101959 116 1 2 83 61 70 0.732 3.97 3.18 Intr + 103168 103369 202 0 1 57 77 76 0.497 2.26 3.19 Intr + 109080 109131 52 1 1 75 131 -3 0.276 0.67 3.20 Intr + 113648 113796 149 2 2 70 115 39 0.876 4.68 3.21 Intr + 116236 116314 79 2 1 93 106 -7 0.576 0.21 3.22 Intr + 118170 118276 107 0 2 70 77 35 0.472 0.46 3.23 Intr + 118815 118971 157 1 1 35 85 75 0.719 0.97 3.24 Intr + 124016 124121 106 2 1 81 110 73 0.898 9.02 3.25 Intr + 125016 125144 129 2 0 99 57 223 0.996 21.19 3.26 Intr + 125561 125711 151 1 1 60 89 149 0.486 11.94 3.27 Intr + 134408 134570 163 0 1 125 103 138 0.914 18.03 3.28 Intr + 135238 135362 125 1 2 53 87 89 0.999 5.53 3.29 Intr + 135590 135763 174 0 0 54 81 95 0.629 5.31 3.30 Intr + 141603 141833 231 1 0 80 102 167 0.988 15.04 3.31 Intr + 142241 142446 206 0 2 112 4 162 0.502 9.12 3.32 Intr + 145034 145284 251 1 2 22 72 220 0.381 10.04 3.33 Intr + 152122 152272 151 1 1 107 82 181 0.924 19.56 3.34 Term + 152532 152543 12 2 0 64 46 -7 0.215 -9.10 3.35 PlyA + 153022 153027 6 1.05 4.06 PlyA - 153320 153315 6 1.05 4.05 Term - 155290 155006 285 1 0 -27 43 298 0.912 9.20 4.04 Intr - 158860 158743 118 0 1 101 91 -3 0.603 1.77 4.03 Intr - 161630 161507 124 0 1 50 39 96 0.539 0.54 4.02 Intr - 166045 165935 111 2 0 54 98 60 0.528 3.95 4.01 Init - 172441 172417 25 1 1 93 96 11 0.576 2.09 4.00 Prom - 184622 184583 40 -1.46 5.04 PlyA - 185911 185906 6 1.05 5.03 Term - 220956 220892 65 1 2 87 48 62 0.209 0.15 5.02 Intr - 248419 248285 135 2 0 116 74 32 0.379 5.14 5.01 Init - 253361 253316 46 2 1 55 102 29 0.105 1.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:7684878_7942752|GENSCAN_predicted_peptide_1|163_aa XCCEKRSEEAAQVRLRGLRGVGEFTGTVPRSQVACFEEFAGTVPWSQVACFEEFAGTVLR SQVVRGPQEGSFCVPGLVEMSHICSESQRQIPSSWLPAGEQGLQERGSSDLALALCSLAL MQKDQFWHQQIGSKTPSGTELPPGGTAADICTCTTCQDLAMKY >gi568815597f:7684878_7942752|GENSCAN_predicted_CDS_1|492_bp nngtgttgtgagaagcgcagtgaagaagccgcgcaggttaggctccggggactccgcggt gtcggggagttcacaggcaccgtcccgcggtcacaggtggcatgttttgaggagttcgca ggcaccgtcccgtggtcacaggtggcgtgttttgaggagttcgcaggcaccgtcctgcgg tcacaggtggtccgtgggccccaggagggcagcttctgtgtccctgggctcgtggagatg tcccacatctgttctgagtcccagagacagattccttcttcatggctccccgctggggag caggggctccaggagagagggagcagtgacctggctctagccctctgctccctggccctc atgcagaaggaccagttctggcatcagcagatcgggagcaaaacaccttctggcacagag ttgccaccaggagggacagcagctgacatctgcacatgcaccacctgccaggacctggcc atgaagtactga >gi568815597f:7684878_7942752|GENSCAN_predicted_peptide_2|326_aa MVKCESQMGLNVANKAKGGMWTPEWLRCSPGNIPYADMKYTDKRNLRETETSLVRKEPLT TILALSVQGLFDNYKGSPLAAAGTTRGAPGTAVSPPLPECAARTNGGARRAAREWGGDGS LEAGTARSPTAAPAARADGPRSPARASRSPSRGAAPRPPCLAPRAVGAARSSAPRARVPG LGLPARPRRALRRRRLQCSRREEPALRNERGGGDSYAFIKVQLKSGHPQWLFLTLSAEPN LQLCYVLVLQYTALTLLLSLLQAGYSIEGQKMSEDRQFDSASEGREKARGESVKSLLAMM MSLTAGDPGENSIQNLGFSLQITMVG >gi568815597f:7684878_7942752|GENSCAN_predicted_CDS_2|981_bp atggtgaaatgtgagtcccaaatggggctgaacgtagcaaacaaagcaaaagggggcatg tggacccccgagtggctgcggtgctcaccgggcaatatcccctatgctgatatgaaatat acagataagagaaatttaagagaaactgaaacttcccttgttagaaaagagcccttaaca acaattcttgcgctttctgttcaggggctctttgacaattataaaggctctccactcgcc gcagctggaacgactcgcggggctccgggtaccgcggtctccccgccactgcccgagtgc gccgcccggaccaatggcggggcgcggcgcgcggcccgcgaatggggcggcgacggttcc ctcgaggccgggacggctcggagcccgacggctgcgcccgctgcccgggccgacggcccc cggagccccgcgcgggcctcccgttccccttcccgcggggccgccccgcgcccgccctgc ctggctccccgcgccgtcggggccgcccgaagctccgcgccgagggcccgcgttcccggg ctcgggctccctgcgcggccccggcgcgccctgcggcggcggcgtctgcaatgcagccgg cgggaggagccggcgctgcgcaacgagcggggaggcggagactcctatgcattcatcaag gtccagctcaaatctggccaccctcagtggcttttcctgacgctctcagcagaaccaaac ctccagctttgttatgtgttggtgttgcagtacacagcactcacgctgctgctaagtctc ctgcaagcaggatattccatagaggggcagaagatgtcagaagacaggcagtttgacagt gcttctgaaggtcgtgaaaaagccaggggtgagtctgtaaagtccctgctggccatgatg atgtccctgacagctggggatcctggcgagaacagcattcagaacttgggattttccttg cagatcacaatggttggctga >gi568815597f:7684878_7942752|GENSCAN_predicted_peptide_3|1811_aa MEPSYALLASLGIYSQDDNQFRMSILERLEQMERRMAEMTGSQQHKQASGGGSSGGGSGS GNGGSQAQCASGTGALGSCFESRVVVVCEKMMSRACWAKSKHLIHSKTFRGMTLLHLAAA QGYATLIQTLIKWRTKHADSIDLELEVDPLNVDHFSCTPLMWACALGHLEAAVVLYKWDR RAISIPDSLGRLPLGIARSRGHVKLAECLEHLQRDEQAQLGQNPRIHCPASEEPSTESWM AQWHSEAISSPEIPKGVTVIASTNPELRRPRSEPSNYYSSESHKDYPAPKKHKLNPEYFQ TRQEKLLPTALSLEEPNIRKQSPSSKQSVPETLSPSEGVRDFSRELSPPTPETAAFQASG SQPVGKWNSKDLYIGVSTVQVTGNPKGTSVGKEAAPSQVRPREPMSVLMMANREVVNTEL GSYRDSAENEECGQPMDDIQVNMMTLAEHIIEATPDRIKQENFVPMESSGLERTDPATIS STMSWLASYLADADCLPSAAQIRSAYNEPLTPSSNTSLSPVGSPVSEIAFEKPNLPSAAD WSEFLSASTSEKVENEFAQLTLSDHEQRELYEAARLVQTAFRKYKGRPLREQQEVAAAVI QRCYRKYKQYALYKKMTQAAILIQSKFRSYYEQKKFQQSRRAAVLIQKYYRSYKKCGKRR QARRTAVIVQQKLSDVFAPRRAVPPISLASGPNFASLLTLSRRRCRRRSCQNVSDATRRG RLRAPQAAARAGTAILPSCRRPPLGRRPGLRGDPEAWGSTGPTAATGSNRRLQQTQNQVD EVVDIMRVNVDKVLERDQKLSELDDRADALQAGASQFETSAAKLKRKYWWKNCKRINRLQ IMDFHDSGRANGGLRSRPRRPIARPGGGTGHAAAGGAAPAADRHAASLETARGRPRGRAA VRGAKGGGSSEQQDRNRVSEELIMVVQEMKKYFPSERRNKPSTLDALNYALRCVHSVQAN SEFFQILSQNGAPQADVSMYSLEELATIASEHTSKNTDTFVAVFSFLSGRLVHISEQAAL ILNRKKDVLASSHFVDLLAPQDMRVFYAHTARAQLPFWNNWTQRAAARYECAPVKPFFCR IRGGEDRKQEKCHSPFRIIPYLIHVHHPAQPELESEPCCLTVVEKIHSGYEAPRIPVNKR IFTTTHTPGCVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGH PPFEHSPIRFCTQNGDYIILDSSWSSFVNPWSRKISFIIGRHKVRTSPLNEDVFATKIKK MNDNDKDITELQEQIYKLLLQPVHVSVSSGYGSLGSSGSQEQLVSIASSSEASGHRVEET KAEQMTLQQVYASVNKIKNLGQQLYIESMTKSSFKPVTGTRTEPNGGGESANGGGECKTF TSFHQTLKNNSVYTEPCEDLRNDEHSPSYQQINCIDSVIRYETASLDTIYLKSYNIPALK RKCISCTNTTSSSSEEDKQNHKADDVQALQAGLQIPAIPKSEMPTNGRSIDTGGGAPQIL STAMLSLGSGISQCGYSSTIVHVPPPETARDATLFCEPWTLNMQPAPLTSEEFKHVGLTA AVLSAHTQKEEQNYVDKFREKILSSPYSSYLQQESRSKAKYSYFQGDSTSKQTRSAGCRK GKHKRKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMGESIPS YCQRSVHRIASHEESIPSYCQRSVHRIASHEESIPSYCQHTVHGIASQQDSIPSYCHCSV HGVTSQRIPIQNWFSSIRVKEVVLKEDLEKLESMRQQQPQFSHGQKEELAKVYNWIQSQT VTQEIDIQLEC >gi568815597f:7684878_7942752|GENSCAN_predicted_CDS_3|5436_bp atggaaccgtcgtacgctttgctcgctagtcttggaatttatagtcaggatgataaccag ttcaggatgtccatcctggaacgactggagcagatggagaggaggatggccgagatgacg gggtcccagcagcacaaacaggcgagcggaggcggcagcagtggaggcggcagcgggagc gggaatggagggagccaggcacagtgtgcttctgggactggggccttggggagctgcttt gagagccgtgtggtcgtggtatgcgagaagatgatgagccgagcctgctgggcgaagtcc aagcacttgatccactcaaagactttccgcggaatgaccctactccacctggccgctgcc cagggctatgccaccctaatccagaccctcatcaaatggcgtacaaagcacgcggatagc attgacctggaactggaagttgaccccttgaatgtggaccacttctcctgtactcctctg atgtgggcgtgtgccctagggcacttggaagctgccgtcgtgctgtacaagtgggaccgt cgggccatctcgattcccgactctctaggaaggctgcctttgggaattgccaggtcacgg ggtcatgtgaaattagcagagtgtctggagcacctgcagagagatgagcaggctcagctg ggacagaaccccagaatccactgtcctgcaagcgaagagcccagcacagagagctggatg gcccagtggcacagcgaagccatcagctctccagaaatacccaagggagtcactgttatt gcaagcaccaacccagagctgagaagacctcgttctgaaccctctaattactacagcagt gagagccacaaagattatccggctcccaaaaagcataaattgaaccctgagtacttccag acaaggcaggagaagctgcttcccactgcactgagtctggaagagccaaatatcaggaag caaagccctagttctaagcagtctgtccccgagacactcagccccagtgaaggagtgagg gacttcagccgggaactctcccctcccactccagagactgcagcatttcaagcctctgga tctcagcctgtaggaaagtggaattccaaagatctttacattggtgtgtctacagtacag gtgactggaaatccgaaggggaccagtgtaggaaaggaggcagcaccttcacaggtgcgt ccacgggaaccaatgagtgtcctgatgatggctaacagagaggtggtgaatacagagctg gggtcctaccgtgatagtgcagaaaatgaagaatgcggccagcccatggatgacatacag gtgaacatgatgaccttggcagaacacattattgaagccacacctgaccgaatcaagcag gagaattttgtgcccatggagtcctcaggattggaaagaacagaccctgccaccattagc agtacaatgagctggctggccagttatctagcggatgctgactgccttcccagtgctgcc cagatccgaagtgcatataacgagcctctaaccccttcttctaataccagcttgagccct gttggctctcccgtcagtgaaatcgctttcgagaaacctaaccttccctccgccgcggat tggtcagaattcctgagtgcatctaccagtgagaaggtagagaatgagtttgctcagctc actctgtctgatcatgaacagagagaactctatgaggctgccaggcttgtccagacagct ttccggaaatacaagggccgacccttgcgggaacagcaagaagtagctgctgctgttatt cagcgttgttacagaaaatataaacagtacgcactttataaaaagatgacacaggctgcc atccttatccagagcaaattccgaagttactatgaacaaaaaaaattccagcagagccga cgggctgctgtgctcatccaaaagtactaccgaagttataagaaatgtggcaaaagacgg caggctcgccggacggctgtgattgtacaacagaaactcagtgacgtctttgccccgcgc cgcgccgtcccacccatctccctggcctccggtcccaacttcgcttctctgctgaccctc tctcgtcgccgctgccgccgccgcagctgccaaaatgtgagtgacgctacgcgacgcggg cggcttcgggccccgcaggcggcagctcgcgctgggaccgccatactgcccagttgccgc cggccgccactcggacgcaggccggggctgcgcggggatcccgaggcctgggggtctaca ggtccaactgctgccactggcagtaatcgaagacttcagcagacacaaaatcaagtagat gaggtggtggacataatgcgagttaacgtggacaaggttctggaaagagaccagaagctc tctgagttagacgaccgtgcagacgcactgcaggcaggcgcttctcaatttgaaacgagc gcagccaagttgaagaggaaatattggtggaagaattgcaagagaattaacaggctacag ataatggactttcacgactctgggagggccaatggaggcctgcggagtcggccccgacga ccaatcgcacggcccgggggcgggaccggtcacgcggcggcagggggcgcggcgccggct gctgaccggcacgcggcgagcctcgagactgcgcgagggcggccccgggggcgagcggct gtgcgcggggccaagggcgggggcagcagtgaacagcaagatcgaaacagagtttctgaa gaacttatcatggttgtccaagaaatgaaaaaatacttcccctcggagagacgcaataaa ccaagcactctagatgccctcaactatgctctccgctgtgtccacagcgttcaagcaaac agtgagtttttccagattctcagtcagaatggagcacctcaggcagatgtgagcatgtac agtcttgaggagctggccactatcgcttcagaacacacttccaaaaacacagataccttt gtggcagtattttcatttctgtctggaaggttagtgcacatttctgaacaggctgctttg atcctgaatcgtaagaaagatgtcctggcgtcttctcactttgttgacctgcttgcacct caagacatgagggtattctacgcgcacactgccagagctcagcttcctttctggaacaac tggacccaaagagcagctgcacggtatgaatgtgctccggtgaaaccttttttctgcagg atccgtggaggtgaagacagaaagcaagagaagtgtcactccccattccggatcatcccc tatctgattcatgtacatcaccctgcccagccagaattggaatcggaaccttgctgtctc actgtggttgaaaagattcactctggttatgaagctcctcggatcccagtgaataaaaga atcttcaccaccacacacaccccagggtgtgtttttcttgaagtagatgaaaaagcagtg cctttgctgggttacctacctcaggacctgattggaacatcgatcctaagctacctgcac cctgaagatcgttctctgatggttgccatacaccaaaaagttttgaagtatgcagggcat cctccctttgaacattctcccattcgattttgtactcaaaacggagactacatcatactg gattccagttggtccagctttgtgaatccctggagccggaagatttctttcatcattggt cggcataaagttcgaacgagcccactaaatgaggatgtttttgctaccaaaattaaaaag atgaacgataatgacaaagacataacagaattacaagaacaaatttacaaacttctctta cagccagttcacgtgagcgtgtccagcggctacgggagcctggggagcagcgggtcgcag gagcagcttgtcagcatcgcctcctccagtgaggccagtgggcaccgtgtggaggagacg aaggcggagcagatgaccttgcagcaggtctatgccagtgtgaacaaaattaaaaatctg ggtcagcagctctacattgagtcaatgaccaaatcatcattcaagccagtgacggggaca cgcacagaaccgaatggtggtggtgagtcagcgaatggtggtggtgaatgtaagaccttt acttccttccaccaaacactgaaaaacaatagtgtgtacactgagccctgtgaggatttg aggaacgatgagcacagcccatcctatcaacagatcaactgtatcgacagtgtcatcagg tatgagaccgcaagtttggataccatatacctgaagagctacaacattccagctttgaaa agaaagtgtatctcctgtacaaatacaacttcttcctcctcagaagaagacaaacagaac cacaaggcagatgatgtccaagccttacaagctggtttgcaaatcccagccatacctaaa tcagaaatgccaacaaatggacggtccatagacacaggaggaggagctccacagatcctg tccacggcgatgctgagcttggggtcgggcataagccaatgcggttacagcagcaccatt gtccatgtcccacccccagagacagccagggatgctaccctcttctgtgagccctggacc ctgaacatgcagccagcccctttgacctcggaagaatttaaacacgtggggctcacagcg gctgttctgtcagcgcacacccagaaggaagagcagaattatgttgataaattccgagaa aagatcctgtcatcaccctacagctcctatcttcagcaagaaagcaggagcaaagctaaa tattcatattttcaaggagattctacttccaagcagacgcggtcggccggctgcaggaaa gggaagcacaagcggaagaagctgccggagccgccagacagcagcagctcgaacaccggc tctggtccccgcaggggagcgcatcagaacgcacagccctgctgcccctccgcggcctcc tctccgcacacctcgagcccgaccttcccacctgccgccatgggagaatccatcccatcc tactgccagcgctctgtccacaggatcgcctcccatgaagaatccatcccatcctactgc cagcgctctgtccacaggatcgcctcccatgaagaatccatcccatcctactgccagcac actgtccatgggattgcctcccagcaggactccatcccatcctactgccactgttctgtc cacggggtcacctcccagcgaatccccatccagaactggttcagcagcatcagggttaaa gaagttgtactaaaagaagacctggaaaagctagaaagtatgaggcagcagcagccccag ttttctcatgggcaaaaggaggagctggctaaggtgtataattggattcaaagccagact gtcactcaagaaatcgacattcaattggaatgttag >gi568815597f:7684878_7942752|GENSCAN_predicted_peptide_4|220_aa MEDCGESKAPHEDARLTPEELERASLLQILPEMLGAERGDILRKAEGSATTKEQHTTCVN LVTDHHKFPREYRSQSVYPKDIKNLAGPHNTALFKGKQKAVPVRPVVGLERPLLYKLIGK KELTVLRDSKADLSRQKKEYTYEDRTIETVKSDEQKEKRLKKSQQGLEDMWETIEWTNIS TTGAPEEEKEKEAERLFEKIMAENSPSLMKNMNVKVQEAK >gi568815597f:7684878_7942752|GENSCAN_predicted_CDS_4|663_bp atggaggattgtggggagagcaaggcacctcatgaagacgcgcgcttaactccggaggag ctagaaagagcttcccttctacagatactgccagagatgctgggtgcagaaagaggggat attctcaggaaagcagaggggtctgcaacaaccaaagagcaacacacgacttgtgtcaat cttgtcactgaccatcacaagtttcctagagagtacaggtctcagagcgtttaccctaaa gacatcaagaacttggccggtccccacaacactgccctgttcaagggcaaacaaaaggca gtgccagtcagacctgtggtagggctggagcgacctcttttgtataaattgattggaaag aaagagttaactgttttgagggattcaaaagccgatctaagcaggcagaagaaagaatac acgtatgaagaccggacaattgaaactgtcaagtccgacgaacagaaagaaaaaagactg aagaaaagtcaacagggcctggaggacatgtgggaaaccatcgagtggaccaacataagc actacaggggccccagaggaagaaaaagagaaagaggcagaaagattattcgaaaaaata atggctgaaaactccccaagtttgatgaaaaacatgaatgtaaaagtccaagaggctaaa tga >gi568815597f:7684878_7942752|GENSCAN_predicted_peptide_5|81_aa MCEQDCKQGQELTKKGHSPQIISFFLALTSTALLFLLFFLTLRFSVVKRGRKKLLYIFKQ RNRQPNLNKPYIPADLETTRK >gi568815597f:7684878_7942752|GENSCAN_predicted_CDS_5|246_bp atgtgtgaacaggattgtaaacaaggtcaagaactgacaaaaaaaggacactctccgcag atcatctccttctttcttgcgctgacgtcgactgcgttgctcttcctgctgttcttcctc acgctccgtttctctgttgttaaacggggcagaaagaaactcctgtatatattcaaacaa cgaaaccgtcagcccaacctcaacaaaccctacattcctgctgaccttgaaaccaccaga aaataa