GENSCAN 1.0	Date run: 5-Nov-116	Time: 16:31:21

Sequence gi568815596f:87967865_88210509 : 242645 bp : 43.54% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4691   4750   60  1  0   94   22   142 0.581   7.45
 1.02 Term +   4774   5253  480  0  0    5   42   314 0.648  13.30
 1.03 PlyA +   5830   5835    6                              -0.45

 2.11 PlyA -   6604   6599    6                               1.05
 2.10 Term -   8990   8910   81  2  0  114   48    57 0.543   1.99
 2.09 Intr -  21972  21832  141  0  0   55   94    25 0.291   0.35
 2.08 Intr -  24994  24860  135  1  0   69   94    55 0.708   4.96
 2.07 Intr -  32420  32375   46  1  1  131   96    16 0.544   5.11
 2.06 Intr -  36464  36440   25  0  1   87   82    -4 0.218  -4.02
 2.05 Intr -  49324  49103  222  2  0   51  109   113 0.395   7.80
 2.04 Intr -  60721  60006  716  0  2   87   85   294 0.219  20.08
 2.03 Intr -  65069  65044   26  2  2   96   77    15 0.015  -1.98
 2.02 Intr -  87698  87525  174  2  0   87   36    59 0.174   0.74
 2.01 Init -  88916  88863   54  2  0   79   88    58 0.612   4.18
 2.00 Prom -  92150  92111   40                              -3.76

 3.00 Prom +  95987  96026   40                              -4.86
 3.01 Init + 100016 100137  122  1  2   79   80   161 0.833  14.06
 3.02 Intr + 116452 116628  177  1  0   98   94   222 0.983  22.83
 3.03 Intr + 119998 120211  214  1  1   91   54   407 0.998  36.32
 3.04 Intr + 123148 123278  131  0  2   91   89   122 0.997  12.09
 3.05 Intr + 128731 128920  190  1  1   77   81   316 0.991  29.39
 3.06 Intr + 135194 135286   93  1  0   81   94    30 0.861   3.06
 3.07 Intr + 138461 138624  164  1  2   84   42   240 0.827  17.77
 3.08 Intr + 140507 140735  229  2  1  101   65   140 0.487  10.87
 3.09 Term + 142490 142648  159  1  0  109   55   256 0.995  22.34
 3.10 PlyA + 142899 142904    6                               1.05

 4.05 PlyA - 143603 143598    6                               1.05
 4.04 Term - 148837 148694  144  1  0   44   48    89 0.048  -1.59
 4.03 Intr - 156722 156630   93  2  0  106  101   145 0.959  17.76
 4.02 Intr - 158484 158312  173  1  2   24   97   221 0.897  16.26
 4.01 Init - 160153 160087   67  1  1  104  109    94 0.978  14.33
 4.00 Prom - 160544 160505   40                              -3.46

 5.04 PlyA - 160912 160907    6                               1.05
 5.03 Term - 163444 163271  174  1  0   75   33    81 0.005  -0.94
 5.02 Intr - 182512 181991  522  1  0   73  -43   222 0.210   0.75
 5.01 Init - 182768 182610  159  2  0   60   91    71 0.534   4.42
 5.00 Prom - 183516 183477   40                              -2.96

 6.00 Prom + 194827 194866   40                              -3.76
 6.01 Init + 196780 196851   72  0  0   70   46   101 0.483   3.27
 6.02 Intr + 200824 200887   64  0  1   76   52    66 0.592   0.09
 6.03 Intr + 202423 202592  170  2  2  107   54    58 0.587   3.97
 6.04 Intr + 205275 205509  235  2  1   97  101   197 0.908  19.16
 6.05 Intr + 206775 206969  195  1  0   86   75   213 0.987  19.19
 6.06 Intr + 207385 207537  153  2  0   70   85   170 0.998  14.94
 6.07 Intr + 210919 211149  231  2  0  108  110   239 0.972  25.84
 6.08 Intr + 214835 214983  149  0  2   73   96   140 0.975  13.25
 6.09 Intr + 215084 215209  126  1  0   64   86   134 0.988  11.68
 6.10 Intr + 217464 217615  152  2  2   60   92   260 0.986  22.56
 6.11 Term + 218034 218259  226  0  1   88   38   257 0.792  16.95
 6.12 PlyA + 218741 218746    6                               1.05

 7.04 PlyA - 219774 219769    6                               1.05
 7.03 Term - 224493 224437   57  0  0   76   49    54 0.033  -2.01
 7.02 Intr - 226914 226809  106  2  1  111   92    16 0.052   4.52
 7.01 Init - 237134 237061   74  2  2   63   92    19 0.046   0.34

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  60755  60748    8  2  2   76   99    11 0.906   0.72

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:87967865_88210509|GENSCAN_predicted_peptide_1|179_aa
MGRAGTMAVAAEVAGAGRLAGAVGGGMARASSGNGSEEAWGALRAPQQQLRELCPGVNNQ
PYLCESGHCCRETGCCTYYYELWWFWLLWTVLILFSCCWAFRHRRAKLRLQQQQRQREIN
LLAYHGACHGAGPFPTGSLLDLRLLSTFKPPAYEDVVHRPGTTPRPPPPRLILSPQAAP
>gi568815596f:87967865_88210509|GENSCAN_predicted_CDS_1|540_bp
atgggccgggcagggaccatggcagtggcagcagaggtggcaggggcggggcggctggcg
ggggctgtaggtggaggtatggctcgggccagcagcgggaacggcagcgaggaggcctgg
ggggcacttcgggcgccgcaacagcagcttcgagagctgtgcccaggagtgaacaaccag
ccctacctctgtgagagtggtcactgctgcagggagactggctgctgcacctactactat
gagctctggtggttctggctgctctggactgtcctcatcctctttagctgctgttgggcc
ttccgccaccgacgagctaaactcaggctgcaacaacagcagcggcagcgtgaaatcaac
ttgttggcctatcatggggcatgccatggggctggtcctttccctaccggttcactgctt
gaccttcgcctcctcagcaccttcaagcccccagcctacgaggatgtggttcaccgccca
ggcaccaccccccgccccccgcctccgcgccttatactgtcgccccaggccgccccttga
>gi568815596f:87967865_88210509|GENSCAN_predicted_peptide_2|539_aa
MDLARWLTPVIPALWEAEDRPRAPSPPGAPRVCEMADQGAARVPRTPHRPPPPERDARAS
EWSAEGRGARLAEEAEKWVLAVLARKSRFVNLMKHSKKTYDSFQDELEDYIKVQKARGLE
PKTCFRKMKGDYLETCGYKGEVNSRPTYRMFDQRLPSETIQTYPRSCNIPQTVENRLPQW
LPAHDSRLRLDSLSYCQFTRDCFSEKPVPLNFNQQEYICGSHGVEHRVYKHFSSDNSTST
HQASHKQIHQKRKRHPEEGREKSEEERSKHKRKKSCEEIDLDKHKSIQRKKTEVEIETVH
VSTEKLKNRKEKKSRDVVSKKEERRPPVRAEEEEPQLPRTESASRHLDRRPPTPWLRPAA
LVPIALYGLPFQHRGRYKAALSLPLILISDSSPMLSSAIRFKIDGMWLKLPDPLTEFSSS
QVGKVFIPGLPAPSLTMSNTMPQPSTSLDGVSAPKPLSKLLGSLDEVLLLFPVPELRDSS
KLHDSLYNEDCTFQQLGTYIDSIRDPVHNRVTLLSVLYYKNNKTPTSQNCKWFMAPHID
>gi568815596f:87967865_88210509|GENSCAN_predicted_CDS_2|1620_bp
atggacctggcacggtggctcacgcctgtaatcccagcactttgggaggccgaggacagg
cctcgcgcgcccagcccgccgggcgcgccccgagtgtgcgaaatggccgaccagggcgcg
gcccgagtcccgcgcacccctcaccggccgccgcccccggaaagagacgccagggcttcc
gagtggagtgcggagggaaggggcgctcggcttgcggaggaagccgagaaatgggttctt
gctgtgttggccagaaaatcccgttttgtcaacctaatgaagcattcaaagaagacatat
gactcttttcaagatgaacttgaagattatattaaagtacagaaagccagaggcttagag
ccaaagacttgtttcagaaagatgaaaggggactatttggaaacctgtgggtacaaagga
gaggttaattccagacccacgtatagaatgtttgaccagagactcccatctgaaaccatc
cagacctacccaagatcatgcaatattccacaaacagtggaaaatcggttgcctcagtgg
ttaccagcccatgacagcagattgagactagactctctgagctactgtcagttcacgagg
gactgtttctcagaaaaaccagtacccctgaactttaatcaacaagaatatatttgtggc
tcacatggtgtagaacatagagtttacaagcacttctcctcagataacagtaccagtact
catcaagccagtcacaaacagatacatcagaagaggaaaaggcacccagaggaaggcaga
gaaaaatcagaggaggagcggtctaagcataagagaaaaaaaagctgcgaggaaattgac
ttagacaaacacaagagcatccaaagaaagaaaacagaggtggaaatagaaaccgtacat
gtcagtacagaaaagcttaagaatcgaaaggagaaaaaaagccgagatgtagtctctaag
aaagaggaacgtcgcccgccggtgagggcggaggaggaagaaccgcagctcccgaggaca
gaatccgcttcaaggcaccttgaccgtcgcccacccacaccctggctgcggcccgccgcc
ctggtccccatcgcgctgtacggtcttcccttccaacaccgagggcgttacaaggccgcc
ctgtccctgccactcattcttattagtgactcctcccccatgctttcctcagctattaga
tttaaaatagatggaatgtggctcaagctgcctgacccgctgacagagttctcttcctcc
caggtgggaaaggtttttattcctggactgccagctccctctctgacgatgtccaacaca
atgcctcagcccagtacttcacttgatggcgttagtgctccaaagcctcttagtaaactc
cttggatcattggacgaggttcttctgttgttcccagttccagaactgagggattcttca
aaacttcatgattctctctataatgaggattgtactttccaacagcttggaacttacatt
gattctatcagagatcctgtccataacagagtcaccctactttctgtcctgtactacaag
aacaacaaaactccaactagtcagaattgcaagtggtttatggccccccatatagattga
>gi568815596f:87967865_88210509|GENSCAN_predicted_peptide_3|492_aa
MENVEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRC
GQCKFAHYCDRTCQKDAWLNHKNECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGC
LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLS
DQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNV
SEERKRQLKKQYYFDCTCEHCQKKLKDDLFLGVKDNPKPSQEVVKEMIQFSKDTLEKIDK
ARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMV
DGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDLE
ASSVLRLVFPSWPELSEDGSAMRVQTEMELRMFRQNEFMYYKMREAALNNQPMQVMAEPS
NEPSPALFHKKQ
>gi568815596f:87967865_88210509|GENSCAN_predicted_CDS_3|1479_bp
atggagaacgtggaggtcttcaccgctgagggcaaaggaaggggtctgaaggccaccaag
gagttctgggctgcagatatcatctttgctgagcgggcttattccgcagtggtttttgac
agccttgttaattttgtgtgccacacctgcttcaagaggcaggagaagctccatcgctgt
gggcagtgcaagtttgcccattactgcgaccgcacctgccagaaggatgcttggctgaac
cacaagaatgaatgttcggccatcaagagatatgggaaggtgcccaatgagaacatcagg
ctggcggcgcgcatcatgtggcgggtggagagagaaggcaccgggctcacggagggctgc
ctggtgtccgtggacgacttgcagaaccacgtggagcactttggggaggaggagcagaag
gacctgcgggtggacgtggacacattcttgcagtactggccgccgcagagccagcagttc
agcatgcagtacatctcgcacatcttcggagtgattaactgcaacggttttactctcagt
gatcagagaggcctgcaggccgtgggcgtaggcatcttccccaacctgggcctggtgaac
catgactgttggcccaactgtactgtcatatttaacaatggcaaaattgagctccgggcc
ctaggcaagatctcagaaggagaggagctgactgtgtcctatattgacttcctcaacgtt
agtgaagaacgcaagaggcagctgaagaagcagtactactttgactgcacatgtgaacac
tgccagaaaaaactgaaggatgacctcttcctgggggtgaaagacaaccccaagccctct
caggaagtggtgaaggagatgatacaattctccaaggatacattggaaaagatagacaag
gctcgttccgagggtttgtatcatgaggttgtgaaattatgccgggagtgcctggagaag
caggagccagtgtttgctgacaccaacatctacatgctgcggatgctgagcattgtttcg
gaggtcctttcctacctccaggcctttgaggaggcctcgttctatgccaggaggatggtg
gacggctatatgaagctctaccaccccaacaatgcccaactgggcatggccgtgatgcgg
gcagggctgaccaactggcatgctggtaacattgaggtggggcacgggatgatctgcaaa
gcctatgccattctcctggtgacacacggaccctcccaccccatcactaaggacttagag
gcaagtagcgtcttgaggctggtgttcccttcctggcctgagctttctgaggatgggagt
gccatgcgggtgcagacggagatggagctacgcatgttccgccagaacgaattcatgtac
tacaagatgcgcgaggctgccctgaacaaccagcccatgcaggtcatggccgagcccagc
aatgagccatccccagctctgttccacaagaagcaatga
>gi568815596f:87967865_88210509|GENSCAN_predicted_peptide_4|158_aa
MSFSGKYQLQSQENFEAFMKAIGLPEELIQKGKDIKGVSEIVQNGKHFKFTITAGSKVIQ
NEFTVGEECELETMTGEKVKTVVQLEGDNKLVTTFKNIKSVTELNGDIITNSPAGASYWL
NEKEEVSLSDTKQGREEWRRPVGEMEKNQHNGNSFSCK
>gi568815596f:87967865_88210509|GENSCAN_predicted_CDS_4|477_bp
atgagtttctccggcaagtaccaactgcagagccaggaaaactttgaagccttcatgaag
gcaatcggtctgccggaagagctcatccagaaggggaaggatatcaagggggtgtcggaa
atcgtgcagaatgggaagcacttcaagttcaccatcaccgctgggtccaaagtgatccaa
aacgaattcacggtgggggaggaatgtgagctggagacaatgacaggggagaaagtcaag
acagtggttcagttggaaggtgacaataaactggtgacaactttcaaaaacatcaagtct
gtgaccgaactcaacggcgacataatcaccaattctcctgctggtgcttcttattggtta
aatgagaaagaagaagtcagcctttctgacacaaagcagggcagagaagagtggagaagg
cctgtgggagaaatggagaaaaaccagcacaatgggaactccttcagttgcaagtga
>gi568815596f:87967865_88210509|GENSCAN_predicted_peptide_5|284_aa
MSELPFTIASKRIKYLGIQLTTDVKDLFKENYKPLLNEIKEDTNKWKNILCSQNWEKTTL
KFIWNQKRARIAKSILSQKNKAGGISLPDFKLYYKATVTKTAWYWYQNRDIDQWNGTEPS
EIMPHIYNYLIFDKPDKNKKWGKDSLFNKWCWENWLAMCRKLKLDPFLTPYTKINSRWIK
DLNVRPKTIKTLEENLGNTIQDVGMVKDFMSKTPKAMATKAKIDKWDTPGKVHLVPTSPE
ASGVGADLLYGEAFMTLLPTPLRRSQKAAVWGMILDICDQGERA
>gi568815596f:87967865_88210509|GENSCAN_predicted_CDS_5|855_bp
atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt
acaacggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa
gaggatacaaacaaatggaagaacattctatgctcacagaattgggaaaaaactacttta
aagttcatatggaaccaaaaaagagcccgcattgccaagtcaatcctaagccaaaagaac
aaagctggaggcatctcgctacctgacttcaaactgtactacaaggctacagtaaccaaa
acagcatggtactggtaccaaaacagagatatagaccaatggaatggaacagagccctca
gaaataatgccacatatctacaactatctgatctttgacaaacctgacaaaaacaagaaa
tggggaaaggattccctatttaataaatggtgctgggaaaactggctagccatgtgtaga
aagctgaaactggatcccttccttacaccttatacaaaaattaattcaagatggattaaa
gacttaaatgttagacctaaaaccataaaaaccctagaagaaaacctaggcaataccatt
caggacgtgggcatggtcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaa
gccaaaattgacaaatgggatacaccaggaaaggtgcacctggtccctacttcacctgag
gcctccggggttggtgctgacttgctctatggagaagctttcatgacgctccttcccact
cccctccgtagatcccagaaagcagcagtgtggggcatgatacttgacatttgcgatcaa
ggggagagggcctag
>gi568815596f:87967865_88210509|GENSCAN_predicted_peptide_6|590_aa
MAMRIVEALITGVLGPVLLLGEWTNHKGLQARVSVAEGLWERSVEGPSHCEAENPSSGPD
RKPHRTVRDADSGVRTFAARAALLRTGPRAPRPAPRAPRPAPASRIMWYVSTRGVAPRVN
FEGALFSGYAPDGGLFMPEELPQLDRGTLCQWSTLSYPGLVKELCALFIGSELLPKDELN
DLIDRAFSRFRHREVVHLSRLRNGLNVLELWHGVTYAFKDLSLSCTTQFLQYFLEKREKH
VTVVVGTSGDTGSAAIESVQGAKNMDIIVLLPKGHCTKIQELQMTTVLKQNVHVFGVEGN
SDELDEPIKTVFADVAFVKKHNLMSLNSINWSRVLVQMAHHFFAYFQCTPSLDTHPLPLV
EVVVPTGAAGNLAAGYIAQKIGLPIRLVVAVNRNDIIHRTVQQGDFSLSEAVKSTLASAM
DIQVPYNMERVFWLLSGSDSQVTRALMEQFERTQSVNLPKELHSKLSEAVTSVSVSDEAI
TQTMGRCWDENQYLLCPHSAVAVNYHYQQIDRQQPSTPRCCLAPASAAKFPEAVLAAGLT
PETPAEIVALEHKETRCTLMRRGDNWMLMLRDTIEDLSRQWRSHALNTSQ
>gi568815596f:87967865_88210509|GENSCAN_predicted_CDS_6|1773_bp
atggcgatgcgaattgtggaggctttaatcactggtgtgctgggaccagttctgctgctg
ggagaatggactaatcacaaaggcctgcaggcccgtgtaagtgtggcagagggcttgtgg
gagaggtccgtggaaggtccttcccactgcgaagctgagaaccctagctccggccccgac
cggaagccccaccgcaccgtcagggacgcggactcgggagtgcgcacattcgcagcccgg
gcagccctgctgcgcaccgggcctcgcgccccgcgccccgcgccccgcgccccgcgcccc
gcaccggcctccaggatcatgtggtatgtcagcaccaggggcgtagccccacgggtcaac
tttgagggggccctcttctctggctatgcacctgacgggggcctctttatgcctgaagag
ctcccacagttggacagagggaccctgtgccagtggagcacactctcctatcctggcctg
gtgaaggagctgtgtgccctcttcattggctctgagctccttccaaaagatgaattaaat
gatctgatcgaccgagccttcagcagattccgtcacagagaagtggtccatctgtccagg
ttgaggaatgggctgaacgtgttggagctgtggcatggcgtcacatatgcatttaaggac
ctgtccctgtcctgcacaacacagttcctgcagtacttcctggagaagagggagaagcac
gtcactgtggttgtaggaacatctggggacacaggaagtgctgccattgagagtgttcaa
ggggcaaagaacatggacattatcgttctgctgcccaaaggtcactgcacaaagattcag
gagctccagatgacaacggtgctgaagcagaacgtacatgtgtttggagtggagggaaac
agcgatgagctcgatgagccgatcaagactgtgtttgccgatgtggcttttgtcaagaag
cacaatctgatgagcctgaattcgatcaactggtcccgggtcctggtgcagatggcccat
cacttctttgcttacttccagtgtacgccatccttggacacacatcccctacccctggtg
gaggtggttgtgccaacaggggctgccggtaaccttgcagctgggtacattgctcaaaag
ataggcctgcccatccgtctggtcgtggcagtgaaccgcaatgacatcatccacaggact
gtccagcagggagacttctctctctctgaggctgttaaatcaaccttggcatcagctatg
gacattcaggtgccctacaacatggagagggtgttctggctgctctctggctctgacagc
caggtgacaagagccctcatggagcagtttgaaaggacccaaagtgtgaatctgcccaag
gaactgcacagcaagctttcagaggcagtgacatccgtgtcagtgtcggatgaagccatc
acccagaccatgggccgctgctgggatgagaaccagtacttgctgtgcccccactcagcg
gtggccgtgaactaccattaccagcagatagacaggcagcagcccagcactccccggtgc
tgcctcgcccctgcctctgcagccaagttcccggaagctgtcctggctgctggcctgacc
cctgagactcccgcggagatcgtagccctggagcacaaggagacacgctgcaccctgatg
cggagaggtgacaactggatgctgatgcttcgggacaccattgaggaccttagccgacag
tggaggagtcatgccctcaacacctcccagtag
>gi568815596f:87967865_88210509|GENSCAN_predicted_peptide_7|78_aa
MVCLIEKIWNQPKSPSINEWIKKLRRQGGCRGYKLHWLSPQWQEAPMVLNEFSLSQGQNQ
ASKEKQTEGTHIRQAGNK
>gi568815596f:87967865_88210509|GENSCAN_predicted_CDS_7|237_bp
atggtgtgccttattgaaaaaatatggaaccagccaaaaagcccatcaatcaatgagtgg
ataaagaaactacggcgacaaggtggctgcaggggctacaagttgcactggctttcaccc
cagtggcaagaggcacctatggtcctgaatgagttttccctatctcagggccagaaccag
gccagtaaggagaagcagacagaagggacacatatacgacaagctggcaacaagtga