GENSCAN 1.0 Date run: 4-Nov-116 Time: 13:38:19 Sequence gi568815591f:27648525_27928826 : 280302 bp : 39.52% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 1053 949 105 0 0 112 110 96 0.858 14.47 1.00 Prom - 1818 1779 40 -7.45 2.00 Prom + 6890 6929 40 -7.45 2.01 Init + 13388 13464 77 1 2 56 78 69 0.818 3.32 2.02 Intr + 13697 13801 105 2 0 8 84 113 0.635 1.31 2.03 Intr + 14121 14248 128 0 2 32 69 203 0.596 12.20 2.04 Term + 16811 17328 518 0 2 32 44 187 0.356 2.19 2.05 PlyA + 17367 17372 6 1.05 3.06 PlyA - 19634 19629 6 1.05 3.05 Term - 21600 21456 145 2 1 57 38 74 0.005 -4.30 3.04 Intr - 41148 40972 177 2 0 101 111 74 0.110 9.11 3.03 Intr - 41929 41877 53 1 2 61 25 40 0.070 -8.21 3.02 Intr - 43003 42906 98 2 2 114 116 60 0.535 10.21 3.01 Init - 57633 57531 103 0 1 100 28 120 0.275 7.65 3.00 Prom - 58730 58691 40 -5.75 4.00 Prom + 59567 59606 40 -4.35 4.01 Init + 61528 61684 157 0 1 46 52 123 0.435 4.62 4.02 Term + 79745 79929 185 0 2 88 39 136 0.662 5.42 4.03 PlyA + 81585 81590 6 1.05 5.00 Prom + 90372 90411 40 -4.25 5.01 Init + 91499 91745 247 1 1 70 82 147 0.484 8.20 5.02 Intr + 91921 91999 79 2 1 52 44 116 0.250 1.29 5.03 Intr + 99994 100162 169 1 1 71 116 53 0.347 5.43 5.04 Intr + 109507 109609 103 0 1 114 116 48 0.993 9.03 5.05 Intr + 117310 117497 188 2 2 80 73 222 0.920 18.39 5.06 Intr + 121152 121310 159 2 0 82 96 181 0.891 17.56 5.07 Intr + 136639 136787 149 0 2 79 68 187 0.860 13.91 5.08 Intr + 136875 136965 91 0 1 43 86 99 0.999 4.28 5.09 Intr + 138894 139079 186 2 0 43 106 73 0.904 3.56 5.10 Intr + 143482 143706 225 0 0 78 98 334 0.750 30.66 5.11 Intr + 144542 144688 147 1 0 102 89 93 0.999 10.31 5.12 Intr + 145799 145922 124 1 1 39 98 108 0.946 6.14 5.13 Intr + 147592 147695 104 2 2 70 111 50 0.835 4.47 5.14 Intr + 151441 151566 126 0 0 22 106 166 0.986 11.76 5.15 Intr + 167825 167966 142 1 1 75 44 42 0.213 -2.49 5.16 Intr + 168366 168514 149 1 2 61 33 129 0.327 3.73 5.17 Intr + 175268 175316 49 1 1 93 92 -6 0.141 -2.37 5.18 Term + 177939 178069 131 1 2 86 54 95 0.216 3.26 5.19 PlyA + 180486 180491 6 1.05 6.09 PlyA - 180553 180548 6 1.05 6.08 Term - 184452 184276 177 0 0 116 38 173 0.996 11.80 6.07 Intr - 187738 187660 79 0 1 104 99 17 0.811 3.03 6.06 Intr - 189975 189832 144 2 0 53 95 56 0.340 1.28 6.05 Intr - 190360 190317 44 1 2 57 75 50 0.513 -3.28 6.04 Intr - 192343 192174 170 2 2 119 87 243 0.892 26.04 6.03 Intr - 195563 195500 64 2 1 83 61 47 0.257 -1.13 6.02 Intr - 196325 196230 96 2 0 93 59 79 0.533 4.79 6.01 Init - 208646 208485 162 2 0 100 21 136 0.006 5.88 6.00 Prom - 210747 210708 40 -10.15 7.00 Prom + 210875 210914 40 -3.05 7.01 Init + 212485 212557 73 0 1 78 83 23 0.023 2.29 7.02 Intr + 215923 216055 133 2 1 38 42 80 0.011 -2.82 7.03 Intr + 217501 217633 133 1 1 41 37 140 0.108 3.93 7.04 Intr + 218362 218485 124 0 1 11 70 114 0.127 1.14 7.05 Intr + 219554 219660 107 0 2 82 94 133 0.978 12.31 7.06 Term + 226696 226830 135 0 0 108 42 102 0.732 4.64 7.07 PlyA + 228232 228237 6 1.05 8.04 PlyA - 228623 228618 6 1.05 8.03 Term - 236487 236459 29 1 2 107 55 17 0.198 -2.44 8.02 Intr - 246892 246696 197 0 2 89 109 143 0.995 14.64 8.01 Init - 249325 249255 71 1 2 66 75 118 0.994 8.87 8.00 Prom - 249777 249738 40 -8.95 9.00 Prom + 250352 250391 40 -7.45 9.01 Init + 251368 251722 355 0 1 73 97 201 0.246 16.87 9.02 Term + 257656 257834 179 2 2 36 48 119 0.190 -0.43 9.03 PlyA + 258113 258118 6 1.05 10.03 PlyA - 258730 258725 6 1.05 10.02 Term - 265529 265402 128 0 2 92 48 109 0.163 4.76 10.01 Intr - 275176 275001 176 0 2 89 13 101 0.036 1.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:27648525_27928826|GENSCAN_predicted_peptide_1|35_aa MGNPMAKNLMKHGYPLIIYDVFPDACKEFQDAGEQ >gi568815591f:27648525_27928826|GENSCAN_predicted_CDS_1|105_bp atggggaatccaatggcaaaaaatctcatgaaacatggctatccacttattatttatgat gtgttccctgatgcctgcaaagagtttcaagatgcaggtgaacag >gi568815591f:27648525_27928826|GENSCAN_predicted_peptide_2|275_aa MSLVRSMKDCIRSPWIFILALKTASLPSPRTQVVKHQALLPKCVPNPRCTLGALKDQGSL GRSERPKEGQGGGGVRSLPAAKLPAAGRSRRLQYRRPEAAPRSRCDETPQHPNTPIALLL NPSKVMSEKTKERYGIFIFNWAVTSNPPHTSFPTLSVTGNHRATLDFHSYLTVVRYPSLS LLRWCNWRCIGESELSSLPNGKDVTPLPLHGVTGGYMGAVRSHSKPSQLGKYQWTTSGEL KLPPLPSSNKEPPTLGVNRGGIRNLDFYYQLPFTS >gi568815591f:27648525_27928826|GENSCAN_predicted_CDS_2|828_bp atgtccctcgtgcggagtatgaaagactgtattagatctccgtggatcttcatcctggct ctaaaaactgcctctttgccatcccctcggactcaggtggttaaacatcaggccctgctg cccaagtgtgtccccaacccaaggtgcaccctgggggcgctgaaagaccaggggtcgctg ggaagaagcgagcggccgaaagaaggacaagggggaggaggcgtgaggtccttacccgct gcaaagctgccggctgccggccgcagccgccggctccagtaccggagaccggaggcagct ccgaggagccgttgtgatgagacaccccagcatcctaacaccccaatcgctctactcctc aacccctctaaggtgatgtcagagaaaaccaaagagaggtatgggattttcatcttcaac tgggcagtaacaagtaatcctccacacacatccttccctactctcagtgtcactggaaac cacagggcgaccctggatttccactcctaccttacagtggtgaggtacccctccctttcc ctgcttagatggtgtaattggaggtgtattggagagtcagagctttcatcactgcccaat ggtaaggatgtcacaccccttcccctccatggtgtcactggaggctacatgggggcagta aggagccattccaagccctctcaactagggaagtatcagtggacaactagtggggagctg aaactcccaccattgcccagcagtaacaaggaaccacccacccttggtgtcaacagaggt ggaataaggaacctagacttctactaccagttgccatttacaagttga >gi568815591f:27648525_27928826|GENSCAN_predicted_peptide_3|191_aa MENYGRRAFEVFHNETRIPAVTASQQPIGENFEIWRNEKKPPQKQCILNYPLMSYPQDLY TPEQYTKFWAFIYILTIIEVVMRCRGRPDCTVPEVYGFPMVGCGNGQRDGITEKKDIFMS EKYYLDNKNRWLLILSFKSEKWSWMVQYENVSSHSHWPLPGEQLGQCVMLEDEAKTMAGR VEQVSLLQTFQ >gi568815591f:27648525_27928826|GENSCAN_predicted_CDS_3|576_bp atggagaactatggcagaagagcctttgaggtcttccataacgaaacaagaatacctgct gtcactgctagtcagcagcctattggagaaaattttgaaatctggaggaatgagaagaag ccccctcagaagcagtgcattttaaattatccccttatgagttatcctcaagatttgtat acgccagagcaatacaccaagttctgggctttcatttatattcttaccatcatagaagtt gtcatgcgatgcagaggcagacctgactgtacagtccctgaggtgtatggatttcctatg gtgggatgtgggaatggtcagagagatgggattacagagaagaaagatattttcatgtca gagaagtattatttggataacaaaaatagatggctcctcattttgtccttcaagtcagag aagtggagttggatggttcagtatgagaatgtaagcagccatagccactggccactgcca ggggagcaactaggacagtgcgtgatgcttgaagatgaagccaagaccatggcaggtaga gtggaacaagtctctctactacagacttttcagtaa >gi568815591f:27648525_27928826|GENSCAN_predicted_peptide_4|113_aa MGRYGAEGPIIAPSCMSEGQEQICEISNVGAGEVTPNRLSKQGKPEMRILSQTGSANTDA TITVMSPSGDPTVSTELLPLLGQNEFYMQLTGGAQDPITREYEKTHFFHFQYL >gi568815591f:27648525_27928826|GENSCAN_predicted_CDS_4|342_bp atgggacggtatggggcagaaggaccaatcatagctccaagttgtatgtcagaagggcaa gaacaaatctgcgagatttctaatgttggagctggagaagtgactccaaacaggttgtca aagcaaggaaagcctgagatgaggatcttgagccagactggttctgctaacactgatgct acaattactgtcatgtctccctctggagaccccacagtttccactgagctgctgcctctc ctgggccagaatgaattctatatgcagctaactggtggagcccaggacccgataacaagg gagtatgaaaaaacacatttcttccattttcaatatttataa >gi568815591f:27648525_27928826|GENSCAN_predicted_peptide_5|855_aa MRRPAAEWAGGVSSRRGAANQAGGGRDRPVGGSDARPTVGWEGRCSRSLGVGGSGGRGSA ADGGSGSEACVTFSLDPGVFPLDCDCDGGVGDSDQLWSFGMATGGRPTRFTMTSFQEVPL QTSNFAHVIFQNVAKSYLPNAHLECHYTLTPYIHPHPKDWVGIFKVGWSTARDYYTFLWS PMPEHYVEGSTVNCVLAFQGYYLPNDDGEFYQFCYVTHKGEIRGASTPFQFRASSPVEEL LTMEDEGNSDMLVVTTKAGLLELKIEKTMKEKEELLKLIAVLEKETAQLREQVGRMEREL NHEKERCDQLQAEQKGLTEVTQSLKMENEEFKKRFSDATSKAHQLEEDIVSVTHKAIEKE TELDSLKDKLKKAQHEREQLECQLKTEKDEKELYKVHLKNTEIENTKLMSEVQTLKNLDG NKESVITHFKEEIGRLQLCLAEKENLQRTFLLTTSSKEDTCFLKEQLRKAEEQVQATRQE VVFLAKELSDAVNVRDRTMADLHTARLENEKVKKQLADAVAELKLNAMKKDQDKTDTLEH ELRREVEDLKLRLQMAADHYKEKFKECQRLQKQINKLSDQSANNNNVFTKKTGNQQKVND ASVNTDPATSASTVDVKPSPSAAEADFDIVTKGQVCEMTKEIADKTEKYNKCKQLLQDEK AKCNKYADELAKMELKWKEQVKIAENVKLELAEVQDNYKELKRSLENPAERKMEGQNSQS PQCFKTCSEQNGYVLTLSNAQPVLQYDGADGAFYPDEIQRPPVRVPSWGLEDNVVCSQPA RNFSRPDGLEDSEDSKSLKTTSNLNLTTLGTLYLTDPLIAALGQACLPSNIKASSVPAAL GDMLLVHDMRAANGC >gi568815591f:27648525_27928826|GENSCAN_predicted_CDS_5|2568_bp atgaggcgtcccgcggcggagtgggctgggggcgtgtcgtcgcggaggggagcggcgaac caggcaggaggcggaagagaccgccccgtgggcggaagtgacgcaaggcctactgtcggc tgggaggggaggtgtagccggtctttgggggtaggcggtagtggcggaagaggttcggcg gctgatggcggatcaggatcggaagcctgcgtaactttctcccttgatccgggagtcttt ccactggactgcgattgtgacgggggtgtgggcgactcggatcagctgtggagttttgga atggcaacgggtggccgccccacgcgattcacaatgacatcctttcaagaagtcccattg cagacttccaactttgcccatgtcatctttcaaaatgtggccaagagttaccttcctaat gcacacctggaatgtcattacaccttaactccatatattcatccacatccaaaagattgg gttggtatattcaaggttggatggagtactgctcgtgattattacacgtttttatggtcc cctatgcctgaacattatgtggaaggatcaacagtcaattgtgtactagcattccaagga tattaccttccaaatgatgatggagaattttatcagttctgttacgttacccataagggt gaaattcgtggagcaagtacacctttccagtttcgagcttcttctccagttgaagagctg cttactatggaagatgaaggaaattctgacatgttagtggtgaccacaaaagcaggcctt cttgagttgaaaattgagaaaaccatgaaagaaaaagaagaactgttaaagttaattgcc gttctggaaaaagaaacagcacaacttcgagaacaagttgggagaatggaaagagaactt aaccatgagaaagaaagatgtgaccaactgcaagcagaacaaaagggtcttactgaagta acacaaagcttaaaaatggaaaatgaagagtttaagaagaggttcagtgatgctacatcc aaagcccatcagcttgaggaagatattgtgtcagtaacacataaagcaattgaaaaagaa accgaattagacagtttaaaggacaaactcaagaaggcacaacatgaaagagaacaactt gaatgtcagttgaagacagagaaggatgaaaaggaactttataaggtacatttgaagaat acagaaatagaaaataccaagcttatgtcagaggtccagactttaaaaaatttagatggg aacaaagaaagcgtgattactcatttcaaagaagagattggcaggctgcagttatgtttg gctgaaaaggaaaatctgcaaagaactttcctgcttacaacctcaagtaaagaagatact tgttttttaaaggagcaacttcgtaaagcagaggaacaggttcaggcaactcggcaagaa gttgtctttctggctaaagaactcagtgatgctgtcaacgtacgagacagaacgatggca gacctgcatactgcacgcttggaaaacgagaaagtgaaaaagcagttagctgatgcagtg gcagaacttaaactaaatgctatgaaaaaagatcaggacaagactgatacactggaacac gaactaagaagagaagttgaagatctgaaactccgtcttcagatggctgcagaccattat aaagaaaaatttaaggaatgccaaaggctccaaaaacaaataaacaaactttcagatcaa tcagctaataataataatgtcttcacaaagaaaacggggaatcagcagaaagtgaatgat gcttcagtaaacacagacccagccacttctgcctctactgtagatgtaaagccatcacct tctgcagcagaggcagattttgacatagtaacaaaggggcaagtctgtgaaatgaccaaa gaaattgctgacaaaacagaaaagtataataaatgtaaacaactcttgcaggatgagaaa gcaaaatgcaataaatatgctgatgaacttgcaaaaatggagctgaaatggaaagaacaa gtgaaaattgctgaaaatgtaaaacttgaactagctgaagtacaggacaattataaagaa cttaaaaggagtctagaaaatccagcagaaaggaaaatggaaggtcagaattcccagagt cctcaatgtttcaaaacatgctcagagcaaaatggttatgttctcacattgtcaaatgca caaccagttctgcaatatgatggagcagatggtgctttttacccagatgaaatacaaagg ccacctgtcagagtcccctcttggggactggaagacaatgttgtctgcagccagcctgct cgaaactttagtcggcctgatggcttagaggactctgaggatagcaaatccctgaaaacc accagtaatttgaatttgactactctgggtaccttatacttaactgacccattaatagca gctctggggcaagcatgcctacccagtaacataaaggcttcctcagtgccggctgcgctt ggggatatgctgttagttcatgacatgagagcagctaacggctgctga >gi568815591f:27648525_27928826|GENSCAN_predicted_peptide_6|311_aa MPEPPPAPMGSCAAQASPTSAAPCSTAPSSINHPRAEECRHTARDWQAAPPAALSSPEPH FNLIASVQTVMCPVGAPAGMQGSGLKALWLPPGAVDAWLIKGEHLGYSSEYDEEEVDYEE SDSDESWTTESAISSEAILSSMCMNGGEEKPFACPVPGCKKRYKVTWVAVGGPDPTREAS LCQPSLLGTDQDLQSSPFHWHLRIRQKMRYRTPRPHAEQGMGEGSHCLMSEHHFEKTQRQ FSPDYYPNPSSQLNVNGIKYHAKNGHRTQIRVRKPFKCRCGKSYKTAQGLRHHTINFHPP VSAEIIRKMQQ >gi568815591f:27648525_27928826|GENSCAN_predicted_CDS_6|936_bp atgcctgagcctccccccgcccccatgggctcctgcgcggcccaagcctccccgacgagc gccgccccctgctccacggcgcccagttccatcaaccacccaagggctgaggagtgcagg cacacggcgcgggactggcaggcagctccacctgcggccctgagtagcccagagcctcat ttcaacctgatagcctcagtgcagactgtgatgtgccccgttggagccccagctggcatg cagggcagtgggctcaaagccctctggctgccgcctggtgctgtggacgcctggttaata aaaggggagcatttggggtacagcagcgagtatgacgaggaggaggtggactatgaggag tcggacagcgatgagtcctggaccacagagagtgccatcagctccgaagccatcctcagc tccatgtgcatgaatggaggggaagagaagccttttgcctgcccagttcctggatgtaaa aagagatacaaggtgacatgggtggcagttggtggtcctgatcccacgagggaagccagt ctctgccagccaagtctcctgggcactgaccaggatctacagtcctcaccatttcattgg catttacggatcagacagaagatgaggtaccgaactcccagacctcatgcagagcaggga atgggagaaggatcccactgcttgatgtcagagcatcactttgagaaaactcaaagacag ttttctccagactattatcccaatccttcctcccaactgaatgtgaatggcataaagtat cacgctaagaatggtcacagaacacagattcgtgtccgcaaaccattcaagtgtcgctgt gggaagagttacaagacagctcagggcctgcggcaccacacaatcaatttccatcccccg gtgtcggctgagattatcaggaagatgcagcaataa >gi568815591f:27648525_27928826|GENSCAN_predicted_peptide_7|234_aa MGPMPVPRFCVFLFLRSPLAWVMGDFKAALCSPTQFGVKTHGFARLRAYCPPSRSPLPAE GSGQADFSWPSSGCYLEWVRRQVQVLQHNLSSSSSSEGSTAGTERMVLCDTVHVHDAELG WADCITPWRNARLEMAQTLRECLTLNDMTSEGSEKPTSEDSMSEYDNGAWISACHTVSTS RYYFHWLNRHPHPTPIGSTPKHSEFIRTSLLCDPRPSLTSLSSFLGTAGIESLF >gi568815591f:27648525_27928826|GENSCAN_predicted_CDS_7|705_bp atggggccgatgcctgttccaaggttctgtgtcttcctctttcttcgttcaccccttgct tgggtcatgggagacttcaaagcagccctctgctcacctactcagtttggagtcaaaacc catgggtttgcccggctcagagcctactgcccccctagcaggagtcctctccctgctgag ggctccggtcaagctgacttctcctggccgtcatcaggctgctacctagaatgggttcgc agacaggttcaagtcttgcagcacaacctcagctcgtcttcatcgagtgagggcagcact gccggcactgaaaggatggtgctctgtgacacagtgcacgtgcacgatgcagaacttggg tgggcagactgcatcacaccgtggaggaatgcaaggttggagatggcccagacactaagg gagtgcctcactcttaacgacatgacttctgaaggcagtgagaagcccacttcagaggat tcaatgagtgaatacgacaacggtgcttggatcagtgcctgccatactgtcagcactagt cgttactattttcactggttgaacagacaccctcatccaactcccatcggctctactcca aaacattccgagttcatccgcacttccctgctctgtgatcctcgtcccagccttacatca ctttcctctttcctcggcacagctggaatagagtccctcttttag >gi568815591f:27648525_27928826|GENSCAN_predicted_peptide_8|98_aa MEDKSPSAGTLAAVHVGVCSGAERFMTDAARREQESLKKKIQPKLSLTLSSSVSRGNVST PPRHSSGSLTPPVTPPITPSSSFRSSTPTEQRMSSNLL >gi568815591f:27648525_27928826|GENSCAN_predicted_CDS_8|297_bp atggaggacaagtcgccttctgcaggcacccttgcagcagtgcacgtgggagtctgttca ggtgcagagagattcatgacagatgctgcccgccgagagcaggagtccctaaagaagaag attcagccgaagctctcgctgactctgtccagctcagtgtctcgagggaatgtgtccact cccccacgccacagcagtggaagccttactccccccgtgaccccacccatcaccccctcc tcttcattccgcagcagcactccgacagagcagagaatgtcctcaaacctgctctga >gi568815591f:27648525_27928826|GENSCAN_predicted_peptide_9|177_aa MVFTVESVQGAGAWGLGKCSENSSGGLHGGSGQLGEQKAHPLNVSPSCCVGPLGSPVVAG EVSGIVAPLGYDLGKAGCAHLPEFSLLCNGDTALIWQPPCAGAGSYWVVKADLNFQGFEL GGGEQELSVLSQLDTQRGGEVLDAPVPWPHMPQADTADQWKRPHEQQQTSSCGVDST >gi568815591f:27648525_27928826|GENSCAN_predicted_CDS_9|534_bp atggttttcacagttgaaagtgttcagggagctggagcatggggactggggaaatgcagt gagaactcctctggcggtttacatgggggcagtgggcagctgggagaacaaaaggctcat cctctcaatgtcagtccatcctgctgtgtgggaccactgggctccccggttgtggcaggg gaggtctctggaattgtagccccactgggctacgatctgggaaaggctggctgtgcccac ctcccagaattctctctgctttgcaatggtgacactgctctgatctggcagccgccgtgt gctggagctggctcatactgggttgtgaaagctgacttaaattttcagggatttgagctg ggaggtggtgagcaagagctgagtgtcctctcacagctggacacacagaggggtggtgag gtgctggatgcccccgttccctggccccacatgccccaggcagacactgcagaccagtgg aaacgtcctcatgaacagcagcagactagcagctgtggtgtggatagcacttaa >gi568815591f:27648525_27928826|GENSCAN_predicted_peptide_10|101_aa XVPSLQAGVGGTGLGGCRGNGARGMTVNVGGASPAMLPSVGSPTSAGKCAVPSSQPQPPI LPPLLFIRFYCALNLYPEEPPYVLSELVEIKKHINETPIYR >gi568815591f:27648525_27928826|GENSCAN_predicted_CDS_10|306_bp nntgtgccaagtctgcaagcaggcgttggtgggacagggctggggggctgcagggggaat ggagcaagaggaatgactgtaaatgttggcggcgccagcccagccatgctcccatccgtt ggcagcccaacatctgctgggaagtgtgcagttccatccagtcagccacagcctcccatt ctcccgcccctcttattcattcgattttactgtgctttaaatctttatcctgaagagcca ccttacgtgctttcagaacttgtggaaataaagaaacacatcaatgaaacccccatttac agataa