GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:26:55 Sequence gi568815594f:40235468_40454517 : 219050 bp : 44.11% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 7920 8495 576 2 0 78 40 815 0.993 69.87 1.02 PlyA + 9647 9652 6 1.05 2.03 PlyA - 12291 12286 6 1.05 2.02 Term - 20500 20433 68 2 2 70 42 85 0.483 0.20 2.01 Init - 26001 25935 67 0 1 8 98 119 0.481 4.15 2.00 Prom - 46179 46140 40 -1.56 3.02 PlyA - 46633 46628 6 1.05 3.01 Sngl - 59620 59159 462 1 0 98 42 429 0.524 35.46 3.00 Prom - 61667 61628 40 -4.46 4.00 Prom + 63312 63351 40 -4.26 4.01 Init + 71862 72023 162 2 0 68 76 126 0.656 9.13 4.02 Intr + 72810 72861 52 2 1 92 99 36 0.662 3.58 4.03 Intr + 74859 75024 166 1 1 92 50 31 0.474 -1.28 4.04 Term + 76998 77169 172 0 1 91 43 74 0.481 0.50 4.05 PlyA + 78700 78705 6 1.05 5.05 PlyA - 79476 79471 6 1.05 5.04 Term - 93694 93567 128 2 2 46 36 142 0.565 3.24 5.03 Intr - 94358 94283 76 2 1 53 110 66 0.919 4.39 5.02 Intr - 95643 95518 126 0 0 75 89 112 0.991 10.88 5.01 Init - 95840 95790 51 2 0 58 44 65 0.938 -1.72 5.00 Prom - 97255 97216 40 -6.06 6.00 Prom + 98929 98968 40 -6.56 6.01 Init + 100001 100064 64 1 1 63 78 4 0.884 -1.65 6.02 Intr + 100360 100505 146 2 2 27 101 165 0.978 11.70 6.03 Intr + 101743 101897 155 0 2 39 108 146 0.944 10.57 6.04 Term + 105096 105183 88 0 1 55 46 92 0.664 -1.17 6.05 PlyA + 105635 105640 6 1.05 7.00 Prom + 109146 109185 40 -1.66 7.01 Init + 113242 113354 113 0 2 73 74 58 0.897 2.58 7.02 Intr + 113415 113947 533 0 2 89 95 632 0.997 56.57 7.03 Term + 118512 119053 542 1 2 65 38 391 0.235 26.22 7.04 PlyA + 119539 119544 6 1.05 8.00 Prom + 119635 119674 40 -4.86 8.01 Init + 124413 124453 41 2 2 61 105 11 0.264 -0.24 8.02 Intr + 128228 128359 132 2 0 116 65 61 0.353 6.36 8.03 Term + 140427 140457 31 0 1 117 42 43 0.050 -0.07 8.04 PlyA + 142736 142741 6 1.05 9.05 PlyA - 146291 146286 6 1.05 9.04 Term - 161342 161259 84 2 0 95 44 59 0.401 -0.15 9.03 Intr - 164005 163985 21 1 0 108 72 19 0.242 0.04 9.02 Intr - 168974 168943 32 0 2 103 100 32 0.238 3.75 9.01 Init - 171744 171618 127 0 1 81 98 22 0.184 2.83 9.00 Prom - 172150 172111 40 -4.56 10.00 Prom + 182755 182794 40 -2.96 10.01 Init + 183866 183940 75 1 0 68 80 107 0.414 8.99 10.02 Term + 184205 184300 96 1 0 79 34 17 0.267 -6.63 10.03 PlyA + 184548 184553 6 1.05 11.05 PlyA - 185256 185251 6 1.05 11.04 Term - 190676 190437 240 2 0 100 54 314 0.994 25.23 11.03 Intr - 197395 197184 212 2 2 100 80 237 0.962 22.73 11.02 Intr - 201180 200974 207 1 0 27 82 110 0.469 3.35 11.01 Intr - 203457 202304 1154 2 2 135 96 3198 0.788 313.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:40235468_40454517|GENSCAN_predicted_peptide_1|191_aa MLSSIKCVLVGDSAVGKTSLLVRFTSETFPEAYKPTVYENTGVDVFMDGIQISLGLWDTA GNDAFRSIRPLSYQQADVVLMCYSVANHNSFLNLKNKWIGEIRSNLPCTPVLVVATQTDQ REMGPHRASCVNAMEGKKLAQDVRAKGYLECSALSNRGVQQVFECAVRTAVNQARRRNRR RLFSINECKIF >gi568815594f:40235468_40454517|GENSCAN_predicted_CDS_1|576_bp atgctgagttccatcaagtgcgtgttggtgggcgactctgctgtggggaaaacctctctg ttggtgcgcttcacctccgagaccttcccggaggcctacaagcccacagtgtacgagaac acaggggtggacgtcttcatggatggcatccagatcagcctgggcctctgggacacagcc ggcaatgacgccttcagaagcatccggcccctgtcctaccagcaggcagacgtggtgctg atgtgctactctgtggccaaccataactcattcctgaacttgaagaacaagtggattggt gaaattaggagcaacttgccctgtacccctgtgctggtggtggccacccagactgaccag cgggagatggggccccacagggcctcctgcgtcaatgccatggaagggaagaaactggcc caggatgtcagagccaagggctacctggagtgctcagcccttagcaatcggggagtacag caggtgtttgagtgcgccgtccgaactgccgtcaaccaggccaggagacgaaacagaagg aggctcttctccatcaatgagtgcaagatcttctaa >gi568815594f:40235468_40454517|GENSCAN_predicted_peptide_2|44_aa MGAAAAALLLVVLSGSPLATLSGSSRASDESIIEWKKGSLEQEI >gi568815594f:40235468_40454517|GENSCAN_predicted_CDS_2|135_bp atgggagctgctgctgctgcactcttgctggtggtgctaagcggcagtcccctagccacg ctgtcagggtcatccagagcatcagatgaaagcatcatcgaatggaaaaaagggagtttg gagcaagagatctaa >gi568815594f:40235468_40454517|GENSCAN_predicted_peptide_3|153_aa MEMEAEMEMEVEMEMEAEMEMEAEMEMEVEMEMEAEMEMEAQMEMEMEAEMEMEMEAEME MEMEAEMEMKMEAEMEMEMEVEMEAEMEMEMEVEMEAEIEVEMEAQMEMEIEAEMEMEAE MEMEMEAEMEMEMEAEMEMEMEAEMEMEIEAEM >gi568815594f:40235468_40454517|GENSCAN_predicted_CDS_3|462_bp atggagatggaggcagagatggagatggaggtggagatggagatggaggcagagatggag atggaggcagagatggagatggaggtggagatggaaatggaggcagagatggagatggag gcacagatggagatggagatggaggcagaaatggagatggagatggaggcagagatggag atggagatggaggcagagatggagatgaagatggaggcagagatggagatggagatggag gtggagatggaggcagagatggagatggagatggaggtggagatggaggcagagatagag gtggagatggaggcacagatggagatggagatagaggcagagatggagatggaggcagaa atggagatggagatggaggcagagatggagatggagatggaggcagagatggagatggag atggaggcagagatggagatggagattgaggcagagatgtag >gi568815594f:40235468_40454517|GENSCAN_predicted_peptide_4|183_aa MPPRPLWFYRLMFVLELEQDEVGNEAYRVTRDGGSSRKDRVIWFLEGILEISSYMGQLKL REKWLDQGPTAETTQDCYGGSLITRFLPSCSILFNTWLLPRGPKWLLQLQAWFSVSVTGQ EERNREGMTFSFRQTKPESKEKGSRKLRDLILKQGHRSQCEGATPGQIGHNLSTKVIKDS NGM >gi568815594f:40235468_40454517|GENSCAN_predicted_CDS_4|552_bp atgcccccaaggccactgtggttctacagacttatgtttgtgctcgagttagagcaggat gaggttggaaatgaagcttacagagttacaagggacggcggtagttccagaaaggacagg gtcatctggtttttggaaggcatcttggagatttcatcgtacatggggcagttgaagctc agagagaagtggcttgaccaaggcccaacggcagagacaacccaagactgttatggcggc tccctgatcaccaggtttcttccatcttgttccattctcttcaacacatggcttctacct cgtggcccaaaatggctgctccagctccaagcttggttcagcgtttcagtcactggtcag gaggaaaggaacagagaagggatgactttcagcttcaggcagaccaagcctgaatcaaag gagaagggatccagaaagttacgtgatctgattttaaagcaaggacacaggagccagtgt gaaggggctacccctgggcaaatcggacacaatttgagcaccaaagtaattaaggacagc aatgggatgtga >gi568815594f:40235468_40454517|GENSCAN_predicted_peptide_5|126_aa MRRRMAGPGLTPTAAQPGDRDSVVDEELEFPDIGDGGNCGYSQARGWQGEQQGNGSEVQP LLTAICTNHTCLLAANGAPEFPSPVDSLQWILTLFGIKAKILTGNYKAVRDVSPDFPSPT LSPSLM >gi568815594f:40235468_40454517|GENSCAN_predicted_CDS_5|381_bp atgcgcagacgcatggcagggccagggctgactcccactgcagctcagccaggtgaccgt gatagtgtcgtggatgaggaattagagtttccagatataggtgatggtgggaactgtggc tacagccaggccaggggctggcagggtgagcagcaggggaatgggtcagaagttcaacca ttgctgactgccatctgcaccaaccacacttgcctcctagcagccaacggtgcccctgaa ttcccttctccagtggattctctccagtggattctcaccttatttggaataaaagccaaa atccttactggcaactacaaggccgtacgtgatgtctcccccgactttccttctccaaca ctctccccctcgctcatgtag >gi568815594f:40235468_40454517|GENSCAN_predicted_peptide_6|150_aa MNWSHSCISFCWIYFAASRLRAAETADGKYAQKLFNDLFEDYSNALRPVEDTDKVLNVTL QITLSQIKDMDERNQILTAYLWIRQIWHDAYLTWDRDQYDGLDSIRIPSDLVWRPDIVLY NKYNIIASQKQVQNIIGNTMGFLSCGKKEK >gi568815594f:40235468_40454517|GENSCAN_predicted_CDS_6|453_bp atgaactggtcccattcctgcatctccttttgctggatctactttgctgcttccagactg agagctgcagagacggcagatggaaaatatgctcagaagttgtttaatgacctttttgaa gattattctaatgctcttcgtccagtggaagatacagataaagtcctgaatgtgaccctg cagattacgctctctcagattaaggatatggatgaaagaaaccaaattctgactgcttat ttgtggatccgccaaatctggcacgatgcctatctcacgtgggaccgagatcagtacgat ggcctagactccatcaggatccccagtgacctcgtgtggaggccagacatcgtcttatat aacaaatacaatatcattgcttctcagaagcaggtacagaacatcataggcaacacaatg ggcttcctgagttgtggcaagaaggagaaatag >gi568815594f:40235468_40454517|GENSCAN_predicted_peptide_7|395_aa MEEKDKLELIPDFRATVTGKMVVHCQNTEVMGTVSCLGADDESSEPVNTNVVLRYDGLIT WDAPAITKSSCVVDVTYFPFDNQQCNLTFGSWTYNGNQVDIFNALDSGDLSDFIEDVEWE VHGMPAVKNVISYGCCSEPYPDVTFTLLLKRRSSFYIVNLLIPCVLISFLAPLSFYLPAA SGEKVSLGVTILLAMTVFQLMVAEIMPASENVPLIGKYYIATMALITASTALTIMVMNIH FCGAEARPVPHWARVVILKYMSRVLFVYDVGESCLSPHHSRERDHLTKVYSKLPESNLKA ARNKDLSRKKDMNKRLKNDLGCQGKNPQEAESYCAQYKVLTRNIEYIAKCLKDHKATNSK GSEWKKVAKVIDRFFMWIFFIMVFVMTILIIARAD >gi568815594f:40235468_40454517|GENSCAN_predicted_CDS_7|1188_bp atggaggagaaggacaaattagagttgattccagacttccgagccacagtgacaggcaaa atggtggtccattgccaaaacactgaagtgatggggacagtgagctgtctgggggctgat gatgaatcttcagagcctgtgaacaccaatgtggtcctgcggtatgatgggctgatcacc tgggatgcaccggccatcaccaaaagctcctgtgtggtggatgtcacctacttccctttt gacaaccagcagtgcaacctgacttttggttcctggacctacaatggcaatcaggtggac atattcaacgccttggacagcggagatctctctgacttcattgaagatgtggaatgggag gtccatggcatgcccgctgtgaagaatgtgatctcctatggctgctgctctgagccttac ccggatgtcacattcaccctccttctgaagaggaggtcctcgttctatatcgtcaacctc ctcatcccatgcgtcctcatatcttttctggctcctctgagtttttatctcccagcagcc tccggagaaaaggtctccctgggagtgaccatcctgttggccatgactgtatttcagcta atggtggcagaaatcatgccggcctcagaaaatgtgcccctgataggtaaatactacata gccacgatggccctgatcacagcctccactgcgttgaccatcatggtgatgaatatccac ttctgtggggccgaggcccggccggtgccacactgggccagggtggtcatcctgaaatac atgtccagggtcttgtttgtctatgatgtgggtgaaagctgcctcagcccgcaccacagt agagagcgggaccacctcacgaaagtttatagcaaactcccagagtctaacctgaaagca gccaggaacaaagacctttccagaaagaaggacatgaacaaacgcttaaagaacgacctg ggctgccagggtaagaaccctcaggaggccgagagttactgtgcacagtacaaagtgctg acgaggaatattgagtacatcgccaagtgcctcaaagaccacaaggccaccaattccaag gggagtgaatggaagaaggtggcgaaagtcatagaccgattcttcatgtggatttttttc attatggtgtttgtgatgactattttgatcatagcaagagcggattag >gi568815594f:40235468_40454517|GENSCAN_predicted_peptide_8|67_aa MVHQRCPSANVKDCLCVSFSVSLGLRPTLYQCNRDKPGNWTVLDKLETQVLEHLKQLWHS LIPGHPI >gi568815594f:40235468_40454517|GENSCAN_predicted_CDS_8|204_bp atggtccaccaaagatgtccaagtgctaatgtgaaggactgcctctgtgtctccttttcc gtcagcttagggctcagaccaacgctttaccaatgcaaccgggacaaacctggcaactgg actgtcttggacaaactggaaacacaagttctagaacatctgaagcaattgtggcactcg ctcattcctggacatcctatctaa >gi568815594f:40235468_40454517|GENSCAN_predicted_peptide_9|87_aa MGQNKGLQAPCKSEIQQGSQVLKLQNGLLCFHVSHPGHTDAKATRAMDTTNQEIKQFFQP SKTKEQRYDSKVAVHIVKKEYNVTRII >gi568815594f:40235468_40454517|GENSCAN_predicted_CDS_9|264_bp atgggccagaacaagggactacaggccccatgcaagtctgaaatccagcagggcagtcaa gtcttaaagctccaaaatggcctcctttgcttccatgtctcacatccaggtcacactgat gcaaaagcgacaagggccatggatacaactaatcaggaaataaaacagttcttccagcca agtaaaaccaaggagcaaagatatgattctaaagttgcagtccatattgtcaaaaaggaa tataatgtaaccaggatcatctga >gi568815594f:40235468_40454517|GENSCAN_predicted_peptide_10|56_aa MQLLYDPRGKGEGPRGWNLDLHKKQLTQGHEPYLMLNSAKVGFLHTFSSFKAATLN >gi568815594f:40235468_40454517|GENSCAN_predicted_CDS_10|171_bp atgcagctgctgtatgacccaagaggaaagggtgaaggtcccagaggctggaacctggac ctgcacaagaaacagttaacacagggccatgaaccctatttaatgttgaacagtgcaaag gttggatttcttcacactttctcctcctttaaggcagccaccctgaactga >gi568815594f:40235468_40454517|GENSCAN_predicted_peptide_11|604_aa XFPPAVDAFDIMTAEDSTAAMSSDSAAGSSAKVPEGVAGAPNEAALLALMERTGYSMVQE NGQRKYGGPPPGWEGPHPQRGCEVFVGKIPRDVYEDELVPVFEAVGRIYELRLMMDFDGK NRGYAFVMYCHKHEAKRAVRELNNYEIRPGRLLGVCCSVDNCRLFIGGIPKMKKREEILE EIAKVTEGVLDVIVYASAADKMKNRGFAFVEYESHRAAAMARRKLMPGRIQLWGHQIAVD WAEPEIDVDEDVMETVKILYVRNLMIETTEDTIKKSFGQFNPGCVERVKKIRDYAFVHFT SREDAVHAMNNLNGTELEGSCLEVTLAKPVDKEQYSRYQKAARGGGAAEAAQQPSYVYSC DPYTLAYYGYPYNALIGPNRDYFVKAGSIRGRGRGAAGNRAPGPRGSYLGGYSAGRGIYS RYHEGKGKQQEKGYELVPNLEIPTVNPVAIKPGTVAIPAIGAQYSMFPAAPAPKMIEDGK IHTVEHMISPIAVQPDPASAAAAAAAAAAAAAAVIPTVSTPPPFQGRPITPVYTVAPNVQ RIPTAGIYGASYVPFAAPATATIATLQKNAAAAAAMYGGYAGYIPQAFPAAAIQVPIPDV YQTY >gi568815594f:40235468_40454517|GENSCAN_predicted_CDS_11|1815_bp nngtttccgccagctgtggatgcctttgacattatgaccgcagaggattccaccgcagcc atgagcagtgactcggccgccgggtcctccgccaaggtgcccgagggcgtggcgggcgcg cccaacgaggcagcactgctggcgctgatggagcgcacgggctacagcatggtgcaagag aacgggcagcgcaagtacggcggcccaccgcccggctgggagggcccgcacccgcagcgt ggctgcgaggtcttcgtgggcaagatcccgcgcgacgtgtacgaggacgagctggtgccc gtgttcgaggccgtgggccgcatctacgagctgcgcctcatgatggactttgacggcaag aaccgcggctacgccttcgtcatgtactgccacaagcacgaggccaagcgcgcagtgcgt gagctcaacaactacgagatccgcccgggccgcctgctcggcgtgtgctgcagcgtggac aactgccgcctcttcatcggcgggatccccaagatgaagaagcgcgaggaaatcctggag gagattgccaaggtcaccgagggcgtgctggacgtgatcgtctacgccagcgcggccgac aagatgaagaaccgcggcttcgccttcgtggagtacgagagccaccgcgcggctgccatg gctcgccgcaagctcatgcctggccgcatccagctgtggggccaccagatcgccgtggac tgggccgaacctgagatcgacgtggacgaggacgtgatggagaccgtgaagatcctctac gtgcgcaacctcatgatcgagaccaccgaggacaccatcaagaagagcttcggccagttc aaccccggctgcgtggagcgcgtcaagaagatccgcgactacgccttcgtgcacttcacc agccgcgaggatgccgtgcatgccatgaacaacctcaacggcactgagctggagggctcg tgcctggaggtcacgctggccaagcccgtggacaaggagcagtactcgcgctaccagaag gcagccaggggcggcggcgcggctgaggcagcgcagcagcccagctacgtgtactcctgc gacccctacacactggcctactacggctacccctacaacgcgctcattgggcccaacagg gactactttgtgaaagcaggcagcataagaggccgagggcgaggtgcagctggcaacaga gccccagggcctaggggttcctacctcgggggatattctgctggtcgtggtatatatagc cgatatcatgaagggaaaggaaagcagcaagaaaaaggatatgaactggtgccgaatttg gaaatccctaccgtcaacccagttgccattaaacctggtacagtagccatccctgccatt ggggctcagtattccatgtttccagcagctccagcccctaaaatgattgaagatggcaaa atccacacagtggagcacatgatcagccccattgctgtgcagccagacccagccagtgct gctgccgccgcagccgcggccgcagccgccgcagccgctgtcattcccactgtgtcgacg ccaccacctttccagggccgcccaataactccagtatacacggtggctccaaacgttcag agaattcctactgccgggatctacggggccagttacgtgccatttgctgctccagctaca gccacgatcgccacactacagaagaacgcggcagccgcggccgccatgtatggaggatac gcaggctacatacctcaggccttccctgctgctgccattcaggtccccatccccgacgtc taccagacatactga