GENSCAN 1.0    Date run: 4-Nov-116    Time: 10:28:23

Sequence gi568815588r:32088323_32288817 : 200495 bp : 43.43% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  11274  11326   53  2  2   86   89    26 0.019   3.13
 1.02 Intr +  14643  14742  100  0  1   78   73   127 0.005  10.31
 1.03 Intr +  21570  21830  261  2  0   61   43   385 0.384  28.98
 1.04 Intr +  21882  22100  219  2  0   45   67   214 0.402  13.50
 1.05 Term +  22338  22376   39  2  0   66   28    42 0.493  -6.81
 1.06 PlyA +  22489  22494    6                               -0.45

 2.05 PlyA -  22880  22875    6                                1.05
 2.04 Term -  23498  23339  160  1  1   -4   41   179 0.483   1.41
 2.03 Intr -  27137  26795  343  0  1  -85   90   726 0.674  49.99
 2.02 Intr -  27458  27172  287  1  2   -9   32   860 0.586  67.69
 2.01 Init -  27718  27672   47  1  2   74   79    56 0.561   2.41
 2.00 Prom -  34282  34243   40                               -3.16

 3.00 Prom +  41061  41100   40                               -2.76
 3.01 Init +  58081  58162   82  0  1   61   94    49 0.196   3.93
 3.02 Intr +  73326  73454  129  1  0  109   21    40 0.148   0.07
 3.03 Intr +  73745  73809   65  0  2   94   94    -7 0.326  -1.06
 3.04 Intr +  74867  74968  102  1  0   75   77    34 0.342   1.37
 3.05 Intr +  77031  77088   58  2  1  113   91    14 0.435   2.66
 3.06 Intr +  79387  79471   85  2  1   62   71    55 0.163   0.08
 3.07 Term +  89356  89422   67  1  1  117   45    54 0.283   1.41
 3.08 PlyA +  90220  90225    6                                1.05

 4.03 PlyA -  90602  90597    6                                1.05
 4.02 Term - 100426  99998  429  1  0   77   38   468 0.998  36.00
 4.01 Init - 104229 104176   54  0  0   79  115    35 0.655   6.58
 4.00 Prom - 105067 105028   40                               -5.76

 5.03 PlyA - 105209 105204    6                                1.05
 5.02 Term - 111492 111403   90  0  0   93   45    56 0.330  -0.48
 5.01 Init - 121267 120542  726  1  0   32   13   268 0.120   8.51
 5.00 Prom - 141037 140998   40                               -4.66

 6.08 PlyA - 142195 142190    6                                1.05
 6.07 Term - 143893 143701  193  0  1   22   34   177 0.703   2.59
 6.06 Intr - 183595 183232  364  2  1  100  113    92 0.874   7.14
 6.05 Intr - 183845 183704  142  2  1   85  110    94 0.934  11.23
 6.04 Intr - 184959 184841  119  1  2   60   94   143 0.569  12.28
 6.03 Intr - 198413 198372   42  0  0   70  116    26 0.201   1.81
 6.02 Intr - 198628 198604   25  1  1  109   87   -19 0.213  -2.30
 6.01 Intr - 198952 198776  177  1  0   78   87   164 0.988  15.42

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  14730  14524  207  0  0   55   63   171 0.855  10.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:32088323_32288817|GENSCAN_predicted_peptide_1|223_aa
MVEGKGEAGPSSQGGRTECNKMYIKATLGVIDQTEILSHLVNADDIHESSRSHFTNNDSY
VFSASRMSTEANCASLLAVFHSDSLLDPAIFAEVSKLDGAVQDLRVARGNGSQIQYQQVC
ARYRVLCVPPNPLLYAWQGGHPLYLTSFFGGHILGDSLGMGHLLLRAKAVWLLYCLKTEH
PEDDVQSKQWLTHFLDQFANVKNSLALKKIEMGRFSTDIDLHI

>gi568815588r:32088323_32288817|GENSCAN_predicted_CDS_1|672_bp
atggtggaaggcaaaggagaagcaggaccttcttcacagggtggcagaacagagtgtaat
aagatgtacatcaaagcaactcttggtgtcatagatcagacagaaattctctctcatctt
gtcaatgctgatgacatccatgaatccagcaggagccatttcaccaacaacgactcctac
gtcttctcggcttccaggatgagcaccgaggccaattgcgcctcgcttttggcggtcttc
cacagcgactcactgctggacccggccatctttgcagaagtcagcaaactggacggtgcg
gtgcaggatctgcgtgtggcgcggggaaacggaagccagatccagtaccagcaggtgtgc
gcgaggtacagggtgctctgcgtgccccccaacccgctcctgtacgcctggcagggcggg
catcccctctacctgaccagcttcttcggaggacacatcttgggggacagcctaggaatg
ggccacttactcctgcgggccaaagccgtgtggctgctgtactgcctgaagaccgagcat
cctgaggacgacgtgcagagcaagcagtggctcacccatttcctcgaccaatttgccaac
gttaagaacagcctggccttgaagaaaattgagatgggccggttcagtacagacattgat
cttcatatttaa

>gi568815588r:32088323_32288817|GENSCAN_predicted_peptide_2|278_aa
MWLLFEATADLLTSAQRRKKQQKKQKRKRKKRKRRKKRRKKKEEEEEKKEKKEKEKKEKE
KEEKKKKKKKKKKKKKKKKKKKKKEEEEEEEEEEEEEEEEEEEEEEEESIQYRKKKKKKK
KEKEKEKEKEKEKEKKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEESIQC
GLHLRLWEKSRPTSGVILTMVDNDFPSNSSRTAFSKDTVRLQEAMRQLGSEPSRVRMDED
ALTTLKILIIGESGAGKSSLLLSFTDNTFDPELAATIA

>gi568815588r:32088323_32288817|GENSCAN_predicted_CDS_2|837_bp
atgtggttgctatttgaagcaactgcagacctgctcacctcggcacagagaaggaagaag
cagcagaagaagcaaaagaggaagaggaagaagaggaagaggaggaagaagaggaggaag
aagaaggaggaggaggaggagaagaaggagaagaaggagaaggagaagaaggagaaggag
aaggaggagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag
aagaagaagaaggaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gaagaagaagaagaagaagaagaaagtatccaataccgaaagaagaagaagaagaagaag
aaggagaaggagaaggagaaggagaaggagaaggagaaggagaagaaggaagaagaagaa
gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaagtatccaatgt
gggttgcatctgagactctgggagaaatctagaccgaccagtggggtcattctaacaatg
gtggacaatgattttccttcaaacagtagtaggacagcctttagcaaagacactgtgagg
ctgcaagaagcaatgagacagctgggctcagagcccagcagggtgaggatggatgaggat
gcgctgaccaccctgaagatcctcatcattggcgagagtggggcgggcaagtccagcctg
ctcttgagtttcacagataatacttttgatccagaacttgcagcaacaatagcatga

>gi568815588r:32088323_32288817|GENSCAN_predicted_peptide_3|195_aa
MLDDGIPSLLSVRYPVENSQIKELRLPDSPTCMGSLRMFRRLCEGPAHAARIPSGYCSSL
LLLCCHSLPAGQSKVGNNPSFLQIVIWFCMAQGSWRGFSLKWCEQQLWMTGTTGTTHGLL
GRAEVMVPLALPFPNPSSLVVPNAQAESNDYWFRPLTFGVVYDTTGANRSKHNRVFRNTT
STEVYGSCRKTQMHG

>gi568815588r:32088323_32288817|GENSCAN_predicted_CDS_3|588_bp
atgcttgatgatggcatcccatcccttctgtctgtgagataccccgtggaaaactctcag
atcaaagaactaaggttgccagacagtccaacctgcatgggctccctccgaatgtttcgg
agactctgtgaggggcccgcacatgctgcacgcattccttctggctattgctcctctttg
ctgctcctctgttgccactccttgccagctggtcaatcaaaggtgggaaataacccttcc
tttctacagattgtcatatggttctgtatggcccaggggtcgtggcgaggcttctccctg
aaatggtgtgagcagcagctgtggatgactggaacgactggaacgactcatggccttctt
gggagggcagaagtcatggttcctctagccctccctttccccaacccttcaagcctagtg
gtgcccaatgcccaagctgagtcaaatgattactggtttaggccactaacttttggtgtg
gtttatgatacaaccggagctaacagatccaaacacaacagggtttttaggaacaccaca
agcacagaggtgtacggttcctgcaggaaaacacagatgcatggctga

>gi568815588r:32088323_32288817|GENSCAN_predicted_peptide_4|160_aa
MVVQQCECTLMPLKMAKMLFADKFPKTAENFRALSTGEKGFSYKGSCFHRIIPGFMYQGR
DFTRHNGTGGKSIYGEKFEDENFILKHTGPGILSMANAGPNTNGSQFFICTAKTEWLDGK
HVVFGKVKEGMNIMEAMERFGSRNGKTSKKITIADCGQLE

>gi568815588r:32088323_32288817|GENSCAN_predicted_CDS_4|483_bp
atggttgtacaacaatgtgaatgtaccttaatgccgcttaaaatggctaaaatgctgttt
gcagacaagttcccaaagacagcagaaaattttcgtgctctgagcactggagagaaagga
tttagttataagggttcctgctttcacagaattattccagggtttatgtatcagggtcgt
gacttcacacgccataatggcactggtggcaagtccatctatggggagaaatttgaagat
gagaacttcatcctaaagcatacaggtcctggcatcttgtccatggcaaatgctggaccc
aacacaaatggttcccagtttttcatctgcactgccaagactgagtggttggatggcaag
catgtggtctttggcaaagtgaaagaaggcatgaatattatggaggccatggagcgcttt
gggtccaggaatggcaagaccagcaagaagatcaccattgctgactgtggacaactcgaa
taa

>gi568815588r:32088323_32288817|GENSCAN_predicted_peptide_5|271_aa
MLKGTVTKTAWYWYQNRDIDQWNRIEPSEIIPRIYNHLIFDKPDKNKKRGKDSLFNKWCW
ENWLAICRKLKLDPFFTPYTKINSRWIKDLHVRPKTIKTLEENLGNTIQDIGMGKDFMAK
TSKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYPSDKGLISRIYKEL
KQIYKKKTNNPIKKWVKDMNRHFSKEDIYAANGHMKKCSSSLAIREMQIKTTMTYHLTPV
RMLMINNKCPMQKVQGPQLKEIFHQNAINAL

>gi568815588r:32088323_32288817|GENSCAN_predicted_CDS_5|816_bp
atgctaaaaggtacagtaaccaaaacagcatggtactggtaccaaaacagagatatagac
caatggaacagaatagagccctcggaaataataccacgcatctacaaccatctgatcttt
gacaaacctgacaaaaacaagaaacgaggaaaggattccctatttaataaatggtgctgg
gaaaactggctagccatatgtagaaagctgaaactggatcccttctttacaccttataca
aaaattaattcaagatggattaaagacttacatgttagacctaaaaccataaaaacccta
gaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatggctaaa
acatcaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaag
agcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacagaatgggag
aaaatttttgcaatctacccatctgacaaagggctaatatccagaatctacaaagaactt
aaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggtgaaggatatgaac
agacacttctcaaaagaagacatttatgcagccaacggacacatgaaaaaatgctcatca
tcactggccatcagagaaatgcaaatcaaaaccacaatgacataccatctcacaccagtt
agaatgctaatgattaataataagtgccccatgcaaaaagtacaaggtcctcagctaaag
gaaatctttcaccaaaatgccatcaatgcactttga

>gi568815588r:32088323_32288817|GENSCAN_predicted_peptide_6|353_aa
QDKADLIRPKRKYEKKPKVLPSSAAATPQQTSPAALPVFNAKDLNQYDFPSSDEEPLSQE
SRLSVLCCVLDLHEDGLGAVEAFTAEQYQQHQQQLALMQKQQLAQIQQQQANSNSSTNTS
QGFVSKTLDSASAQFAASALVTSEQLMGFKMKDDVVLGIGVNGVLPASGVYKGLHLSSTT
PTALVHTSPSTAGSALLQPSNITQTSSSHSALSHQVTAANSATTQVLIGNNIRLTVPSSV
ATVNSIAPINARHIPRTLSAVPSSALKLAAAANCQVSKVPSSSSVDSVPRKFVTNQLLQR
KQMMIDVLHTWKATVPKTEIWEKLAKMYKTTLDVIFVFGLELILAVARQLALA

>gi568815588r:32088323_32288817|GENSCAN_predicted_CDS_6|1062_bp
caagataaagccgatcttatccgaccgaaacggaaatatgaaaagaagcccaaagtctta
ccatcgtctgccgctgctactccccaacagacgagtcctgctgcactgccagtcttcaat
gctaaagatctgaatcagtatgactttcccagctcagacgaagaacctctctcccaggaa
agcaggctgtcagtactatgctgtgtattggatttgcacgaagacgggttgggcgcggtg
gaagcatttacagccgaacaataccagcaacatcaacagcaactggcactcatgcagaaa
cagcagcttgcacaaattcagcaacagcaagcaaatagtaattcctccaccaacacatca
cagggttttgtttctaagactttggattctgctagtgcacagtttgctgcttctgctttg
gtgacatcagaacaactgatgggattcaagatgaaggatgatgtggtgcttggaatcggg
gtgaatggcgtccttccagcctcaggagtatacaagggcttacacctcagtagtactaca
ccaacagcacttgtacatacaagtccatcaacggcaggttcagctttgttacagccttca
aatattacacagacttcaagttcccacagtgcactgagtcatcaagtaactgctgccaat
tctgcaacaactcaggttctgattgggaacaacattcgattaactgtaccttcatcagtt
gccactgtaaactctattgccccaataaatgcacgacatatacctaggactttaagtgct
gttccatcatctgccttaaagctggccgctgcagcaaactgtcaagtttccaaggtccca
tcttcatcctctgtagattcagttccaagaaagttcgtgaccaaccaactacttcagagg
aaacaaatgatgattgatgtccttcatacctggaaggcaacagtacctaagacagaaatt
tgggaaaaactagccaaaatgtacaagaccacactggatgtcatctttgtgtttggatta
gaactcattttggcggtggcaagacaactggctttggcatga