GENSCAN 1.0    Date run: 7-Nov-116    Time: 21:59:03

Sequence gi568815591r:96588927_96809763 : 220837 bp : 38.01% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -   1136   1131    6                               1.05
 1.04 Term -   3734   3589  146  0  2   92   36    99 0.353   2.19
 1.03 Intr -  32104  31982  123  2  0   -8   99   142 0.029   5.34
 1.02 Intr -  33717  33619   99  1  0   90   76    37 0.024   1.86
 1.01 Init -  55248  55110  139  0  1   55   88   168 0.549  13.85
 1.00 Prom -  73399  73360   40                              -5.05

 2.08 PlyA -  74100  74095    6                               1.05
 2.07 Term -  75601  75329  273  1  0  -46   41   305 0.669   6.89
 2.06 Intr -  77352  77208  145  2  1   45   84   100 0.618   4.66
 2.05 Intr -  79970  79878   93  1  0  104  110    30 0.895   4.96
 2.04 Intr -  83882  83796   87  1  0   66   91    71 0.465   3.37
 2.03 Intr - 105965 105872   94  0  1   46  127   106 0.857   8.40
 2.02 Intr - 120715 120566  150  2  0   36   81    87 0.084   2.01
 2.01 Init - 120837 120762   76  0  1   64   86   101 0.174   8.80
 2.00 Prom - 123219 123180   40                              -5.45

 3.03 PlyA - 123380 123375    6                               1.05
 3.02 Term - 124213 124152   62  2  2  104   43    37 0.020  -2.21
 3.01 Init - 133031 132953   79  2  1   52   56   131 0.577   7.57
 3.00 Prom - 133854 133815   40                              -3.75

 4.02 PlyA - 136457 136452    6                               1.05
 4.01 Sngl - 161983 160589 1395  1  0   49   42   488 0.566  36.08
 4.00 Prom - 162250 162211   40                              -4.95

 5.05 PlyA - 163104 163099    6                               1.05
 5.04 Term - 165809 165534  276  2  0   32   54   187 0.198   4.18
 5.03 Intr - 171119 170973  147  2  0   72   53    65 0.394   0.91
 5.02 Intr - 171848 171465  384  2  0  -16   98   203 0.552   5.02
 5.01 Init - 172260 172117  144  0  0   78  -10   139 0.488   3.27
 5.00 Prom - 190138 190099   40                              -3.85

 6.00 Prom + 196216 196255   40                              -5.25
 6.01 Init + 196557 196673  117  2  0   55   98    68 0.641   4.75
 6.02 Intr + 197228 197369  142  1  1   36   60   108 0.538   1.81
 6.03 Intr + 206248 206398  151  2  1   33   80   150 0.848   7.00
 6.04 Intr + 207508 207637  130  1  1   42   77    83 0.631   2.38
 6.05 Intr + 208889 208989  101  1  2  108   94    24 0.718   2.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:96588927_96809763|GENSCAN_predicted_peptide_1|168_aa
MSKLPFILPRHLEEQTPENKPVKQAPENKPEQASKGLHLMASYILGGLDTVNWRNSGENC
QYSGVRLHIDHAANKKEGKDGSNLGQTSEGVLEDDDMWSDTTKSNCPKTHKAPNYQNHST
DCCPEYVCFVSACANGFLCSVLQRKTARDSRCHGAAEKCILRAQEEPV

>gi568815591r:96588927_96809763|GENSCAN_predicted_CDS_1|507_bp
atgtctaaactgcctttcatccttccacgtcatttagaggagcaaaccccagagaacaag
ccagtaaagcaagccccagagaacaagccagaacaagccagtaaagggttacacctaatg
gcaagttacatccttggagggctggatacagtgaactggaggaatagcggggagaattgc
caatactcaggagtcaggcttcacattgatcatgcagctaataagaaagaaggcaaagat
gggtcaaatctggggcagacctcagagggcgtcctggaagatgatgacatgtggagtgac
accaccaaatccaactgccccaaaacacacaaagccccaaactaccaaaatcactcaaca
gattgctgccccgaatacgtgtgctttgtctctgcttgtgctaatggttttctgtgttcc
gttctccaacggaagacagcacgtgacagccggtgtcatggtgctgctgaaaaatgcatt
ctccgtgcccaggaagaacctgtataa

>gi568815591r:96588927_96809763|GENSCAN_predicted_peptide_2|305_aa
MSEKKQPVDLGLLEEDDEFEEFPAEGLRVEVAWVLGPSWGHRREVWTPRAGTRGANAHPS
VAPQIRVWGVALGLTDWAGLDEDEDAHVWEDNWDDDNVEDDFSNQLRKDPGDQVPDPVEE
EVLLHLAGAGQRTGKRGPSSYRDTSLMDPSTPVGPSYSLYLQEPLSRGDHRASVHRHKTR
AQFPDLGYKDDAYTGKTFKISFKNKSPRDSALAVSYLKAEDGGQRIDQRDATRELLNLSL
LSLKMEEGSQKQKNLEASEMKNSPHVMVNKQGTQSYNLKERNCANNAKEQGTDSLLEPPG
RNAVG

>gi568815591r:96588927_96809763|GENSCAN_predicted_CDS_2|918_bp
atgtcagagaaaaagcagccggtagacttaggtctgttagaggaagacgacgagtttgaa
gagttccctgccgaaggcctccgtgtggaggtagcttgggtgctcgggcccagctggggg
caccggcgggaagtttggactcccagggccggcactcgaggcgctaacgctcacccctca
gtggcgccccagattcgggtttggggtgttgcgctgggactcactgactgggctggctta
gatgaagatgaagatgcacatgtctgggaggataattgggatgatgacaatgtagaggat
gacttctctaatcagttacggaaggaccctggtgaccaggtgcctgaccctgtagaagag
gaagtgttattgcatcttgcaggggcaggacagaggacagggaaaagggggccatcttct
tatagggacactagtctgatggacccgtctactccagtaggaccatcttattcattatat
ctgcaagaacccctttccagaggagaccacagggcctccgttcataggcataagacaaga
gcccagtttccagatctggggtacaaggatgatgcatacactgggaaaacttttaagatt
tctttcaagaacaagagtcctagagattcggccttggctgtgtcttacttaaaagcagaa
gatggaggccaaagaattgatcagagagatgcaactcgagaacttctcaacctgtccttg
ctgtctttgaagatggaggaggggagccaaaagcagaagaatctggaggcctctgaaatg
aagaacagccctcatgttatggtgaataaacagggaactcagtcctacaacctcaaagaa
cggaattgtgccaacaatgcaaaagaacagggaacagattctctcctagagcctccagga
aggaatgcagtcggctaa

>gi568815591r:96588927_96809763|GENSCAN_predicted_peptide_3|46_aa
MLPEDADILITYDLNNLKEHKETPLMCPIPCGSCQGLGLAPSEAMA

>gi568815591r:96588927_96809763|GENSCAN_predicted_CDS_3|141_bp
atgctgccggaagatgctgacattcttataacttatgacctgaacaatctgaaggagcac
aaagaaacaccactgatgtgcccaataccgtgtggaagctgccaaggcttggggcttgca
ccctctgaagccatggcctga

>gi568815591r:96588927_96809763|GENSCAN_predicted_peptide_4|464_aa
MWQRLWNCVTGRGWNSLAGSEEVRKMWETLGLPGDSLNGSSQNADSDMDNEAQAEVVSDG
DEETIGNWSKGHSCYALAKRLVAFFPCSSDLWNFELERDDLGYLVEEISKQQSIQEVTWV
LLKAFSFMHSQRDGLILELMFTREAKHKCLENLQPDDVIEKKNPFSGDKFKLAAEICISN
EGLNVNHQDNGKNVSRACWRSWQQPLPSQAQRPRRKNGFMFQDPYCSVQPQDIAPCVSAV
SAPVVAKRDQGTAWAMASEGASPKPWQLTHGVWPVGSQKSRIEVWVPLPRFQQMYKNAWI
CRKMLATGVEPSWRTSARAVQKGNVGLEPLNRVPTGALPSGAVRRRPSFSRPQNGRSTNN
LHCVPGKAADAQHQPVKAAKRGAVPCKATGAELPKTVGAHLLHQHDPDVRHRVKGGHFSP
LRFNGYSAGFQTCMGPVAPFVLANFSHLECVYLPNACTPSVSWK

>gi568815591r:96588927_96809763|GENSCAN_predicted_CDS_4|1395_bp
atgtggcagcggctttggaactgtgtaacaggcagaggatggaacagtttggcgggttca
gaagaagtcaggaagatgtgggaaactttgggacttcctggagactcactgaatggttca
agccaaaatgctgatagtgatatggacaatgaagcccaggctgaagtggtctcagatgga
gatgaggaaactattggaaactggagtaaaggtcactcttgctatgctttagcaaagaga
ttggtggcatttttcccctgttctagcgatctgtggaactttgaacttgagagagatgat
ttagggtatctagtggaagaaatttctaagcagcaaagcattcaagaggtgacctgggtg
cttttaaaagcattcagctttatgcattcacaaagagatggtttgatattggagcttatg
tttacaagggaagcaaagcataaatgtttggaaaatttgcagcctgatgatgtgatagaa
aagaaaaacccattttctggggataaattcaagctggctgcagaaatttgcataagtaat
gaggggctgaatgttaatcaccaagacaatgggaaaaatgtctctagggcatgttggaga
tcttggcagcagcccctcccatcacaggcccaaaggcctaggaggaaaaatggtttcatg
ttccaggacccctactgctctgtgcagcctcaggacatagcaccctgtgtctcagctgtt
tcagctccagttgtggctaaaagggaccaaggtacagcttgggccatggcttcagagggt
gcaagccccaagccttggcaacttacacatggcgtttggcctgtgggttcacagaagtca
agaattgaggtttgggtacctctacctcgatttcagcagatgtataaaaatgcctggata
tgcaggaagatgcttgctacaggagtggagccctcatggagaacctctgctagggcagtg
cagaagggaaatgtggggttggagcccctaaacagagtccccactggggcactgcctagt
ggagctgtgagaaggaggccatcattctccagaccccagaatggtagatccaccaacaat
ttgcactgtgtgcctggaaaagctgcagacgctcaacatcagcctgtgaaagcagccaag
aggggggctgtaccctgcaaagccacaggggcagaactgcccaagaccgtgggagcccat
cttttgcatcagcatgatccagatgtgagacacagagtcaaaggaggtcatttcagccct
ttaagatttaatggctactctgctgggtttcagacttgcatggggcctgtagcccccttt
gttttggccaatttctcccatttggaatgtgtgtatttacccaatgcctgtacccccagt
gtatcttggaagtga

>gi568815591r:96588927_96809763|GENSCAN_predicted_peptide_5|316_aa
MDPNQEEIPGLPEKEFRRLVIKLIREAPEKGEAQCKEIQKTIQEVKGEEEEKSESLENIF
GKRIEENFASLARDLDIQIQEAQRTSGKFIAKRSSPRFIVSRLSKVKMEARILRAVRQQH
QVTCKGKPIRLTAAFSAETLQARRDWGPIFILLKHNNYQPRILYPVKLSIIIYEGKETHI
NLKGWKKVFRANGRQKRAGVAILISEKANFKAIAVKRDKEGHYIMSKPPPFLTWVTAILF
SISQDSNTEPSGYTYRYTREDLLGELANVIMEAEKSHNKTSSSGRIQEAGIMSQSKSKGF
RTREAHVIALSPRPKA

>gi568815591r:96588927_96809763|GENSCAN_predicted_CDS_5|951_bp
atggatccaaaccaagaagaaatccctggtttacctgaaaaagaattcaggaggctagtc
attaagctaatcagagaggcaccagagaaaggtgaagcccaatgcaaggaaatccaaaaa
acaatacaagaagtgaagggagaagaagaagagaaatctgaaagtttggaaaacatattt
gggaaaagaattgaggaaaacttcgccagccttgcgagagacttagacatccaaatacaa
gaagcacaaagaacatctgggaaattcattgcaaaaagatcatcaccaagattcattgtc
agcaggttatctaaagttaagatggaggcaagaatcttaagagctgtgagacaacagcac
caggtaacctgtaaaggaaaacctatcaggttaacagcagctttctcagcagaaacccta
caagccagaagggattggggccctatcttcatcctccttaaacacaacaattatcagcca
agaattttgtatccagtgaaactcagcatcatcatatatgaaggaaaggagactcacata
aacttaaaggggtggaaaaaggtatttcgtgcaaatggacgccaaaagcgagcaggggta
gctattcttatatcagaaaaagcaaactttaaagcaatagcagttaaaagagacaaagag
ggacattatataatgtccaaaccaccgccgtttctcacctgggttactgcaatactcttc
tctattagtcaggattctaacacagaaccatcaggatacacatatagatatacgagagag
gatttattaggggaactggctaatgtcattatggaggctgagaagtcccacaataagaca
tcttccagtgggagaatccaggaagctggtatcatgtctcagtccaagtccaaaggcttc
agaaccagggaagcccatgttatagccctcagtccaagaccaaaggcctga

>gi568815591r:96588927_96809763|GENSCAN_predicted_peptide_6|214_aa
MLQALEIQRRVNPSPQEEYNLRKKAGKSMKYYKTLISAKVLGNFTTIMKSGAQQAQLFSK
YLKDTVADSLATSSQQIQLNRACTGTHSTQQLEDRASDQGRSGIGLEGSYLDSRKVTIQI
MGWIYILEVRFYICAEKFMSQLRSRFSQSVSSIHVPGMTLPSSLNPRSHQATHPLVEPGL
CLLVHLFLGLNSSSVVHCAISGLSLCRCWCAFAY

>gi568815591r:96588927_96809763|GENSCAN_predicted_CDS_6|642_bp
atgttacaagcactggagatacaaagacgagtaaatcccagccctcaagaagagtataat
ctcaggaaaaaggcaggcaaatcaatgaagtattacaagacactgataagtgctaaggtc
ttaggcaatttcaccacaattatgaaaagtggtgcccagcaagcccagctcttcagcaaa
tatctgaaagacactgttgcggacagcctagcaacttcttcccaacagatacaactcaat
cgtgcatgtacgggcacacattctacacagcaactggaggacagggcctcagatcaagga
agaagtgggattggtttggagggatcctatctggactcaaggaaagtgacaatacaaatt
atgggatggatctacatcctcgaggtacgtttttatatctgtgcagaaaagtttatgagt
caactgagaagcagattttcacagtccgtatcctcaatccatgtccctggcatgacactc
ccctcaagcctcaaccctcgatcccaccaggccacgcatcccctagtggagccaggtttg
tgtcttctggtgcacttgttccttggacttaattcttcatccgtagttcactgtgctatt
tcaggcctaagtctttgcagatgctggtgtgcctttgcctan