GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:08:43 Sequence gi568815591r:135829929_136077100 : 247172 bp : 39.94% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 1421 1416 6 1.05 1.05 Term - 2766 2543 224 1 2 85 42 116 0.781 2.80 1.04 Intr - 4569 4376 194 2 2 60 56 167 0.777 8.91 1.03 Intr - 6708 6558 151 1 1 49 73 19 0.035 -5.20 1.02 Intr - 16306 16178 129 2 0 47 87 106 0.821 6.15 1.01 Init - 16942 16858 85 1 1 73 38 75 0.805 2.05 1.00 Prom - 18589 18550 40 -3.65 2.02 PlyA - 18873 18868 6 -0.45 2.01 Sngl - 19902 19000 903 0 0 60 34 327 0.987 20.46 2.00 Prom - 23310 23271 40 -5.25 3.14 PlyA - 24236 24231 6 1.05 3.13 Term - 27727 27562 166 0 1 62 53 130 0.829 3.31 3.12 Intr - 28898 28616 283 0 1 64 65 269 0.005 17.75 3.11 Intr - 34311 34222 90 1 0 80 53 66 0.031 1.35 3.10 Intr - 44588 44375 214 2 1 18 86 134 0.056 3.57 3.09 Intr - 48374 48304 71 0 2 109 62 57 0.133 3.18 3.08 Intr - 48598 48469 130 1 1 5 59 158 0.162 3.95 3.07 Intr - 65095 64993 103 0 1 33 29 134 0.003 1.26 3.06 Intr - 72138 71841 298 1 1 4 58 220 0.205 5.61 3.05 Intr - 79807 79694 114 2 0 69 60 57 0.246 0.50 3.04 Intr - 83390 83247 144 0 0 12 78 118 0.006 2.53 3.03 Intr - 89078 88971 108 0 0 41 96 64 0.028 1.84 3.02 Intr - 89532 89282 251 2 2 42 42 160 0.009 2.96 3.01 Init - 90705 90629 77 0 2 59 92 37 0.007 1.81 3.00 Prom - 91113 91074 40 -5.95 4.10 PlyA - 92541 92536 6 1.05 4.09 Term - 100084 99998 87 1 0 143 45 136 0.996 11.48 4.08 Intr - 120754 120671 84 1 0 85 100 45 0.852 4.50 4.07 Intr - 121702 121589 114 1 0 78 63 184 0.853 14.62 4.06 Intr - 147174 147101 74 1 2 63 121 106 0.130 9.61 4.05 Intr - 152420 152322 99 0 0 105 91 26 0.133 3.76 4.04 Intr - 169461 169365 97 0 1 50 70 129 0.224 5.96 4.03 Intr - 173668 173533 136 0 1 26 89 101 0.341 3.65 4.02 Intr - 178922 178805 118 0 1 94 98 72 0.769 7.40 4.01 Init - 180004 179998 7 1 1 53 110 0 0.647 -0.11 4.00 Prom - 181725 181686 40 -6.75 5.04 PlyA - 181953 181948 6 1.05 5.03 Term - 183064 182860 205 0 1 68 48 272 0.946 17.06 5.02 Intr - 203153 203086 68 2 2 35 115 18 0.001 -4.02 5.01 Init - 216963 216829 135 0 0 43 116 185 0.988 16.99 5.00 Prom - 217688 217649 40 -11.74 6.03 PlyA - 217844 217839 6 1.05 6.02 Term - 219518 219277 242 0 2 48 51 231 0.512 10.60 6.01 Init - 225198 225147 52 0 1 71 78 55 0.550 4.17 6.00 Prom - 228532 228493 40 -4.95 7.03 PlyA - 231132 231127 6 1.05 7.02 Term - 233977 233767 211 0 1 37 43 220 0.400 8.18 7.01 Init - 235609 235602 8 1 2 114 91 0 0.684 3.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 88357 88622 266 2 2 66 41 200 0.896 7.79 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:135829929_136077100|GENSCAN_predicted_peptide_1|260_aa MHTICPLALRPKVLDGITPLAVLGLQLAGSWRGVHYPASHDTRSHIPAQWSLKKRQKASR LSAQTDFTDPSYPLSLPPHQHSTMLSLTSGKTKALLPLSETLLPPPLCARIQITLEASGA GGGLWEPDASVFDNVEGNVCTAFFWVPLPLTGQLGQILPALSHSPKPQNQLPLPPPSSFS AHENGAPSTYQTPEPGLTKAGKRLGASGHQEMEPGTDCAATDPWCLQRSLNILISTRSSP SANTCPTAGCPPMPPLGEWS >gi568815591r:135829929_136077100|GENSCAN_predicted_CDS_1|783_bp atgcacaccatctgccccttggctctcaggcctaaggtcttagacggcattacaccactg gctgtcctgggtctccagcttgcaggctcttggagaggggttcactacccagccagccat gacacaagatctcacatacctgcacagtggtctctgaagaaaagacagaaagcctctcga ctctcagcgcagactgatttcacagatccgagctatcctctgtctctcccaccacaccaa cactcaactatgctctccctcacttctgggaagacaaaggcccttcttcccttatctgaa acactcttgcctccacctctctgtgccagaattcagataacattagaagcttcaggagca ggagggggactgtgggaaccggatgcttctgtgtttgacaatgtggaaggtaacgtctgc acggctttcttctgggtccctctgcccctcacaggccagctgggtcagatactcccggca ttatcacactctccaaaacctcagaaccagctccctttgccacctccttcttccttctct gctcatgaaaatggtgcaccctccacttatcaaaccccggagccagggctgaccaaggca ggtaagagacttggagcctcgggccaccaagagatggaacctggaacagattgtgctgcc accgatccttggtgcctgcagaggagcctgaacatcctcatatccactaggtcttcacct tctgccaacacctgccccacagctgggtgtcccccaatgcctcctctgggagaatggagc tga >gi568815591r:135829929_136077100|GENSCAN_predicted_peptide_2|300_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPQLNEIKEDTKKWKNIPCSWVGRINIV KMAILPKVIYRFNAILIKLPMTFFTELEKTTLKFIRNQKRARIAKSILSQKNKAGGITLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPDKNKQWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIDIGKD FMSKTPKAMAPKAMATKAKIDKWDLIKLKSFCTAKETTIRVTGNLQNGRKFSQPTHLTKG >gi568815591r:135829929_136077100|GENSCAN_predicted_CDS_2|903_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccacagctcaatgaaataaaa gaggatacaaagaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttatagattcaatgccatcctcatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatacggaaccaaaaaaga gcccgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacagaacagagccctcagaaataacaccacatatctacaac tatctgatctttgacaaacctgacaaaaacaagcagtggggaaaggattccctatttaac aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggattaaagacttaaacgttagacctaaaacc atcaaaaccctagaagaaaacctaggcattaccattcaggacatagacataggcaaggac ttcatgtctaaaacaccaaaagcaatggcaccaaaagcaatggcaacaaaagccaaaatt gacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcaga gtgacaggcaacctacaaaatgggagaaaattttcacaacctactcatctgacaaagggc taa >gi568815591r:135829929_136077100|GENSCAN_predicted_peptide_3|682_aa MQSFQSSSLEVHTSPWSPQKLTIAPRGQQRVPTLMGSTPELSNWTALMGPHRRQKPSSPR SLQRPTNTSDPGRELHLRLAILWNRNVVPRLYHTKQLCFPISPRLPKHRARGKSIAVVFD GAEENKEFLSEDNQTKCRMFWLYDPVAMSTPSTLILVSEHHSPVKEPRLLEGVVNSRSGQ ENCNVILEYLAPKNHIGELPHVAMAFVNCHGAGGSVAVRTTRSHSRGHLGFGGSVDAAHP HPPAGKGWGTLALKSRAAIMSQECAKHSGPTPSPVESGWGPGINAFMGLHSGSGGEGHCS HFETVFGWLNPSVIMRGANHQNKGSFSELVSIQEWYDDFLAEGPGELSDILTNGNHSYSR CLVTHSGTKTALDLREEEVDFVRNGEVRSMGYHTGQNEGKERNDTKIGVNFLRAETQSRS QPSQMEKGLFPLRIKYLGIQLTRDVKDLFKENYKPTYKGCEGPLQGELQTTAQGNKRRYK QMEEHSMLMCRKNQYHENGHTAQDQTPMPAAPSQICASVTLHSGDSLSAGLTNGRGSAWV RLRLHPQDPSLIEVTAEPDAPNDRIWGPRSTPSGASLGRQGHKLGKGEPQAIPEHQGHRK NSQQHRLWPGRWPRPREDLETPPQAARRTRTQDPSNGRAKRAVTQTELKYALCSPLCRQQ EGEKREGEKSCSPSGIPDLGDP >gi568815591r:135829929_136077100|GENSCAN_predicted_CDS_3|2049_bp atgcagtcatttcaaagcagctcccttgaggtccacactagcccatggtcaccccagaag ctaacaattgcccccaggggccagcagcgggtccccaccctgatgggttctactccagag ctctctaactggactgccctcatgggaccacataggagacagaagcccagctctcccagg tctctgcagcgacctactaatacttcagatccaggaagagagctgcacctgcgtttggca atcttatggaacaggaacgtggtccccaggctttatcacaccaagcagctgtgttttcca atatctcctcgtctccctaagcacagggccaggggaaaaagtattgctgtggtttttgat ggtgctgaggagaacaaggaattcctctctgaggataaccaaaccaaatgcagaatgttt tggttgtatgacccagtggctatgagcacacccagcaccctgatcttggtttctgaacat cattctccagtaaaagaaccaaggctccttgaaggagtggttaattctaggtctggacag gaaaactgcaatgtgatcctggagtatcttgcgcccaaaaaccatataggggaacttcct cacgttgccatggcatttgtaaactgtcatggtgctggtgggagtgtagcagtgaggacg accagaagtcactctcgtggccaccttggttttggtggcagtgttgatgctgcccatcct catccaccagcaggcaagggctggggcaccctagccctgaagtctagagctgccatcatg tcccaagagtgtgccaagcactctgggccaacacccagccctgtggagtctggctgggga ccaggaataaatgcatttatgggacttcacagtggctctggaggagaaggccactgcagt cactttgaaacagtgtttggttggctaaacccaagtgtcatcatgaggggtgccaatcat caaaacaaaggcagcttttcagagctagtgagcatacaggaatggtatgatgacttcctg gcagaaggccccggagagttgtcagacatcctgacaaatgggaatcactcctattctcgg tgtttggtcacacattcaggtaccaagacagcacttgatctgcgggaagaagaggtagat tttgtgagaaatggggaagtcaggtctatgggataccacacaggacagaatgaagggaag gaaagaaacgacacaaaaataggagtgaattttctaagagcagaaacgcagtcacgcagt cagccttctcagatggaaaagggattgttcccattgagaataaaatacctaggaatccaa cttacaagggatgtgaaggacctcttcaaggagaactacaaaccaacttacaagggatgt gaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaagatacaaa caaatggaagaacattccatgctcatgtgtagaaagaatcaatatcatgaaaatggccat actgcccaagatcaaacgccaatgccagcagctccttcacagatctgtgccagtgtcaca ctgcactcaggggactctttgtcagctggtctaacaaatggaagaggctctgcatgggtc cgcctgaggctacatccccaggatccgtccctcattgaggtgaccgctgagcctgatgct cccaatgaccggatctggggcccaagatccactcccagtggtgcctccctgggcaggcag ggccacaagctgggcaaaggggagccccaggccatccctgagcaccagggccacaggaaa aactcacagcagcatcgcctctggcctggacgctggcccaggcccagagaggacctggaa accccaccccaggctgcaaggaggacaagaactcaggatccatctaatggcagagctaaa agagctgtaacacagacagagctgaaatatgccctttgctcaccactttgtaggcaacaa gaaggagagaagagagaaggagagaagagctgcagcccttcagggatcccagacctagga gatccctga >gi568815591r:135829929_136077100|GENSCAN_predicted_peptide_4|271_aa MQGSMQLKGQRMYVIDLVPSHSILQIGHLEPRGDAGLVHVTHESSRMAYYSICISELALI FPFPSSIRTLEYLALASNAASAISTYQSQMLHMASYPKELSPPPAAAPTECDSSSAHITA HFSPLNTETLKIIFEERHRSQTVSAIPCSFVPVMCDKEFMWALKNGDLDEVKDYVAKGED VNRTLEGGRKPLHYAADCGQLEILEFLLLKGADINAPDKHHITPLLSAVYEGHVSCVKLL LSKGADKTVKGPDGLTAFEATDNQAIKALLQ >gi568815591r:135829929_136077100|GENSCAN_predicted_CDS_4|816_bp atgcagggttcgatgcaactcaaaggccagaggatgtatgttattgatctagtgcccagc cattccattttgcagattggacatctagagcccagaggggatgcaggacttgtccatgtc acacatgagtcttctcgtatggcctattattccatttgcatctccgagcttgccctcata ttcccgttcccttctagtatacgtactttagaatacctggctctggcttcaaatgctgct tctgctattagtacttaccagtcccagatgctgcacatggccagctaccctaaggaactc agtcctcctccagctgctgctcctacagaatgtgattcttccagcgcgcacatcactgcc cacttctcccctttaaatactgaaactctcaaaatcatctttgaagaaaggcacagatca cagactgtttctgcgattccatgttcatttgttccagtgatgtgcgacaaggagttcatg tgggccctgaaaaacggagacttggatgaggtgaaagactatgtggccaagggagaagat gtcaaccggacactagaaggtggaaggaaacctcttcattatgcagcagattgtgggcag cttgaaatcctggaatttctgctgctgaaaggagcagatattaatgctccagataaacat catattactcctcttctgtctgctgtctatgagggtcatgtttcctgtgtgaaattgctt ctgtcaaagggtgctgataagactgtgaaaggcccagatggactgaccgcctttgaagcc actgacaaccaggcaatcaaagctcttctccagtga >gi568815591r:135829929_136077100|GENSCAN_predicted_peptide_5|135_aa MEEAGRTVKGDITIRNRFRSEVTAGSEDERGNKPKNAGGLQQLEKFPWDIINITHMTSPH ALTYFIPCFTPEASETTNPPGGTNNSRRATLRAVTLTAKVCSFTPEASKTTNPQEGRNSE HIRTSEGTNSGHQRL >gi568815591r:135829929_136077100|GENSCAN_predicted_CDS_5|408_bp atggaagaggcaggtagaacagtcaaaggagatataacaatcagaaacagatttcgaagt gaagtgactgctggctctgaagatgaacggggcaacaagccaaagaatgcaggtggcctc cagcagctggagaagtttccatgggacatcatcaatataacacacatgacctctccccac gcactcacttattttattccatgcttcactcctgaagccagcgagaccacgaacccacca ggaggaacgaacaactccagacgcgccaccttaagagctgtaacactcactgcgaaggtc tgcagcttcactcctgaagccagcaagaccacgaacccacaagaaggaagaaactccgaa catatccgaacatcagaaggaacaaactccggacaccaacgcctttaa >gi568815591r:135829929_136077100|GENSCAN_predicted_peptide_6|97_aa MEQKEEHSIQIQRFYHLNHGDAYTHTVGSASFPPGTLPNFPNAPPIVSVRGRSDAIAVSE DKGATSQGMQVASRYWKGQGTRISPRASQRNLTLLTF >gi568815591r:135829929_136077100|GENSCAN_predicted_CDS_6|294_bp atggaacaaaaagaagaacacagtattcagatccagaggttctatcacctaaaccatgga gatgcctacactcacactgtgggttcagccagcttcccaccaggaacgctccccaacttc cccaacgctcctcctattgtctcggtcagaggtcgaagtgatgcaattgctgtatctgaa gataagggggcaacaagccaaggaatgcaggtggcctctagatactggaaagggcaagga accagaatctccccgagagcctcccagaggaacttaaccctgctgacattctga >gi568815591r:135829929_136077100|GENSCAN_predicted_peptide_7|72_aa MPRLVFCKAPFKVLEVLDQFTDGKEEGREEEKRKESVRAGTEVKECEVHSKKSGLVWTSI GYLEEKEIRIEG >gi568815591r:135829929_136077100|GENSCAN_predicted_CDS_7|219_bp atgcccagattggttttttgcaaggcaccgttcaaggtgctggaagtgctggaccagttt actgatggaaaagaagaaggaagagaggaagaaaaaagaaaggaatctgtacgagcaggg acagaagtcaaggagtgcgaggtacactccaaaaagagtggcctggtttggacaagtatc ggttacctagaggagaaagagatcaggatagaagggtag