GENSCAN 1.0    Date run: 4-Nov-116    Time: 02:17:35

Sequence gi568815586r:112305227_112508656 : 203430 bp : 42.84% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 Intr -   1008    838  171  0  0   70  103    94 0.978   8.32
 1.06 Intr -   3665   3527  139  1  1   87   97   110 0.988  11.35
 1.05 Intr -  14516  13999  518  2  2  104  110   605 0.993  55.13
 1.04 Intr -  33111  32811  301  2  1   46   37   201 0.206   6.61
 1.03 Intr -  47987  47849  139  0  1   48   55    89 0.008   0.20
 1.02 Intr -  56503  56356  148  1  1   27   61    81 0.001  -1.81
 1.01 Init -  76902  76726  177  0  0  100   83   449 0.511  42.91
 1.00 Prom -  88347  88308   40                              -6.75

 2.09 PlyA -  92959  92954    6                               1.05
 2.08 Term - 100150  99998  153  1  0   88   36   193 0.999  11.04
 2.07 Intr - 100811 100627  185  0  2  104  107   291 0.999  31.19
 2.06 Intr - 101116 101068   49  1  1   85   99    28 0.998   0.93
 2.05 Intr - 101664 101521  144  0  0   56  100   167 0.998  14.26
 2.04 Intr - 103112 102936  177  2  0   97   93   212 0.553  21.79
 2.03 Intr - 103430 103194  237  2  0   26  116   315 0.767  24.99
 2.02 Intr - 104196 104093  104  1  2   56   68   153 0.737   9.07
 2.01 Init - 104454 104361   94  0  1   82  116   114 0.900  14.20
 2.00 Prom - 108512 108473   40                              -6.65

 3.03 PlyA - 108808 108803    6                               1.05
 3.02 Term - 109902 109848   55  2  1   89   49    38 0.351  -3.95
 3.01 Init - 113887 113502  386  1  2   40   64   321 0.741  19.41
 3.00 Prom - 115004 114965   40                              -5.55

 4.00 Prom + 125072 125111   40                              -9.45
 4.01 Init + 126322 126401   80  0  2   71   -5   123 0.332   1.88
 4.02 Intr + 141050 141172  123  2  0   83   97    92 0.989   8.38
 4.03 Intr + 145092 145286  195  0  0   43   86   282 0.990  21.11
 4.04 Intr + 147969 148161  193  0  1   49   99   229 0.999  18.67
 4.05 Intr + 149338 149454  117  0  0   49   99   157 0.999  12.64
 4.06 Intr + 150724 150837  114  0  0    7  107    97 0.898   3.22
 4.07 Intr + 167718 167814   97  2  1  142   95    61 0.998  10.86
 4.08 Intr + 172425 172504   80  1  2  123  103   102 0.999  13.55
 4.09 Intr + 172631 172789  159  1  0   36   94   142 0.988   8.86
 4.10 Intr + 176848 176979  132  0  0  101  108   154 0.986  18.62
 4.11 Intr + 181249 181403  155  0  2   59   61   221 0.999  14.45
 4.12 Intr + 183217 183284   68  1  2  118   98   100 0.923  11.63
 4.13 Intr + 183798 183949  152  1  2   93   81   239 0.984  22.66
 4.14 Intr + 196076 196108   33  1  0  130   98    -5 0.551   2.20
 4.15 Intr + 196918 197030  113  0  2   67   89    94 0.960   5.66
 4.16 Term + 199469 199538   70  2  1  127   54    67 0.882   3.63
 4.17 PlyA + 200431 200436    6                              -0.45

 5.02 PlyA - 200923 200918    6                               1.05
 5.01 Sngl - 202566 202315  252  0  0   65   35   188 0.822   6.04

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:112305227_112508656|GENSCAN_predicted_peptide_1|531_aa
MGSSAAAAAAAAAAADSAQWLSVKEETIFLHDGLIRVTDLAELPSEILGAPEAADTDLEG
PSGDPYEHRVSLVVKTVGFGIDTSRQLLVQFYHLSEFQFLGYKMPPFKELSRNIIRHSFN
LVMVAASQVAVSQLLGSYEILLLVSIELMFCFGLGYFFIPMQEWPNTYGERVFVDVESSV
FKWNHKCLHKTEAERDYTKKRLKLCGHKPGNAVGQQKLEEARNRFFTRAPGGSAALPTLR
FQPSDTDFRLLASRTILTFETKNPSELAERLRSVCGNQSNAYARLLEYRLNALRGLWNAQ
RQLALEEQHERESSGDEETLALLKRQGLLQQPEQAPFTSRMGLLLVFPLIQSQSRTDPSL
CNITAEVLLNCLRDCQPLSLTKEPADCLNGIETLLCSWLEETSDTGRHIPHKQKENAAAA
LVALACARGFVYCRNEELEPGWVAFGSGSLLHRPVSFDNKPHSLFQVIDQNTLQVCQVVP
MPANHLPIGSTMSTVHLSSDGTYFYWIWSPASLNEKTPKGHSVFMDIFELV

>gi568815586r:112305227_112508656|GENSCAN_predicted_CDS_1|1593_bp
atgggctcgtcggcggccgcggcggcggcggcggcggcggccgctgactcggcgcagtgg
ctctcggtgaaggaagagaccatcttcctgcacgacgggctgatccgggtcaccgacctg
gccgagctgcccagcgagatcctcggggccccagaggccgcggacaccgacctggagggt
ccttcaggggatccttatgaacacagagtgagcctagtggttaagactgtaggctttggc
attgacacatctcgccagctgttagttcagttctaccacctatcagaatttcaatttctt
ggttataaaatgccccccttcaaggagctgtcaagaaacatcatcagacatagttttaac
ctggtaatggtggctgccagtcaggtggctgtcagtcagctccttggttcttatgaaatt
cttctgctggtttccatagagttgatgttctgttttggtctggggtatttctttatacca
atgcaagaatggcctaacacatatggagaaagggtctttgtagatgtggaatcatcagta
tttaaatggaatcataagtgtctacataagacagaggcagagagagattacacaaagaag
agattaaagttatgtggccacaagccagggaatgctgttggccaacagaagctggaagaa
gcaagaaaccgattctttactagagcccctggagggagtgcagccctgccgacacttcga
tttcagcccagtgataccgatttcagacttctagcctccagaactattttaacctttgag
accaagaacccgtcggaattagcagaacgtttgcgctctgtttgtgggaatcagagcaat
gcctatgcccggctgctggaataccgcctgaatgccttgcgaggactgtggaatgcccag
cgccagctggccttagaagaacagcatgaaagggaaagttcaggtgatgaggaaactttg
gccctgctcaaacgccagggcttgttgcagcaacctgagcaagcgcccttcacatcgagg
atggggctcctgctggtcttcccccttattcagtcccagagtagaacagacccctctctc
tgtaacataactgctgaggtgctattgaactgccttcgtgactgccagcctttgagcttg
accaaggagcctgctgactgtctcaatggaattgaaactttgctgtgctcttggctagag
gagacttctgacacaggccgacacatcccacataagcaaaaagaaaatgctgctgctgcc
ctggtggctttggcttgtgccagaggttttgtgtactgccggaacgaggagttggaacca
ggatgggtggcttttggcagcggcagtcttctccaccggcctgtctctttcgataataaa
cctcactcccttttccaggtcattgaccagaacacccttcaggtgtgccaggtggtgcca
atgccagccaatcacctccccattggcagcaccatgagcactgtgcacctgtcttcagat
ggcacttacttctattggatctggtctcctgccagcctgaatgagaaaacaccgaaggga
cattctgtcttcatggacatttttgaacttgtg

>gi568815586r:112305227_112508656|GENSCAN_predicted_peptide_2|380_aa
MPQIRKEVRLALPMIAYRPEAGTLILFPILQGHKEGEVECRSGSLRFFATLLGWGNEALS
GTRSRLMAGEKVEKPDTKEKKPEAKKVDAGGKVKKGNLKAKKPKKGKPHCSRNPVLVRGI
GRYSRSAMYSRKAMYKRKYSAAKSKVEKKKKEKVLATVTKPVGGDKNGGTRVVKLRKMVR
CGDCKLDFLFMLEYCEDVFEYLDTPRYYPTEDVPRKLLSHGKKPFSQHVRKLRASITPGT
ILIILTGRHRGKRVVFLKQLASGLLLVTGPLVLNRVPLRRTHQKFVIATSTKIDISNVKI
PKHLTDAYFKKKKLRKPRHQEGEIFDTEKEKYEITEQRKIDQKAVDSQILPKIKAIPQLQ
GYLRSVFALTNGIYPHKLVF

>gi568815586r:112305227_112508656|GENSCAN_predicted_CDS_2|1143_bp
atgccccagatccggaaggaagtgagactcgcacttcccatgattgcttatagaccggaa
gccgggaccttaattctctttcccatcttgcaagggcacaaagagggtgaggtagaatgc
cgtagtggctccctccggttcttcgccactctcttgggctgggggaacgaggccctttcg
gggacccgtagcagactgatggcgggtgaaaaagttgagaagccagatactaaagagaag
aaacccgaagccaagaaggttgatgctggtggcaaggtgaaaaagggtaacctcaaagct
aaaaagcccaagaaggggaagccccattgcagccgcaaccctgtccttgtcagaggaatt
ggcaggtattcccgatctgccatgtattccagaaaggccatgtacaagaggaagtactca
gccgctaaatccaaggttgaaaagaaaaagaaggagaaggttctcgcaactgttacaaaa
ccagttggtggtgacaagaacggcggtacccgggtggttaaacttcgcaaaatggtaaga
tgtggggactgtaaattggattttctgtttatgcttgaatactgtgaagatgtttttgaa
tacttagatactcctagatattatcctactgaagatgtgcctcgaaagctgttgagccac
ggcaaaaaacccttcagtcagcacgtgagaaaactgcgagccagcattacccccgggacc
attctgatcatcctcactggacgccacaggggcaagagggtggttttcctgaagcagctg
gctagtggcttattacttgtgactggacctctggtcctcaatcgagttcctctacgaaga
acacaccagaaatttgtcattgccacttcaaccaaaatcgatatcagcaatgtaaaaatc
ccaaaacatcttactgatgcttacttcaagaagaagaagctgcggaagcccagacaccag
gaaggtgagatcttcgacacagaaaaagagaaatatgagattacggagcagcgcaagatt
gatcagaaagctgtggactcacaaattttaccaaaaatcaaagctattcctcagctccag
ggctacctgcgatctgtgtttgctctgacgaatggaatttatcctcacaaattggtgttc
taa

>gi568815586r:112305227_112508656|GENSCAN_predicted_peptide_3|146_aa
MFLPPSGSATDPLLAQAPLGSVTSGWPEGGPLGTGRGAEPAGRAQTPLQAWGSRRLCSCP
PARLAPPPPPPFAAVAPVMSPLATPAATSCFPSAEDRATAGNGSAGAPRRRHAQCPKHPC
FLLETAAARGFFLTVLHINSMPLISG

>gi568815586r:112305227_112508656|GENSCAN_predicted_CDS_3|441_bp
atgttcctcccgccctccggctccgcgacggacccgctccttgctcaggctccgctgggc
tcggtcacatcgggctggcccgagggaggcccgctcgggaccggacgcggggcagagcca
gccggccgcgcacagacccccctccaggcctggggatcccggagactgtgcagctgcccc
ccggcccggctcgctcctcctccgcccccgcccttcgccgccgtcgcccccgtgatgtca
ccgctcgcgacgcccgccgccacttcctgcttcccgtcagcggaggaccgagcgacggcc
gggaatggcagcgcgggggctccgcggaggcgccacgcacagtgtccaaagcatccttgc
ttcctcctggaaaccgcggccgccagagggttcttcctgactgtcctccacatcaactct
atgcctctgatctcaggctag

>gi568815586r:112305227_112508656|GENSCAN_predicted_peptide_4|626_aa
MGDLTVGPPEGKAHVKAACTNWRKSETWFHPNITGVEAENLLLTRGVDGSFLARPSKSNP
GDFTLSVRRNGAVTHIKIQNTGDYYDLYGGEKFATLAELVQYYMEHHGQLKEKNGDVIEL
KYPLNCADPTSERWFHGHLSGKEAEKLLTEKGKHGSFLVRESQSHPGDFVLSVRTGDDKG
ESNDGKSKVTHVMIRCQELKYDVGGGERFDSLTDLVEHYKKNPMVETLGTVLQLKQPLNT
TRINAAEIESRVRELSKLAETTDKVKQGFWEEFETLQQQECKLLYSRKEGQRQENKNKNR
YKNILPFDHTRVVLHDGDPNEPVSDYINANIIMPEFETKCNNSKPKKSYIATQGCLQNTV
NDFWRMVFQENSRVIVMTTKEVERGKSKCVKYWPDEYALKEYGVMRVRNVKESAAHDYTL
RELKLSKVGQGNTERTVWQYHFRTWPDHGVPSDPGGVLDFLEEVHHKQESIMDAGPVVVH
CSAGIGRTGTFIVIDILIDIIREKGVDCDIDVPKTIQMVRSQRSGMVQTEAQYRFIYMAV
QHYIETLQRRIEEEQGIIRTKNKVLKKSKRKGHEYTNIKYSLADQTSGDQSPLPPCTPTP
PCAEMREDSARVYENVGLMQQQKSFR

>gi568815586r:112305227_112508656|GENSCAN_predicted_CDS_4|1881_bp
atgggggatctgactgtgggaccaccagagggaaaagcacatgtaaaagctgcgtgtacc
aactggaggaaatcggagacatggtttcacccaaatatcactggtgtggaggcagaaaac
ctactgttgacaagaggagttgatggcagttttttggcaaggcctagtaaaagtaaccct
ggagacttcacactttccgttagaagaaatggagctgtcacccacatcaagattcagaac
actggtgattactatgacctgtatggaggggagaaatttgccactttggctgagttggtc
cagtattacatggaacatcacgggcaattaaaagagaagaatggagatgtcattgagctt
aaatatcctctgaactgtgcagatcctacctctgaaaggtggtttcatggacatctctct
gggaaagaagcagagaaattattaactgaaaaaggaaaacatggtagttttcttgtacga
gagagccagagccaccctggagattttgttctttctgtgcgcactggtgatgacaaaggg
gagagcaatgacggcaagtctaaagtgacccatgttatgattcgctgtcaggaactgaaa
tacgacgttggtggaggagaacggtttgattctttgacagatcttgtggaacattataag
aagaatcctatggtggaaacattgggtacagtactacaactcaagcagccccttaacacg
actcgtataaatgctgctgaaatagaaagcagagttcgagaactaagcaaattagctgag
accacagataaagtcaaacaaggcttttgggaagaatttgagacactacaacaacaggag
tgcaaacttctctacagccgaaaagagggtcaaaggcaagaaaacaaaaacaaaaataga
tataaaaacatcctgccctttgatcataccagggttgtcctacacgatggtgatcccaat
gagcctgtttcagattacatcaatgcaaatatcatcatgcctgaatttgaaaccaagtgc
aacaattcaaagcccaaaaagagttacattgccacacaaggctgcctgcaaaacacggtg
aatgacttttggcggatggtgttccaagaaaactcccgagtgattgtcatgacaacgaaa
gaagtggagagaggaaagagtaaatgtgtcaaatactggcctgatgagtatgctctaaaa
gaatatggcgtcatgcgtgttaggaacgtcaaagaaagcgccgctcatgactatacgcta
agagaacttaaactttcaaaggttggacaagggaatacggagagaacggtctggcaatac
cactttcggacctggccggaccacggcgtgcccagcgaccctgggggcgtgctggacttc
ctggaggaggtgcaccataagcaggagagcatcatggatgcagggccggtcgtggtgcac
tgcagtgctggaattggccggacagggacgttcattgtgattgatattcttattgacatc
atcagagagaaaggtgttgactgcgatattgacgttcccaaaaccatccagatggtgcgg
tctcagaggtcagggatggtccagacagaagcacagtaccgatttatctatatggcggtc
cagcattatattgaaacactacagcgcaggattgaagaagagcagggtatcatcagaacc
aaaaataaagttttaaagaaaagcaagaggaaagggcacgaatatacaaatattaagtat
tctctagcggaccagacgagtggagatcagagccctctcccgccttgtactccaacgcca
ccctgtgcagaaatgagagaagacagtgctagagtctatgaaaacgtgggcctgatgcaa
cagcagaaaagtttcagatga

>gi568815586r:112305227_112508656|GENSCAN_predicted_peptide_5|83_aa
MAAKLPQSLKGLRDLRKELPPGLDLGLPFRSQSQKNPRRPQLHGKDAIALLPSQKSTNDI
FIEISALDDSTYEAKRDVFGEGT

>gi568815586r:112305227_112508656|GENSCAN_predicted_CDS_5|252_bp
atggcagccaaactaccccaaagtctcaagggactgagagacctcagaaaggaactacca
ccagggctggatctaggccttcctttcagaagccaaagtcagaagaatccaaggaggcca
caacttcatggcaaagatgccatagcattattaccatcacaaaagtcaaccaatgatatt
ttcattgaaatatccgcactggatgattcaacatatgaggcaaagagggatgtctttgga
gaaggcacttaa