GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:19:30 Sequence gi568815592r:16206332_16428310 : 221979 bp : 45.31% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 2079 2074 6 1.05 1.02 Term - 10719 10423 297 0 0 -38 39 219 0.476 -0.33 1.01 Init - 11929 11843 87 1 0 66 70 48 0.532 1.49 1.00 Prom - 17345 17306 40 0.24 2.00 Prom + 19477 19516 40 -4.26 2.01 Init + 20655 20926 272 2 2 76 -8 210 0.417 4.64 2.02 Intr + 22379 22485 107 2 2 35 30 138 0.264 2.56 2.03 Intr + 32346 32449 104 1 2 71 59 166 0.103 11.89 2.04 Intr + 40511 40630 120 1 0 101 94 218 0.995 24.39 2.05 Intr + 43953 44036 84 2 0 101 96 91 0.753 11.22 2.06 Intr + 46226 46357 132 1 0 65 36 75 0.481 0.84 2.07 Intr + 48231 48404 174 2 0 63 115 136 0.792 13.94 2.08 Term + 50984 51010 27 1 0 91 41 10 0.050 -5.33 2.09 PlyA + 53124 53129 6 1.05 3.03 PlyA - 53497 53492 6 1.05 3.02 Term - 56132 55936 197 0 2 59 44 142 0.504 4.47 3.01 Init - 58202 58010 193 2 1 72 119 119 0.740 12.54 3.00 Prom - 58292 58253 40 -4.66 4.00 Prom + 63162 63201 40 -3.46 4.01 Init + 64398 64487 90 2 0 52 70 26 0.357 -2.31 4.02 Intr + 68084 68165 82 1 1 101 110 82 0.624 10.91 4.03 Intr + 72453 72559 107 1 2 72 65 140 0.990 10.03 4.04 Intr + 79462 79504 43 0 1 60 87 26 0.065 -2.49 4.05 Intr + 84131 84290 160 0 1 56 94 189 0.321 15.35 4.06 Term + 88675 88855 181 1 1 53 40 262 0.986 14.98 4.07 PlyA + 92394 92399 6 1.05 5.03 PlyA - 92804 92799 6 1.05 5.02 Term - 100528 99998 531 1 0 150 44 479 0.954 43.95 5.01 Init - 121979 120063 1917 2 0 56 97 2479 0.648 233.13 5.00 Prom - 127109 127070 40 -5.46 6.00 Prom + 133265 133304 40 -6.66 6.01 Init + 136059 136089 31 2 1 85 40 49 0.448 -0.25 6.02 Intr + 137090 137230 141 0 0 31 48 135 0.481 4.12 6.03 Term + 140855 141003 149 0 2 103 37 121 0.400 6.56 6.04 PlyA + 141477 141482 6 1.05 7.00 Prom + 150597 150636 40 -2.46 7.01 Init + 152239 152280 42 0 0 66 13 126 0.031 1.38 7.02 Intr + 165927 166052 126 2 0 63 75 66 0.533 3.68 7.03 Term + 168664 168906 243 0 0 84 48 112 0.811 2.70 7.04 PlyA + 170369 170374 6 1.05 8.07 PlyA - 172212 172207 6 1.05 8.06 Term - 173849 173658 192 2 0 64 38 66 0.007 -3.38 8.05 Intr - 183337 183320 18 1 0 100 93 19 0.001 0.41 8.04 Intr - 200634 200405 230 1 2 83 93 87 0.664 6.29 8.03 Intr - 202145 202041 105 0 0 108 73 40 0.653 4.69 8.02 Intr - 217275 217121 155 2 2 130 25 44 0.035 2.02 8.01 Intr - 219568 219479 90 0 0 51 33 121 0.141 2.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:16206332_16428310|GENSCAN_predicted_peptide_1|127_aa MQDWQAAPFTAPVCRHTVRDWQAAPPATPLRPGEKSIERSASGPALLGDPAHLPQPLARV LSPSARGRQGRQAVPSEGPAEPTPNRNSRWPASTPRSPGFRPRLSRHTFPQAEEADSGLG QPRKGVP >gi568815592r:16206332_16428310|GENSCAN_predicted_CDS_1|384_bp atgcaggactggcaggcagctccatttacagccccggtgtgcaggcacacggtgcgggac tggcaggcagctccaccagcaaccccgctaaggcctggcgagaaatcgattgagcgcagc gccagtgggccggcgctgctgggggacccagcacaccttccgcagccgctggcccgggtg ctaagcccctctgcccggggccgccagggccgacaggccgttccgagtgaggggcctgcc gagcccacaccgaaccggaactcgcgctggccggcaagcaccccgcgcagccccggtttc cgcccgcgcctttcccgccacactttcccgcaagctgaggaagccgactccggccttggc cagcccaggaagggggtcccttag >gi568815592r:16206332_16428310|GENSCAN_predicted_peptide_2|339_aa MVGCRSPAQPSPAPPRPALRGGSKGPARNRAQRRWAGTAGGPSPLSAPAGPGAKPLIAPG WQPARRSECGARHAHSHPELVLARKHRAQPGGVAISEKLGIQIHRILLPSTSNVTMGSWR PDEKEPARSRCTMPRIDADLKLDFKDVLLRPKRSSLKSRAEVDLERTFTFRNSKQTYSGI PIIVANMDTVGTFEMAAVMSQHSMFTAIHKHYSLDDWKLFATNHPECLQGVFGNVLLTGT VAFITKCDTSTYYTARHQIKGEFKRDNVKNLVANVAVSSGSGQNDLEKMTSILEAVPQVK FICLDVANGYSEHFVEFVKLVRAKFPEHTIMAVHLYPSS >gi568815592r:16206332_16428310|GENSCAN_predicted_CDS_2|1020_bp atggtgggctgcaggtccccagcccagcccagccccgccccgccccgccccgccctgcgg ggaggcagtaaaggcccggcgagaaatcgagcgcagcgccggtgggccggcactgctggg ggacccagcccactctccgcacctgctggcccaggtgctaagcccctcattgccccgggc tggcagccggccaggcgctccgagtgcggggcccgccatgcccacagccacccggaactc gtgctggcccgcaagcaccgcgcgcagcccggtggcgtagccatcagtgagaaacttgga atccagatacacagaatcctgctgccttcaacttctaatgtgaccatgggctcctggaga cctgatgaaaaggagcccgccaggagccgctgcaccatgccccgcatagatgcggacctc aagctcgacttcaaggatgtcctgctccgacctaagcggagcagcctcaagagccgagcc gaggtggatcttgaacgcaccttcacgtttcgaaattcaaagcagacctactcagggatt cccatcatcgtggccaacatggacactgtgggcacgtttgagatggcagccgtgatgtca cagcactccatgtttacagcaattcataagcattactccctggatgactggaagctcttt gccacaaatcacccagaatgcctgcagggagtttttggcaatgtattgttaactggaact gtggctttcattaccaagtgtgatacatcgacttactacacagcccgccaccaaataaaa ggcgaattcaaaagggataatgttaaaaacttagtagcaaatgtagccgtgagttcaggc agtgggcagaatgatctggaaaagatgaccagcatcctggaagctgtgccacaggttaag tttatttgcctggatgtggccaatgggtattcagaacattttgtggaattcgtgaaactt gtccgtgccaaatttcctgaacacaccattatggctgttcatctgtatccttcatcatag >gi568815592r:16206332_16428310|GENSCAN_predicted_peptide_3|129_aa MLLTQSLFGGLFTRTHMKFGAVTQIGGPPLGDQSPVLLLFAKDPPTTSGPRTDQPKKHLT NFKSGACYKCQKSGHQAKECLQPRIPPKLSPICAGPHWKSDCLTHLAATPSAPGILAQGS LTASHIFLA >gi568815592r:16206332_16428310|GENSCAN_predicted_CDS_3|390_bp atgttgctcacacaaagcctgtttggtggtctcttcacacggacgcacatgaaatttggt gccgtgactcagatcgggggacctcccttgggagatcaatcccctgtcctcctgctcttt gcgaaagatccacctacgacctcaggtcctcggaccgaccagcccaagaaacatctcacc aatttcaaatccggagcttgctacaagtgccagaaatctggccaccaggccaaggaatgc ctgcagcccaggattcctcctaagctgagtcccatctgtgcgggaccccactggaaatcg gactgtctaactcacctggcagccactcccagcgcccctggaattctggcccaaggctct ctgactgcttcccacatcttcttggcttag >gi568815592r:16206332_16428310|GENSCAN_predicted_peptide_4|220_aa MVEAFHNHFGFFQTVAVTTFGPWTDSAATQAGNVVTGEMVEELILSGADIIKVGVGPGSV CTTRTKTGVGYPQLSAVIECADSAHGLKGHIISDGGCTCPGDVAKAFGAGADFVMLGGMF SGHTECAGEVFERNGRKLKLFYGMSSDTAMNKHAGGVAEYRASEGKTVEVPYKGDVENTI LDILGGLRSTCTYVGAAKLKELSRRATFIRVTQQHNTVFS >gi568815592r:16206332_16428310|GENSCAN_predicted_CDS_4|663_bp atggttgaggcatttcataatcactttgggtttttccagacagtagcagtgaccacattt ggaccatggacagattcagcagccacacaggcagggaacgtggtgacaggagaaatggta gaagagcttattctttccggagcagatatcatcaaagtgggagttggaccaggttctgtg tgcaccacccgcaccaagacgggagtggggtacccccagctgagtgccgtcattgagtgt gccgactctgcccatggcctgaagggccacatcatctctgatggaggctgtacgtgtcca ggggatgtcgccaaagcctttggagctggagcagattttgtcatgctgggaggaatgttt tcgggtcatacggagtgtgctggagaagtgtttgagaggaacggacggaagctcaagctc ttctacgggatgagctctgacaccgccatgaacaagcacgcaggaggagttgctgagtac agagcctctgagggtaagactgtggaagttccttacaaaggagatgtggaaaacactatc ctggatattctcgggggactgaggtccacgtgcacctacgtgggggccgccaaactcaag gagctcagcaggagggcaacattcatccgggtgacccagcagcacaacaccgtgttcagc taa >gi568815592r:16206332_16428310|GENSCAN_predicted_peptide_5|815_aa MKSNQERSNECLPPKKREIPATSRSSEEKAPTLPSDNHRVEGTAWLPGNPGGRGHGGGRH GPAGTSVELGLQQGIGLHKALSTGLDYSPPSAPRSVPVATTLPAAYATPQPGTPVSPVQY AHLPHTFQFIGSSQYSGTYASFIPSQLIPPTANPVTSAVASAAGATTPSQRSQLEAYSTL LANMGSLSQTPGHKAEQQQQQQQQQQQQHQHQQQQQQQQQQQQQQHLSRAPGLITPGSPP PAQQNQYVHISSSPQNTGRTASPPAIPVHLHPHQTMIPHTLTLGPPSQVVMQYADSGSHF VPREATKKAESSRLQQAIQAKEVLNGEMEKSRRYGAPSSADLGLGKAGGKSVPHPYESRH VVVHPSPSDYSSRDPSGVRASVMVLPNSNTPAADLEVQQATHREASPSTLNDKSGLHLGK PGHRSYALSPHTVIQTTHSASEPLPVGLPATAFYAGTQPPVIGYLSGQQQAITYAGSLPQ HLVIPGTQPLLIPVGSTDMEASGAAPAIVTSSPQFAAVPHTFVTTALPKSENFNPEALVT QAAYPAMVQAQIHLPVVQSVASPAAAPPTLPPYFMKGSIIQLANGELKKVEDLKTEDFIQ SAEISNDLKIDSSTVERIEDSHSPGVAVIQFAVGEHRAQVSVEVLVEYPFFVFGQGWSSC CPERTSQLFDLPCSKLSVGDVCISLTLKNLKNGSVKKGQPVDPASVLLKHSKADGLAGSR HRYAEQENGINQGSAQMLSENGELKFPEKMGLPAAPFLTKIEPSKPAATRKRRWSAPESR KLEKSEDEPPLTLPKPSLIPQEVKICIEGRSNVGK >gi568815592r:16206332_16428310|GENSCAN_predicted_CDS_5|2448_bp atgaaatccaaccaagagcggagcaacgaatgcctgcctcccaagaagcgcgagatcccc gccaccagccggtcctccgaggagaaggcccctaccctgcccagcgacaaccaccgggtg gagggcacagcatggctcccgggcaaccctggtggccggggccacgggggcgggaggcat gggccggcagggacctcggtggagcttggtttacaacagggaataggtttacacaaagca ttgtccacagggctggactactccccgcccagcgctcccaggtctgtccccgtggccacc acgctgcctgccgcgtacgccaccccgcagccagggaccccggtgtcccccgtgcagtac gctcacctgccgcacaccttccagttcattgggtcctcccaatacagtggaacctatgcc agcttcatcccatcacagctgatccccccaaccgccaaccccgtcaccagtgcagtggcc tcggccgcaggggccaccactccatcccagcgctcccagctggaggcctattccactctg ctggccaacatgggcagtctgagccagacgccgggacacaaggctgagcagcagcagcag cagcagcagcagcagcagcagcagcatcagcatcagcagcagcagcagcagcagcagcag cagcagcagcagcagcacctcagcagggctccggggctcatcaccccggggtccccccca ccagcccagcagaaccagtacgtccacatttccagttctccgcagaacaccggccgcacc gcctctcctccggccatccccgtccacctccacccccaccagacgatgatcccacacacg ctcaccctggggcccccctcccaggtcgtcatgcaatacgccgactccggcagccacttt gtccctcgggaggccaccaagaaagctgagagcagccggctgcagcaggccatccaggcc aaggaggtcctgaacggtgagatggagaagagccggcggtacggggccccgtcctcagcc gacctgggcctgggcaaggcaggcggcaagtcggttcctcacccgtacgagtccaggcac gtggtggtccacccgagcccctcagactacagcagtcgtgatccttcgggggtccgggcc tctgtgatggtcctgcccaacagcaacacgcccgcagctgacctggaggtgcaacaggcc actcatcgtgaagcctccccttctaccctcaacgacaaaagtggcctgcatttagggaag cctggccaccggtcctacgcgctctcaccccacacggtcattcagaccacacacagtgct tcagagccactcccggtgggactgccagccacggccttctacgcagggactcaaccccct gtcatcggctacctgagcggccagcagcaagcaatcacctacgccggcagcctgccccag cacctggtgatccccggcacacagcccctgctcatcccggtcggcagcactgacatggaa gcgtcgggggcagccccggccatagtcacgtcatccccccagtttgctgcagtgcctcac acgttcgtcaccaccgcccttcccaagagcgagaacttcaaccctgaggccctggtcacc caggccgcctacccagccatggtgcaggcccagatccacctgcctgtggtgcagtccgtg gcctccccggcggcggctccccctacgctgcctccctacttcatgaaaggctccatcatc cagttggccaacggggagctaaagaaggtggaagacttaaaaacagaagatttcatccag agtgcagagataagcaacgacctgaagatcgactccagcaccgtagagaggattgaagac agccatagcccgggcgtggccgtgatacagttcgccgtcggggagcaccgagcccaggtc agcgttgaagttttggtagagtatcctttttttgtgtttggacagggctggtcatcctgc tgtccggagagaaccagccagctctttgatttgccgtgttccaaactctcagttggggat gtctgcatctcgcttaccctcaagaacctgaagaacggctctgttaaaaagggccagccc gtggatcccgccagcgtcctgctgaagcactcaaaggccgacggcctggcgggcagcaga cacaggtatgccgagcaggaaaacggaatcaaccaggggagtgcccagatgctctctgag aatggcgaactgaagtttccagagaaaatgggattgcctgcagcgcccttcctcaccaaa atagaacccagcaagcccgcggcaacgaggaagaggaggtggtcggcgccagagagccgc aaactggagaagtcagaagacgaaccacctttgactcttcctaagccttctctaattcct caggaggttaagatttgcattgaaggccggtctaatgtaggcaagtag >gi568815592r:16206332_16428310|GENSCAN_predicted_peptide_6|106_aa MGKREPMCIVAPRSLICLELQGSLRYLLALACIPGLDVRASFKPLKLYALSHIRKGDSPS PIDHPRTEECGRTALAGSSTYCLCEDPLGEASWAPESGGALENLYV >gi568815592r:16206332_16428310|GENSCAN_predicted_CDS_6|321_bp atggggaaacgagaacctatgtgcatcgttgcgccccgctccctcatctgcttggagcta caaggatctctgagatacctgctggcattagcctgtattccaggactggacgtcagggca tcctttaagcccttaaagctgtatgccctttctcacatccgcaaaggggactcgcccagt cccatcgaccacccaaggactgaggagtgcgggcgcacggcactggcaggcagctccacc tactgcctctgtgaagatccactaggtgaagccagctgggctcctgagtctggtggggcc ttggagaacctttatgtctag >gi568815592r:16206332_16428310|GENSCAN_predicted_peptide_7|136_aa MMAVAAAIKLAARLTQGQTNKKEGKREIKRSQRYGQKQEEGENGKEVEREQKRSTQNSAW ASKEKTPTHPPTPAALQKCCRVKTMKKGKAEESRGILPSPSKDSLDVFRPYRRIKRRTPR RKESQKLWHDQEGTWI >gi568815592r:16206332_16428310|GENSCAN_predicted_CDS_7|411_bp atgatggcagtggctgctgccatcaagctggctgcgcggctgacccagggccagaccaat aagaaagaagggaaaagggaaattaaaagaagccagagatatggacagaaacaagaggaa ggagagaatggaaaagaagttgagagggaacagaaaagaagtacacagaactcagcttgg gctagcaaagaaaagacgcccactcacccacctaccccagcagctttgcagaagtgctgc agggtgaagacaatgaagaaaggcaaggctgaagaaagccgaggcattctgccctctccc agcaaagacagcctggacgtgttcagaccctatcgccgcatcaaaagaagaacaccaagg agaaaggagtcacaaaagctctggcatgatcaggaaggcacctggatttga >gi568815592r:16206332_16428310|GENSCAN_predicted_peptide_8|263_aa XVHTLFVLEQRVICYGMWNTDPDIAAVFVVEPPPHWGASLQPVARSCKILSSVKRAPKTP NPGPVEETIAALISHVTSLRAHGFSLDEGKYLRQYFFNILNNGMDIEMGEISVKAGFERV HLPAVLPQEYTEQRRQGHYNLTRPPYCYVQFPLAGMGPHILYLSRMASSLELFKSSKSRR EQRKEEVTCEMLRKKENEQKRESDALFLRICDLGRLQMSQASDVHLDCILTKIREVPAGM LRTSFEENPVAKPIHCPWERNIL >gi568815592r:16206332_16428310|GENSCAN_predicted_CDS_8|792_bp nnggtgcacactttgtttgttctggaacagagggtcatctgctacgggatgtggaacact gaccctgatattgcagccgtgtttgtggttgagcctccacctcactggggagcgtcattg cagccagtggcaagaagctgcaaaattctctcatctgtgaagagagcccctaaaacccca aatcccggaccagtagaggaaaccattgctgccctaatatcacatgttacaagtcttcga gcccatgggttctcacttgatgaaggaaagtatttaagacaatacttctttaacatactt aataatggaatggatatagaaatgggagaaatcagtgtgaaagcaggctttgaaagggta cacttgccagcagttttgccacaagagtacacggaacaaaggagacagggtcattataac ctgacgcgtccaccctactgctatgtccagtttccattggctggaatgggacctcacatt ctgtatttgtcccgaatggctagcagcttagaactttttaaaagcagcaaaagcagaaga gaacaaaggaaggaggaagtaacttgtgaaatgctgagaaagaaggaaaatgagcagaag agggaatctgatgccttgtttctgcgcatctgtgatctgggcagattacagatgtcacaa gccagtgatgtacacctggattgcatcctgaccaaaatcagggaagtacccgctggcatg ttgaggacctcctttgaagaaaatcctgtggctaaaccaattcattgcccttgggagaga aacattctatag