GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:31:51 Sequence gi568815594r:22627412_22827950 : 200539 bp : 38.13% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 706 701 6 1.05 1.03 Term - 3348 3221 128 1 2 28 49 108 0.214 -1.64 1.02 Intr - 7708 7622 87 2 0 47 80 69 0.131 1.02 1.01 Init - 14354 14057 298 2 1 52 123 130 0.867 10.33 1.00 Prom - 20239 20200 40 -4.85 2.07 PlyA - 20306 20301 6 1.05 2.06 Term - 34426 34296 131 2 2 95 54 73 0.154 1.96 2.05 Intr - 51209 51079 131 1 2 87 9 204 0.419 11.82 2.04 Intr - 52134 51997 138 2 0 86 55 88 0.842 3.96 2.03 Intr - 57194 56994 201 1 0 55 16 177 0.678 4.68 2.02 Intr - 60662 60515 148 0 1 72 25 89 0.307 -0.63 2.01 Init - 62628 62592 37 0 1 72 110 27 0.719 3.66 2.00 Prom - 65971 65932 40 -3.25 3.05 PlyA - 66392 66387 6 1.05 3.04 Term - 70421 70327 95 0 2 117 36 120 0.908 6.71 3.03 Intr - 81807 81681 127 0 1 65 77 277 0.067 23.63 3.02 Intr - 85351 85294 58 0 1 87 45 77 0.045 1.27 3.01 Init - 100539 99980 560 0 2 69 89 419 0.350 34.61 3.00 Prom - 104028 103989 40 -6.35 4.03 PlyA - 104494 104489 6 1.05 4.02 Term - 106281 106117 165 0 0 109 38 89 0.886 2.93 4.01 Init - 107319 107254 66 0 0 104 70 59 0.750 6.82 4.00 Prom - 107807 107768 40 -8.55 5.00 Prom + 111312 111351 40 -6.65 5.01 Init + 116065 116074 10 0 1 91 58 24 0.324 0.37 5.02 Intr + 119885 120681 797 0 2 102 94 615 0.959 53.75 5.03 Intr + 121426 121549 124 0 1 96 76 58 0.990 4.64 5.04 Term + 124771 124937 167 2 2 30 38 179 0.583 4.20 5.05 PlyA + 125267 125272 6 1.05 6.07 PlyA - 125515 125510 6 -0.45 6.06 Term - 125880 125749 132 0 0 117 34 44 0.140 -0.99 6.05 Intr - 129611 129502 110 0 2 -12 85 133 0.129 2.08 6.04 Intr - 132224 132147 78 0 0 69 100 31 0.149 1.00 6.03 Intr - 139364 139292 73 2 1 102 20 64 0.218 -0.94 6.02 Intr - 140128 140048 81 1 0 30 90 97 0.377 3.02 6.01 Init - 155096 154950 147 2 0 73 37 137 0.723 7.24 6.00 Prom - 161945 161906 40 -3.75 7.02 PlyA - 162319 162314 6 1.05 7.01 Sngl - 183896 183702 195 2 0 63 32 173 0.602 4.11 7.00 Prom - 187128 187089 40 -4.55 8.02 PlyA - 187191 187186 6 1.05 8.01 Term - 191506 191306 201 1 0 82 44 165 0.534 7.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:22627412_22827950|GENSCAN_predicted_peptide_1|170_aa MLSLEKSKSRGASANKTQLVESLKKTTSGSTNTLKQHQEQWHDVCRQREPEPTRRLLSAP RSMKKKTIQKCIMILRGSGKYMRRDTKPEKTFRWIKQIFAIGRSMGPGTLEQGAALIGEA RSAREPTAGHSDCCVEDLLGSGKGRMGDWNTEDRLEEILPLIERADKNFN >gi568815594r:22627412_22827950|GENSCAN_predicted_CDS_1|513_bp atgctgagtttggagaaatccaagagcagaggagcttcagccaataaaacccagttggta gagagtcttaagaagacaaccagtggcagcaccaataccctgaagcagcatcaggaacaa tggcatgatgtatgcaggcagagagaacctgagcctacaaggaggcttttgtcagctcca aggagcatgaaaaaaaaaaccatccagaagtgtatcatgatcctgagaggatcagggaag tatatgagaagagacaccaaaccagaaaagacctttcgttggataaagcagatcttcgcc attgggcggtccatgggaccaggcactttggagcagggggctgcgctcatcggtgaggcg cggtccgcgcgggagcccaccgcgggtcattctgactgctgtgtggaggatttattggga agtgggaaggggaggatgggagactggaatacagaagaccggttagaagaaatattgccc ctgatcgagagagctgataagaacttcaactag >gi568815594r:22627412_22827950|GENSCAN_predicted_peptide_2|261_aa MALRHEFAAIISEESLFSDTMSLEKKGSGREREEGAGAKGTSTCSGWCNGLFIMISCIYQ ERSRVEKRNGEAEGRIYRKLWDVALLCCVQEVINGERKPLRALTEVKSGFRLREQTRILL RGLHQKNGSGQRKGTPSFRIIKYSTKKALFMEFGEESSVRVMESFEMEMILGVIYAGITD VSHCASPLHSKKEEEKKRKKKKKKEEEEKEEEKEEEKKKVRRRMLSLTLKTDYPLKETSI GENINWLLESTAMRIKESGMK >gi568815594r:22627412_22827950|GENSCAN_predicted_CDS_2|786_bp atggctctaaggcatgaatttgcagccatcatctcagaggagagcctcttcagtgatact atgtcattggagaagaaaggaagtggaagggaacgtgaagaaggggcaggtgcaaagggt acatctacttgcagtggttggtgcaatggattattcatcatgatttcctgcatttaccaa gaaagatccagagtagagaaacgaaatggagaggcagagggaagaatctaccgaaaacta tgggatgttgccctgctctgttgtgtccaggaggtcattaatggggaaaggaaacctctg agagcactgactgaagtgaaatcgggatttaggctcagggagcagacaaggattctcctc agaggacttcatcagaaaaatggaagtggtcagaggaaaggaacaccttcattcagaatc ataaaatatagcactaaaaaggctctgttcatggaatttggagaggaatcctcagttaga gtcatggagtcttttgagatggaaatgatattaggggtcatttatgctgggattacagat gtgagccattgtgccagcccactccattccaagaaggaggaggagaagaagaggaagaag aagaagaagaaggaggaggaggagaaagaggaggagaaggaggaggagaaaaaaaaagtt agaagaagaatgttgagcttaacattgaagacagattaccctttaaaagaaacatctatc ggagaaaacattaattggttgctggaaagcacggctatgagaataaaagaatctggaatg aaatga >gi568815594r:22627412_22827950|GENSCAN_predicted_peptide_3|279_aa MQTIKCVVVGDGAVSKTCLLISYTTNKFPSEYVPTVFDNYAVTVMIGGEPYTLGLFDTAG QEDYDRLRPLSYPQTDVFLVCFSVVSPSSFENVKEKWVPEITHHCPKTPFLLVGTQIDLR DDPSTIEKPAKNKQKPITPETAEKLARDLKAVKYVECSALTKKGLKNVFDEAILAALEPP EPKKSRRLAGGLAAMPRQPSDLSDLWRIQTNMPPDGNNDEDDEKDDDDDDDDDDNGGGNS GDKNYDGKGGPDKEHICLDIPLLEEYQEQPRDQVSDCDV >gi568815594r:22627412_22827950|GENSCAN_predicted_CDS_3|840_bp atgcagacaattaagtgtgttgttgtgggcgatggtgctgttagtaaaacatgtctcctg atatcctacacaacaaacaaatttccatcggaatatgtaccgactgtttttgacaactat gcagtcacagttatgattggtggagaaccatatactcttggactttttgatactgcaggg caagaggattatgacagattacgaccgctgagttatccacaaacagatgtatttctagtc tgtttttcagtggtctctccatcttcatttgaaaatgtgaaagaaaagtgggtgcctgag ataactcaccactgtccaaagactccttttttgcttgttgggactcaaattgatctcaga gatgacccctctactattgagaaacctgccaagaacaaacagaagcctatcactccagag actgctgaaaagctggcccgtgacctgaaggctgtcaagtatgtggagtgttctgcactt acaaagaaaggcctaaagaatgtatttgacgaagcaatattggctgccctggagcctcca gaaccgaaaaagagccgcagactagctggcggcctggctgcaatgcctcgccagcccagt gatctctcagacctgtggaggatccaaactaacatgccacctgatggcaacaatgatgaa gacgatgagaaggatgatgatgatgatgatgatgatgatgataacggtggtggtaatagt ggtgataaaaattatgatggtaaaggtggtccagacaaggagcatatctgtctggatatt cctcttcttgaagagtaccaagagcagccaagagatcaagtctcagactgcgatgtttaa >gi568815594r:22627412_22827950|GENSCAN_predicted_peptide_4|76_aa MRIQKRLLKGALTKAKVVIDIKVEGKEDKRCTRTSLFTTDTSYFFVESGNSFQTLVTVSS IFGATHNTTATDMTHF >gi568815594r:22627412_22827950|GENSCAN_predicted_CDS_4|231_bp atgaggatacagaagaggctgctaaaaggagctctgacaaaagcaaaagtggtcattgat atcaaggtagaaggaaaagaagacaaaagatgtaccagaacttcactttttaccactgac acgagctacttctttgttgaatcaggtaacagttttcaaacactggtaactgtctcctca atttttggtgctactcacaacaccacagccacagacatgacacacttttaa >gi568815594r:22627412_22827950|GENSCAN_predicted_peptide_5|365_aa MNAGIDYYNKIIDDLLKNGVTPIVTLYHFDLPQTLEDQGGWLSEAIIESFDKYAQFCFST FGDRVKQWITINEANVLSVMSYDLGMFPPGIPHFGTGGYQAAHNLIKAHARSWHSYDSLF RKKQKGMVSLSLFAVWLEPADPNSVSDQEAAKRAITFHLDLFAKPIFIDGDYPEVVKSQI ASMSQKQGYPSSRLPEFTEEEKKMIKGTADFFAVQYYTTRLIKYQENKKGELGILQDAEI EFFPDPSWKNVDWIYVVPWGVCKLLKYIKDTYNNPVIYITENGFPQSDPAPLDDTQRWEY FRQTFQELFKVLHAVFEEEPTGSVTLSKAQTPYILVLIKRAELNLLINFLRLLRGNADDL GCQEL >gi568815594r:22627412_22827950|GENSCAN_predicted_CDS_5|1098_bp atgaacgctggaattgattattacaacaagatcatcgatgatttgttaaaaaatggggtt actcccattgtgaccctctaccactttgatttgcctcagactttagaagaccaaggaggt tggttgtcagaggcaatcattgaatcctttgacaaatatgctcagttttgcttcagtacc tttggggatcgtgtcaagcagtggatcaccataaatgaagctaatgttctttctgtgatg tcatatgacttaggtatgtttcctccgggtatccctcactttgggactggaggttatcag gcagctcataatttgattaaggctcatgccagatcctggcacagctatgattccttattt cgaaaaaagcagaaaggtatggtgtctctatcactttttgcggtctggttggaaccagca gatcccaactcagtgtctgaccaggaagctgctaaaagagccatcactttccatctggat ttatttgctaaacccatattcatcgatggtgattatcctgaagttgtcaagtctcagatt gcctccatgagtcaaaagcaaggctatccatcatcgaggcttccagaattcactgaagaa gagaagaaaatgatcaaaggcactgctgatttttttgctgtgcaatattatacaactcgc ttaatcaagtaccaggagaacaagaaaggagaactaggtattctccaggatgcggaaatt gaattttttccagatccatcttggaaaaatgtggattggatctacgtggtaccatgggga gtatgtaaactactgaaatatattaaggatacatataataaccctgtaatttacatcact gagaatgggtttccccagagtgacccagcgcctcttgatgacactcaacgctgggagtat ttcagacaaacatttcaggaactgttcaaagtgcttcatgctgtctttgaagaagagccg acaggttctgtgaccttgagcaaagctcagacgccctatatcctagttctcatcaaaaga gcagagctgaacttgctaatcaattttctgagactcctccgaggtaatgctgatgatctt ggttgccaagaactttaa >gi568815594r:22627412_22827950|GENSCAN_predicted_peptide_6|206_aa MHKNSGNAMERHREKAATHSLSSVQAKEKAFTRHQSCQHLDLELPTSSTNLGLEDDHIQT TAVPTTVPIPDDSKCTAKPSSEDQWNLDEHELISVYWDLQLTLLNLSLKCPSDSMNSKGK LSQAKKDREDNGFQLKRTRKDSLMCDQAASHMGNSMKPVLVKQGNNEWRCCFLLSAIHTR KSQRTNVITHIQKFLKKEEKHRNMGK >gi568815594r:22627412_22827950|GENSCAN_predicted_CDS_6|621_bp atgcacaagaacagcggaaatgccatggaaagacacagggagaaggcagccactcacagc ctttcctctgtgcaagccaaggagaaagccttcaccagacaccaatcctgccagcacctt gatcttgaacttccaacctccagcactaatttaggactggaagatgatcacatccagaca acagctgtgcctacaacagtccctatccccgacgacagcaagtgcacggcaaagccatct agtgaggaccaatggaacttggatgaacatgaacttatttcagtgtattgggacttacag ctaactttgttaaacttgagtctgaagtgcccttctgactctatgaattccaagggaaaa ctttctcaagcaaagaaagacagagaagataatggtttccagttaaaacgcacaaggaaa gattctctgatgtgtgatcaagctgcttctcacatgggcaactccatgaaacctgttctt gtgaagcagggaaataatgaatggaggtgctgctttctcctcagtgcaattcacactaga aaaagccagaggaccaatgttataacacacatccagaagttcctcaaaaaggaagagaaa cacagaaatatggggaaatag >gi568815594r:22627412_22827950|GENSCAN_predicted_peptide_7|64_aa MRSSWKTVCPKSNGKCPQKIRERAKTRRKEDNVETEAEIGVIHLQAKEHQGLPAATRSQY RSME >gi568815594r:22627412_22827950|GENSCAN_predicted_CDS_7|195_bp atgagatcatcctggaagacagtgtgccctaaatccaatggcaagtgtcctcagaagata agagaaagggcaaagacacgcaggaaagaagacaacgtggagacagaggcagagattgga gtgatacatctgcaagccaaggaacaccaagggttgccagcagccaccagaagccagtat agaagcatggagtaa >gi568815594r:22627412_22827950|GENSCAN_predicted_peptide_8|66_aa MCFKAIVSDDLGLFLGRCVRDSGSSWVFKINVEETKPAAVSLVPLKVIQKRPCTIYLKID FIKLDS >gi568815594r:22627412_22827950|GENSCAN_predicted_CDS_8|201_bp atgtgcttcaaggccattgtttcggatgatcttggcttattccttggccgatgtgtaagg gactcggggtctagctgggtcttcaaaatcaacgtggaagagaccaaaccggctgctgta tccctggttccactcaaagttatccagaagagaccatgcacaatatacttgaagattgac tttatcaagttggatagctga