GENSCAN 1.0	Date run:  3-Nov-116	Time: 08:35:43

Sequence gi568815597r:53667973_53938261 : 270289 bp : 44.32% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.11 PlyA -    995    990    6                               1.05
 1.10 Term -   4089   3983  107  1  2   92   49    25 0.470  -2.43
 1.09 Intr -   7234   7122  113  0  2   39   94   111 0.754   6.82
 1.08 Intr -  10304  10190  115  0  1   44   41   148 0.789   5.21
 1.07 Intr -  12120  12034   87  1  0   87   79    25 0.560   1.44
 1.06 Intr -  14317  14224   94  1  1   55   77    85 0.761   3.64
 1.05 Intr -  17017  16989   29  2  2   84   86    -1 0.297  -2.87
 1.04 Intr -  19660  19450  211  1  1  105   26    98 0.225   3.79
 1.03 Intr -  21592  21474  119  2  2   83   94    -6 0.556  -0.32
 1.02 Intr -  25732  25598  135  2  0   98   64    96 0.887   8.74
 1.01 Init -  51207  51096  112  0  1   77   80    81 0.136   6.57
 1.00 Prom -  64576  64537   40                              -4.86

 2.03 PlyA -  65016  65011    6                               1.05
 2.02 Term -  70134  69697  438  0  0  113   33   293 0.884  21.58
 2.01 Init -  71351  71133  219  2  0   73   78   279 0.998  22.03
 2.00 Prom -  90452  90413   40                              -3.06

 3.04 PlyA -  93386  93381    6                               1.05
 3.03 Term -  95699  95618   82  1  1   76   42    38 0.072  -4.83
 3.02 Intr - 104517 104357  161  0  2   88  100   111 0.562  11.09
 3.01 Init - 170595 170233  363  0  0   88   77   195 0.763  13.53
 3.00 Prom - 170746 170707   40                              -5.16

 4.00 Prom + 174397 174436   40                              -6.76
 4.01 Init + 185780 185849   70  1  1   61   97    53 0.811   4.74
 4.02 Intr + 185933 185997   65  0  2  120   96   -47 0.664  -2.26
 4.03 Intr + 187191 187377  187  2  1   52   92   120 0.811   8.06
 4.04 Term + 189026 189165  140  0  2   37   41   125 0.866   0.83
 4.05 PlyA + 189704 189709    6                               1.05

 5.07 PlyA - 190493 190488    6                               1.05
 5.06 Term - 192181 192092   90  1  0  143   39    39 0.945   2.22
 5.05 Intr - 198410 198228  183  2  0  125   52    90 0.772   9.08
 5.04 Intr - 198952 198786  167  2  2   79   92    95 0.255   8.68
 5.03 Intr - 203516 203400  117  0  0  118   81    99 0.657  12.64
 5.02 Intr - 205278 205162  117  1  0   94   34    90 0.572   4.64
 5.01 Init - 206302 206275   28  1  1   94   91   -21 0.267  -1.14
 5.00 Prom - 206403 206364   40                              -4.36

 6.00 Prom + 210570 210609   40                              -3.26
 6.01 Init + 226239 226575  337  2  1  104   77   290 0.464  25.64
 6.02 Intr + 238123 238322  200  2  2   92   36   203 0.513  14.57
 6.03 Term + 241959 242027   69  2  0   98   33    66 0.294   0.04
 6.04 PlyA + 244030 244035    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 108186 108184    3  0  0  113   81     0 0.911   1.80
S.002 Sngl + 147634 148023  390  0  0   54   32   182 0.905   5.42

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:53667973_53938261|GENSCAN_predicted_peptide_1|373_aa
MRRGGCIVSANDGAEAVFKDLGSGRFQTDPAQESFLPVPPERLLYDAHTLGDPEAGTACN
QIAFLPTELMLLREWKFGSVGGGRKVTVQILLGLPVSVVTCSGLAGSQLRMFLGGKIHGT
QQLGGPIACKQTLWFGEENAQNHPTAGGPSGSSSALTGERGCTRVFHEELLWGLNCARPD
TGLVKRGGLVGPASELWLISPPVSAVSAAACWMAEGVKNAWLQAGLELESESGDIWAGDW
GSGEFDLAPDLEEFKDMWPGWGERFFVQGGPLGICGELQLSVALQQGWQAPQGTSVGDII
RVWERGYFGEFQDAGVIRTLTRNPSQAKLVAADNQLSPVGHNQLGQGLLCIPGYSSQREK
LRISSSYSRRKWE

>gi568815597r:53667973_53938261|GENSCAN_predicted_CDS_1|1122_bp
atgaggagaggtggctgcattgtttctgccaacgatggggcagaggcagtgttcaaggac
ctgggcagtggtcgttttcaaactgatcctgcccaagagagcttcttgccagtgccgcca
gagcgcctgctttatgatgcccacacccttggcgaccccgaggccgggactgcttgtaat
cagattgccttcctccccactgagctcatgctgctgagggaatggaaatttggatctgtg
ggcgggggcagaaaggtcacagtgcagatcttgctgggactccctgtctctgtggtcact
tgctctggtctggccggctctcagctcaggatgtttctgggtgggaagatacacgggact
caacagctggggggacccattgcatgtaagcagaccttgtggtttggggaagagaacgca
cagaaccacccgacagcagggggcccttcaggaagctcctcggctctcacaggagagcgg
ggatgtactcgggtcttccatgaggagttattgtgggggctcaactgtgcgaggcctgac
acgggcttggtgaagcggggcgggctggttggaccggcctcagagctctggcttatctcc
cctcccgtgtcagctgtgtcagcggctgcttgctggatggctgaaggagtgaagaatgca
tggctgcaagcgggtctggagttggagtcggagtccggtgacatttgggctggggactgg
ggatctggagaattcgacctggcacctgaccttgaggagttcaaagacatgtggcctgga
tggggtgagaggttcttcgtgcaagggggccctctgggcatctgcggcgagctgcagctc
agcgtggctctacagcaagggtggcaggcacctcaggggacatctgttggggacatcatc
agggtttgggaaagaggctattttggggaatttcaggatgctggagttatcaggaccctc
acccgtaaccccagtcaagcaaagctggtggctgcagacaaccagctgtctccagttggc
cacaaccagcttggccagggcctgctgtgtatcccaggctatagctcacagagggaaaag
ctgagaatttcctcatcctacagtaggaggaagtgggaatga

>gi568815597r:53667973_53938261|GENSCAN_predicted_peptide_2|218_aa
MPRPRAAIGLSRAAAASQWLERLLGRRGPAPVPRGTAPQDAAAAAAAAAAAGAAAPVEAR
RAGRPAICSCRCEEPGATPSRAPRPGAMHCEVAEAHSDKRPKEAPGAPGPDRGPASLGAH
MAFRVTVSGGGCGDRGPRDLLARPPAPPPRAHDLLRPRSPRDYGPSKAAAAGKGEFGPAN
VPGGSCGDSGCPASPPRARESLVELSSEFLSFSLIVWD

>gi568815597r:53667973_53938261|GENSCAN_predicted_CDS_2|657_bp
atgccccgccctcgcgccgcgattggcctgagccgggcggccgcggccagccagtggctg
gagcggctgttgggcagacgcgggcccgcccccgtcccccggggaaccgccccgcaggac
gcggcggcggcggcggcggctgcggcggctgcgggggctgctgcgccagtggaggcgcgc
cgagccgggcgcccagcgatctgcagctgccgctgcgaggagcccggcgcgacccccagc
cgtgcgccccgccccggcgccatgcattgcgaggtggctgaggcacactcggacaagagg
cctaaggaggcccctggtgcgcccggccccgaccgcggccccgccagcctcggcgcgcac
atggccttcagggtcaccgtgagtggcggcggctgcggggacaggggcccgcgggacctg
ctagcccggccgcctgcgccgccaccgcgcgcccacgacctcctccggccccgcagtccc
cgagactacggtccatccaaggccgccgccgccgggaagggtgagttcggccctgccaac
gtgcctggcggctcctgtggagattcgggctgcccggcgtcgccaccgcgtgctcgggag
agcttggtggaactttcttcggagttcctcagtttctctcttattgtttgggactag

>gi568815597r:53667973_53938261|GENSCAN_predicted_peptide_3|201_aa
MQRRHLLLAPRWTLLRRDRLVARGRFLELQPRLRDVSARQSWPVELVQTKPRRDGLEPGP
GFAQAAFEADPAGSSWGLPAFELRWRAGARSPVRPRLGAAISMATAVSRPCAGRSRDILW
RAVDKYFKLPHASSKPPRISGSLVDTSYKTLRFAFRASLKTAIYRITTTFGEHLKAVWLD
RPQVVMLSLLTCILPDFECAL

>gi568815597r:53667973_53938261|GENSCAN_predicted_CDS_3|606_bp
atgcagcgccgccacctgcttctggcgccccgctggaccctgcttcgaagggaccgtttg
gtcgccagagggcggttcctggagttacagccgcgtttgcgggacgtgtctgcgcggcag
tcctggcctgttgagcttgtccagacgaagcctcgcagggatgggttggagcctgggccg
ggcttcgctcaggcagcgtttgaggcagacccagcagggtcctcctggggccttcctgcc
tttgaactgcggtggcgggcgggcgcacggtctcctgtacgccctagactaggggccgcc
atctccatggccacggccgtgagccggccctgcgccggcaggtcgcgggacatactgtgg
cgcgcagtcgacaagtactttaagcttcctcatgcttccagtaaaccaccccggatttca
ggaagccttgtggacacttcatataaaacattaagatttgcattcagagcatcactgaaa
actgccatctatcgaataactactacatttggtgaacatctgaaagctgtgtggctggac
cgccctcaagtggtcatgctctccctgctcacatgtattttaccagattttgagtgtgcc
ttataa

>gi568815597r:53667973_53938261|GENSCAN_predicted_peptide_4|153_aa
MKEPKALISLLVTINLGSETLNKAVCSKSMSRLGKNMKTRKFPSQEDLMTKDSPEALSAS
LIKALITLRYDYLLMALLRTAPPVTAPGKLTPIWMRSQALKPKQGFPDSIASLSLYGFDE
VSDHTERPTWQATEGGLQMTVSKKALILRSHRN

>gi568815597r:53667973_53938261|GENSCAN_predicted_CDS_4|462_bp
atgaaagaacctaaagccctgatcagccttttagtaaccatcaacttgggttctgaaacg
ctaaataaagctgtctgttccaaatcaatgtcaagactagggaagaatatgaagactaga
aaattcccttctcaggaggatttaatgacaaaggactctccagaagccctctctgcatca
ttaattaaagccctgatcacactgcgatatgattatctgctgatggccctcctccggaca
gccccacctgtcactgcccctggaaagctcaccccaatctggatgcggtctcaagccctc
aagccaaagcagggcttcccagactcgattgcctctctctccctttatggctttgatgaa
gtgagtgatcatacagagaggcccacgtggcaagcaactgagggtggtctccaaatgacc
gtcagcaagaaggccctcattctacgaagccacaggaattga

>gi568815597r:53667973_53938261|GENSCAN_predicted_peptide_5|233_aa
MAGPGMSGCRTLYYLLRLDILLIVSPECMGALEAVTVSLLPIIVSPAEGPFWICATLVFA
IAISGNLSNFLIHLGEKTYHYVPEFRKVSIAATIIYAYAWLVPLALWGFLMWRNSKVMNI
VSYSFLEIVCVYGYSLFIYIPTAILWIIPQKAVRWILVMIALGISGSLLAMTFWPAVRED
NRRVALATIVTIVLLHMLLSVGCLAYFFDAPEMDHLPTTTATPNQTVAAAKSS

>gi568815597r:53667973_53938261|GENSCAN_predicted_CDS_5|702_bp
atggcaggtcctgggatgagtgggtgtcgcaccctgtactaccttctgagattagatatt
ttattgatcgtctccccggagtgcatgggtgcactagaggcagtgactgtgtctctgttg
cccatcatcgtgtccccagcagagggccccttttggatatgtgccacgttggtctttgcc
atagcaattagtgggaatctttccaacttcttgatccatctgggagagaagacgtaccat
tatgtgcccgaattccgaaaagtgtccatagcagctaccatcatctatgcctatgcctgg
ctggttcctcttgcactctggggtttcctcatgtggagaaacagcaaagttatgaacatc
gtctcctattcatttctggagattgtgtgtgtctatggatattccctcttcatttatatc
cccaccgcaatactgtggattatcccccagaaagctgttcgttggattctagtcatgatt
gccctgggcatctcaggatctctcttggcaatgacattttggccagctgttcgtgaggat
aaccgacgcgttgcattggccacaattgtgacaattgtgttgctccatatgctgctttct
gtgggctgcttggcatacttttttgatgcaccagagatggaccatctcccaacaactaca
gctactccaaaccaaacagttgctgcagccaagtccagctaa

>gi568815597r:53667973_53938261|GENSCAN_predicted_peptide_6|201_aa
MGLPQPGLWLKRLWVLLEVAVHVVVGKVLLILFPDRVKRNILAMGEKTGMTRNPHFSHDN
WIPTFFSTQYFWFVLKVRWQRLEDTTELGGLAPNCPVVRLSGQRCNIWEFMQDGWAFKNN
MDIRNHQNLQDRLQAAHLLLARSPQCPVVVDTMQNQSSQLYAALPERLYIIQEGRILYKG
KSGPWNYNPEEVRAVLEKLHS

>gi568815597r:53667973_53938261|GENSCAN_predicted_CDS_6|606_bp
atggggctgccccagccagggctgtggctgaagaggctctgggtgctcttggaggtggct
gtgcatgtggtcgtgggtaaagtgcttctgatattgtttccagacagagtcaagcggaac
atcctggccatgggcgagaagacgggtatgaccaggaacccccatttcagccacgacaac
tggataccaacctttttcagcacccagtatttctggttcgtcttgaaggtccgttggcag
cgactagaggacacgactgagctagggggtctggccccaaactgcccggtggtccgcctc
tcaggacagaggtgcaacatttgggagtttatgcaagatggctgggcttttaagaacaac
atggacatcagaaatcaccagaaccttcaggatcgcctgcaggcagcccatctactgctg
gccaggagcccccagtgccctgtggtggtggacaccatgcagaaccagagcagccagctc
tacgcagcactgcctgagaggctctacataatccaggagggcaggatcctctacaagggt
aaatctggcccttggaactacaacccagaggaagttcgtgctgttctggaaaagctccac
agttaa