GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:18:20 Sequence gi568815595r:195416899_195643025 : 226127 bp : 44.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 25678 26117 440 0 2 41 -8 350 0.933 14.29 1.02 Term + 26157 26280 124 0 1 54 53 180 0.521 8.96 1.03 PlyA + 28523 28528 6 1.05 2.00 Prom + 38719 38758 40 -5.76 2.01 Init + 48719 48878 160 1 1 68 63 156 0.506 11.09 2.02 Term + 57316 57437 122 2 2 68 55 64 0.299 -0.26 2.03 PlyA + 58191 58196 6 1.05 3.05 PlyA - 59126 59121 6 1.05 3.04 Term - 100044 99998 47 1 2 131 42 19 0.783 -1.03 3.03 Intr - 102287 102120 168 0 0 62 72 152 0.884 10.92 3.02 Intr - 106888 106794 95 0 2 46 53 124 0.999 4.31 3.01 Init - 107994 107921 74 0 2 62 100 100 0.932 9.14 3.00 Prom - 111432 111393 40 -3.56 4.05 PlyA - 111571 111566 6 1.05 4.04 Term - 121175 121167 9 2 0 98 43 0 0.173 -5.31 4.03 Intr - 126268 126006 263 2 2 7 83 388 0.566 27.41 4.02 Intr - 133305 133236 70 0 1 59 86 48 0.542 0.45 4.01 Init - 148961 148731 231 2 0 48 98 115 0.698 6.76 4.00 Prom - 149604 149565 40 -5.06 5.05 PlyA - 150256 150251 6 1.05 5.04 Term - 152237 152002 236 0 2 102 41 281 0.843 21.28 5.03 Intr - 154467 154379 89 2 2 74 78 68 0.991 4.01 5.02 Intr - 157073 156952 122 2 2 78 105 100 0.995 9.99 5.01 Init - 162563 162441 123 2 0 93 93 201 0.980 19.29 5.00 Prom - 173239 173200 40 -4.66 6.00 Prom + 191739 191778 40 -3.36 6.01 Sngl + 201759 203219 1461 2 0 60 49 1046 0.286 92.63 6.02 PlyA + 204240 204245 6 1.05 7.07 PlyA - 205978 205973 6 1.05 7.06 Term - 208056 207783 274 2 1 77 43 118 0.237 1.14 7.05 Intr - 209464 209361 104 1 2 53 94 35 0.414 -0.43 7.04 Intr - 209883 209752 132 0 0 102 76 42 0.615 5.24 7.03 Intr - 211609 211474 136 0 1 12 90 95 0.197 2.67 7.02 Intr - 212375 212190 186 1 0 125 -9 118 0.205 4.60 7.01 Init - 214866 214736 131 0 2 53 22 162 0.752 3.72 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 137816 138067 252 1 0 51 47 182 0.932 5.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:195416899_195643025|GENSCAN_predicted_peptide_1|187_aa MRGDVPVPATSSPLGRARWGLQCPRRVGNGGNSRSPRRKERTAGPPPGARARLSRPQRRE GSSAVTPGGRAGYLKRGESFRHSSKSTVIFILPPPRRRRWQSRGGRGSGRAGTQTATAGA RPRLALRGAAKGASPAGHSSREDGDDYRALAAGKRPGSDSSGQPATNLAAKGRGIRQHRL RAPGQRV >gi568815595r:195416899_195643025|GENSCAN_predicted_CDS_1|564_bp atgagaggcgacgtccccgtccccgccacttcgtcgcccctcgggagggctcggtgggga ctgcagtgccctcgcagggtgggaaacggcggcaacagccgctcgccccgtcggaaggag cggacggcggggcctcctccgggtgcgcgggcccggctttcacgcccccagcgccgggaa ggcagctccgcggtgacgccgggcggccgtgccggttacctgaagcggggcgagtccttc agacactcctcgaaatccacagtcatcttcatcctgcctccgcctcgcaggcggcgctgg caaagccgagggggccgcgggagcggccgcgctgggacgcagacggctacggcgggcgca cggccgcgactagcgttgcgcggagctgcgaagggcgcctcgcccgctggtcatagcagc cgcgaagacggcgacgactatagagcgctggccgcagggaaacggccggggagtgacagc agcggccagccagccacgaacctggcggccaaggggcggggcatccggcagcaccgcctg cgggcgcccgggcaacgggtctga >gi568815595r:195416899_195643025|GENSCAN_predicted_peptide_2|93_aa MEEATSQQGPKEVGDSLVDFWGRSSPERENSRCKDPEWKECLDGPGMAVWLEQTPCNVAC YPQKANEDTLTKITSDLGDPESSEVFLMLALLF >gi568815595r:195416899_195643025|GENSCAN_predicted_CDS_2|282_bp atggaggaggcgacatcccagcagggacccaaggaagtaggagacagccttgtggacttc tgggggaggagtagtccagaaagagagaacagccggtgcaaagaccctgagtggaaagaa tgcttggatggtccaggaatggccgtgtggctggagcagaccccgtgcaatgtggcttgc taccctcaaaaggccaatgaggacactctcactaagatcaccagtgatctaggtgaccct gaatctagcgaagtctttctcatgttagctctgttgttttga >gi568815595r:195416899_195643025|GENSCAN_predicted_peptide_3|127_aa MGDDEDACSDTEATEAMAPDILARKLAAAEGLEPKYRIQEQESSGEEDSDLSPEEREKKR QFEMKRKLHYNEGLNIKLARQLISKDLHDDDEDEEMLETADGESMNTEESNQGSTPSDQQ QNKLRSS >gi568815595r:195416899_195643025|GENSCAN_predicted_CDS_3|384_bp atgggggatgatgaagatgcctgtagtgacaccgaggccactgaagccatggcgccagac atcttagccaggaaattagctgcagctgaaggcttggagccaaagtatcggattcaggaa caagaaagcagtggagaggaggatagtgacctctcacctgaagaacgagaaaaaaagcga caatttgaaatgaaaaggaagcttcactacaatgaaggactcaatatcaaactagccaga caattaatttcaaaagacctacatgatgatgatgaagatgaagaaatgttagagactgca gatggagaaagcatgaatacggaagaatcaaatcaaggatctactccaagtgaccaacag caaaacaaattacgaagttcatag >gi568815595r:195416899_195643025|GENSCAN_predicted_peptide_4|190_aa MCPPELATANPQAEQKPLTSMASLDGNDKLRQSNKSLTLSSLERGSLKALVLQGHVACSG HYRPCTTEVMCAEQKRRPRERKEGLESCLENHLIKTPDNTAVSEAAVAAALALSGICGCL RVSAVPTLLFADPTPSSDPEPTAGAPGNGGLDGLAPAHQGDLEEQDLYDFLYGGVGRTAP RECRRGAEGY >gi568815595r:195416899_195643025|GENSCAN_predicted_CDS_4|573_bp atgtgtccacctgagctagcaacagccaatccacaggctgaacagaaacctctgacttct atggcctcgctggatggaaatgataaactgagacaatcaaataagagtttgactttaagc agtctggaaagaggatcattgaaagctctggtactgcaaggtcatgtggcttgtagtggc cactaccggccatgcactactgaagtgatgtgtgcggaacagaaacgcaggccacgggag cgcaaagagggtcttgagtcctgtctagagaaccacttgataaaaacacctgataacaca gccgtttccgaggcagcagttgcggccgctttagccctgagcgggatctgcggctgcctg cgagtctctgctgtgccgacccttctcttcgcggaccccacgccaagcagcgaccctgag ccgacagccggagcgcccggcaatggcggcctcgacggcctcgcaccggcccatcaaggg gatcttgaagaacaagacctctacgacttcctctatggtggcgtcggccgaacagccccg cgggaatgtcgacgaggagctgagggatactga >gi568815595r:195416899_195643025|GENSCAN_predicted_peptide_5|189_aa MVMLLLLLSALAGLFGAAEGQAFHLGKCPNPPVQENFDVNKYLGRWYEIEKIPTTFENGR CIQANYSLMENGKIKVLNQELRADGTVNQIEGEATPVNLTEPAKLEVKFSWFMPSAPYWI LATDYENYALVYSCTCIIQLFHVDFAWILARNPNLPPETVDSLKNILTSNNIDVKKMTVT DQVNCPKLS >gi568815595r:195416899_195643025|GENSCAN_predicted_CDS_5|570_bp atggtgatgctgctgctgctgctttccgcactggctggcctcttcggtgcggcagaggga caagcatttcatcttgggaagtgccccaatcctccggtgcaggagaattttgacgtgaat aagtatctcggaagatggtacgaaattgagaagatcccaacaacctttgagaatggacgc tgcatccaggccaactactcactaatggaaaacggaaagatcaaagtgttaaaccaggag ttgagagctgatggaactgtgaatcaaatcgaaggtgaagccaccccagttaacctcaca gagcctgccaagctggaagttaagttttcctggtttatgccatcggcaccgtactggatc ctggccaccgactatgagaactatgccctcgtgtattcctgtacctgcatcatccaactt tttcacgtggattttgcttggatcttggcaagaaaccctaatctccctccagaaacagtg gactctctaaaaaatatcctgacttctaataacattgatgtcaagaaaatgacggtcaca gaccaggtgaactgccccaagctctcgtaa >gi568815595r:195416899_195643025|GENSCAN_predicted_peptide_6|486_aa MTTDDTEVPAMTLAPGHAALETQTLSAETSSRASTPAGPIPEAETRGAKRISPARETRSF TKTSPNFMVLIATSVETSAASGSPEGAGMTTVQTITGSDPREAIFDTLCTDDISEEAKTL TMDILTLAHTSTEAKGLSSESSASSDGPHPVITPSRASESSASSDGPHPVITPSRASESS ASSDGPHPVITPSRASESSASSDGPHPVITPSRASESSASSDGLHPVITPSRASESSASS DGLHPVITPSRASESSASSDGPHPVITPSWSPGSDVTLLAEALVTVTNIEVINCSITEIE TTTSSIPGASDTDLIPTEGVKASSTSDPPALPDSTNTKPHITEVTASAETLSTAGTTESA APDATIGTPLPTNSTIEREVTAPGATTLSGALATGNPLEETSALSVETPSYVKVSGAAPV SIEAGSAVGKTTSFAGSSASSYSPLEAALKNFTPSETLTTDIATKGPFPTSRAPLPSVPP TTTNSS >gi568815595r:195416899_195643025|GENSCAN_predicted_CDS_6|1461_bp atgacaacggacgacacagaagtgcccgctatgactctagcaccgggccacgccgctctg gaaactcaaacgctgagcgctgagacctcttctagggcctcaaccccagccggccccatt ccagaagcagagaccaggggagccaagagaatttcccctgcaagagagaccaggagtttc acaaaaacatctcccaacttcatggtgctgatcgccacctccgtggagacatcagccgcc agtggcagccccgagggagctggaatgaccacagttcagaccatcacaggcagtgatccc agggaagccatctttgacaccctttgcaccgatgacatctctgaagaggcaaagacactc acaatggacatattgacattggctcacacctccacagaagctaagggcctgtcctcagag agcagcgcctcttccgacggcccccatccagtcatcaccccgtcacgggcctcagagagc agcgcctcttccgacggcccccatccagtcatcaccccgtcacgggcctcagagagcagc gcctcttccgacggcccccatccagtcatcaccccgtcacgggcctcagagagcagcgcc tcttccgacggcccccatccagtcatcaccccgtcacgggcctcagagagcagcgcctct tccgacggcctccatccagtcatcaccccgtcacgggcctcagagagcagcgcctcttcc gacggcctccatccagtcatcaccccgtcacgggcctcagagagcagcgcctcttccgac ggcccccatccagtcatcaccccctcatggtccccgggatctgacgtcactctcctcgct gaagccctggtgactgtcacaaacatcgaggttattaattgcagcatcacagaaatagaa acaacgacttccagcatccctggggcctcagacacagatctcatccccacggaaggggtg aaggcctcgtccacctccgatccaccagctctgcctgactccactaacacaaaaccacac atcactgaggtcacagcctctgccgagaccctgtccacagccggcaccacagagtcagct gcacctgatgccacgattgggaccccactccccaccaacagcaccatagaaagagaagtg acagcacccggggccacgaccctcagtggagctctggccacagggaatcccctggaagaa acctcagccctctctgttgagacaccaagttacgtcaaagtctcaggagcagctccggtc tccatagaggctgggtcagcagtgggcaaaacaacttcctttgctgggagctctgcttcc tcctacagccccttggaagccgccctcaagaacttcaccccttcagagacactgaccacg gacatcgcaaccaaggggcccttccccaccagcagggcccctcttccttctgtccctccg actacaaccaacagcagctga >gi568815595r:195416899_195643025|GENSCAN_predicted_peptide_7|320_aa MGLPGLPGVAGGRVGEDGASVCMCESLKDGMSVKMASPSRGCLPAVCLGPGVRNGQFFNT VSGSGRPEGSQALSPWRAFCGLSQGSRQEPGENRCVCSRRALMPLRDTERPTVCFAGTVQ CSQGKSCIRHAGAVAAPAENQSLIFGVRKVEAPSPNTIALGVRISTDEFRGTRTFKPSQS PTHQGESVWREDIREPPGLWSPWSVVLAYGSPTAPVRWEETADTMRRHHTLRRKPPGGLW GLRSVPGVPGSCLNAHSGSCPAASAPRPRPRAGSAPWCRLGSVQGRAGCAEHLQAPPWVT EDRGADQVFYRSVGDQQLDA >gi568815595r:195416899_195643025|GENSCAN_predicted_CDS_7|963_bp atgggacttcctggcctgcccggggtggcgggggggcgggtgggagaggacggagcgtct gtgtgcatgtgtgagagcctcaaggacggcatgtctgtgaagatggcttcacccagccgc ggctgccttccagccgtgtgtctagggcccggggtgcggaacgggcagttcttcaacaca gtgagcggcagcgggcgtcccgaaggttctcaggccttgtctccgtggagggcattttgt ggcctctcccagggcagccggcaggagccaggcgagaacagatgcgtctgtagcaggagg gcgttgatgcctctgagagatacagagcgacccaccgtctgctttgctgggactgtccag tgttcgcagggaaagtcctgcatccggcacgctggggcggtggctgcaccagctgagaac cagtctctgattttcggggtgagaaaagtagaggccccatctcctaacaccatcgccttg ggggtgaggatttcaacagatgaatttcggggaacacggacattcaaaccctcacaaagc cccacccaccaaggagagtctgtctggagagaagacattagggagccaccagggttgtgg agcccctggtctgtggtgctggcttatggcagccctactgccccagtgagatgggaagag acagcagacacaatgagaagacaccacacgctcaggcggaaaccccccgggggcctctgg gggctgcggtcggtgccaggggtcccgggcagctgcctgaacgcgcacagcggctcctgc cccgcagcctccgccccgcgcccgcgtcctcgggccggcagcgccccctggtgccgcctc gggtctgtgcagggccgggcgggctgcgcggagcacctgcaggcgcctccatgggtgacg gaagacagaggggctgaccaggtcttctacaggtcagtgggcgatcaacagctggacgcg tag