GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:24:27

Sequence gi568815578r:4684276_4901030 : 216755 bp : 45.95% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1475   1541   67  1  1   30   95    65 0.500   2.63
 1.02 Intr +   3029   3202  174  0  0   77   10   119 0.090   2.91
 1.03 Intr +   7069   7153   85  2  1   69   72     1 0.029  -4.52
 1.04 Intr +   8795   8908  114  2  0  106   17    80 0.237   2.26
 1.05 Term +  14936  15707  772  2  1  136   55   886 0.632  82.97
 1.06 PlyA +  19529  19534    6                               1.05

 2.04 PlyA -  23165  23160    6                               1.05
 2.03 Term -  28950  28856   95  1  2   60   47    96 0.577   0.69
 2.02 Intr -  29550  29451  100  0  1  110   89    42 0.627   6.18
 2.01 Init -  35020  34934   87  1  0   82   42    34 0.103  -1.15
 2.00 Prom -  39753  39714   40                              -3.56

 3.00 Prom +  39810  39849   40                              -4.56
 3.01 Sngl +  40277  40807  531  1  0   85   43   792 0.940  68.47
 3.02 PlyA +  41359  41364    6                               1.05

 4.18 PlyA -  43302  43297    6                               1.05
 4.17 Term -  52016  51754  263  0  2   49   48   171 0.948   4.89
 4.16 Intr -  64746  64546  201  1  0   66   60    59 0.011   0.16
 4.15 Intr -  91775  91699   77  1  2   34   75    80 0.092   0.46
 4.14 Intr -  92607  92477  131  0  2   53   63    87 0.168   2.19
 4.13 Intr -  98255  98183   73  1  1   77   78    65 0.292   3.81
 4.12 Intr - 102053 101956   98  2  2  126  108    86 0.992  13.21
 4.11 Intr - 103479 103358  122  1  2   77   86   216 0.938  20.51
 4.10 Intr - 103993 103942   52  1  1   59  103    83 0.795   5.38
 4.09 Intr - 105422 105351   72  2  0   77   34   140 0.784   7.10
 4.08 Intr - 106336 106176  161  2  2   80  116   215 0.999  23.21
 4.07 Intr - 108352 108264   89  0  2   98   88    92 0.619   9.81
 4.06 Intr - 111691 111540  152  1  2   82   84   200 0.999  17.96
 4.05 Intr - 113810 113735   76  1  1   93   94    90 0.934   9.62
 4.04 Intr - 117101 116929  173  2  2   67   52     2 0.190  -6.76
 4.03 Intr - 118072 117951  122  2  2   43   40   109 0.304   1.81
 4.02 Intr - 121704 121512  193  0  1   50   49   176 0.526   8.97
 4.01 Init - 122472 122422   51  0  0   71   78    44 0.403   0.86
 4.00 Prom - 122700 122661   40                              -5.96

 5.00 Prom + 125138 125177   40                              -1.26
 5.01 Init + 129257 129463  207  1  0   76   33   108 0.053   2.96
 5.02 Intr + 136772 136894  123  1  0   32   94   105 0.087   6.28
 5.03 Intr + 141689 141740   52  1  1   79   99    46 0.034   3.28
 5.04 Intr + 151078 151201  124  2  1   11   78    83 0.049  -0.76
 5.05 Intr + 155345 155468  124  2  1   74   93    29 0.461   2.59
 5.06 Term + 156556 156576   21  0  0   94   55    32 0.423  -1.19
 5.07 PlyA + 158960 158965    6                               1.05

 6.13 PlyA - 159318 159313    6                               1.05
 6.12 Term - 172929 172697  233  1  2   19   38   213 0.884   6.04
 6.11 Intr - 175109 175014   96  0  0  118   98    85 0.995  12.48
 6.10 Intr - 177810 177673  138  1  0   86   94    69 0.937   7.74
 6.09 Intr - 178632 178503  130  0  1  111   91   139 0.996  16.77
 6.08 Intr - 183600 183495  106  2  1   80   81    53 0.978   4.02
 6.07 Intr - 185778 185631  148  1  1   88   98   256 0.997  25.89
 6.06 Intr - 189817 189661  157  1  1   66  103   142 0.998  13.08
 6.05 Intr - 190421 190301  121  1  1  138   76    -5 0.988   3.80
 6.04 Intr - 199548 199367  182  0  2  118   87   138 0.717  15.37
 6.03 Intr - 200548 200478   71  2  2   49   83    82 0.897   2.70
 6.02 Intr - 201634 201546   89  0  2   92   91    51 0.939   5.41
 6.01 Init - 201910 201849   62  1  2   70   35    82 0.611   0.14
 6.00 Prom - 205557 205518   40                              -2.66

 7.04 PlyA - 206046 206041    6                               1.05
 7.03 Term - 212851 212672  180  1  0   67   55   178 0.972  10.01
 7.02 Intr - 214921 214852   70  0  1  101   84     1 0.921   0.18
 7.01 Intr - 215437 215280  158  1  2  114   94   140 0.995  16.01

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 168725 168655   71  0  2   83   48    99 0.840   3.50

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:4684276_4901030|GENSCAN_predicted_peptide_1|403_aa
MRQLSAGILRKNTPPCFIEDISGAGREELQVCGLAPGAMADPSLASHIPPSPPGGRTLAM
GGSKEQPASDDDPWVTGLPTYLRGKALNFSPLSMMLAVTLSYVAFIVLRHDPPPVCPISV
DGITAISLVMEVPDPGNSHSSIHSTSRAVIMANLGCWMLVLFVATWSDLGLCKKRPKPGG
WNTGGSRYPGQGSPGGNRYPPQGGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQPHGGGWG
QGGGTHSQWNKPSKPKTNMKHMAGAAAAGAVVGGLGGYMLGSAMSRPIIHFGSDYEDRYY
RENMHRYPNQVYYRPMDEYSNQNNFVHDCVNITIKQHTVTTTTKGENFTETDVKMMERVV
EQMCITQYERESQAYYQRGSSMVLFSSPPVILLISFLIFLIVG
>gi568815578r:4684276_4901030|GENSCAN_predicted_CDS_1|1212_bp
atgagacagttgtcagcaggaatcctgcgcaagaacacaccaccctgtttcatagaagat
atctcaggggcggggagagaggagctgcaggtctgcggcctggccccaggtgcgatggcg
gaccccagcttggccagtcacattcctcccagtccccctggagggagaacgctggccatg
gggggctccaaggaacaaccagcctcggatgacgacccttgggtcaccggtctccccacc
tatctaagaggaaaagctctcaatttttcaccattaagtatgatgttagctgtgaccttg
tcatatgtggcctttattgtgttgaggcatgaccctccaccagtttgtcctatttcagta
gatggcatcaccgccatctccctggttatggaagtcccagacccgggcaactctcactcc
tccatccactcaaccagcagagcagtcattatggcgaaccttggctgctggatgctggtt
ctctttgtggccacatggagtgacctgggcctctgcaagaagcgcccgaagcctggagga
tggaacactgggggcagccgatacccggggcagggcagccctggaggcaaccgctaccca
cctcagggcggtggtggctgggggcagcctcatggtggtggctgggggcagcctcatggt
ggtggctgggggcagccccatggtggtggctggggacagcctcatggtggtggctggggt
caaggaggtggcacccacagtcagtggaacaagccgagtaagccaaaaaccaacatgaag
cacatggctggtgctgcagcagctggggcagtggtggggggccttggcggctacatgctg
ggaagtgccatgagcaggcccatcatacatttcggcagtgactatgaggaccgttactat
cgtgaaaacatgcaccgttaccccaaccaagtgtactacaggcccatggatgagtacagc
aaccagaacaactttgtgcacgactgcgtcaatatcacaatcaagcagcacacggtcacc
acaaccaccaagggggagaacttcaccgagaccgacgttaagatgatggagcgcgtggtt
gagcagatgtgtatcacccagtacgagagggaatctcaggcctattaccagagaggatcg
agcatggtcctcttctcctctccacctgtgatcctcctgatctctttcctcatcttcctg
atagtgggatga
>gi568815578r:4684276_4901030|GENSCAN_predicted_peptide_2|93_aa
MAIYEPQSGFSPDTQSAGTLILDLPASRTLPHGNITDFPTGHEPMPLFHLLCSPDSNCKC
YAGTRMKLEGIILTKLTREQKTKHRMFSLTSGS
>gi568815578r:4684276_4901030|GENSCAN_predicted_CDS_2|282_bp
atggccatctatgaaccacaaagtgggttctcaccagacacccaatctgctggcaccttg
atcttggacttgccagcctccagaacactgccacatggaaacattactgattttccaact
ggacatgaaccaatgcctctcttccatcttctctgctctcccgactccaactgcaagtgc
tacgcagggacacgaatgaagctggaaggcatcatcctcaccaaactaacacgggaacag
aaaaccaaacaccgcatgttctcactcacaagtgggagttga
>gi568815578r:4684276_4901030|GENSCAN_predicted_peptide_3|176_aa
MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFI
KQGRKLDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEF
QKPDNKLHQQVLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLTVK
>gi568815578r:4684276_4901030|GENSCAN_predicted_CDS_3|531_bp
atgaggaagcacctgagctggtggtggctggccactgtctgcatgctgctcttcagccac
ctctctgcggtccagacgaggggcatcaagcacagaatcaagtggaaccggaaggccctg
cccagcactgcccagatcactgaggcccaggtggctgagaaccgcccgggagccttcatc
aagcaaggccgcaagctcgacattgacttcggagccgagggcaacaggtactacgaggcc
aactactggcagttccccgatggcatccactacaacggctgctctgaggctaatgtgacc
aaggaggcatttgtcaccggctgcatcaatgccacccaggcggcgaaccagggggagttc
cagaagccagacaacaagctccaccagcaggtgctctggcggctggtccaggagctctgc
tccctcaagcattgcgagttttggttggagaggggcgcaggacttcgggtcaccatgcac
cagccagtgctcctctgccttctggctttgatctggctcacggtgaaataa
>gi568815578r:4684276_4901030|GENSCAN_predicted_peptide_4|701_aa
MDRLQWLTVIPALWEAEACCTSGYLHWKFPWVCLKLSPGPHFSNCEGGTSAYVAVLASSL
CVTRTLPVTPRFIRQHTLSPKVLVIITGVVLLASSEQNPEMLLNTCNAQDSPTTKNSPRP
GARGGLISHPWNQCGCQCALWLGHLCLDCPRVVVFPLDVCSQRLWQAEAAASSVAPHSLG
NELLLHLKTYNLYYEGQNLQLRHREEEDEFIVEGLLNISWGLRRPIRLQMQDDNERIRPP
PSSSSWHSGCNLGAQGTTLKPLTVPKVQISEVDAPPEGDQMPSSTDSRGLKPLQEDTPQL
MRTRSDVGVRRRGNVRTPSDQRRIRRHRFSINGHFYNHKTSVFTPAYGSVTNVRINSTMT
TPQIENSAEEFALYVVHTSGEKQKLKATDYPLIARILQGPCEQISKVFLMEKDQVEEVTY
DVAQYIKFEMPVLKSFIQKLQEEEDREVKKLMRKDSSTSPVGHGVEAQKGFCKLHRTGSL
SPSKLPLMPPTLTCRAIYGVNSTESGVSYNILPWDNPGVLRSSPVMRFFRYTRQEHRDTE
SPLSLRQGREDKTRSQSFLQLARTMPWTALRHAENNGALESLGPASSGQSSSSPDVFAEV
TLLSWKGVGSCALGEKCILIDENDSNIGTETKKNCHENENIGNGLLHQALSVFLLNTKMS
YSDSRDQMLKLPFEPVSPILGCSHPLSNPDKLERNDVIDIS
>gi568815578r:4684276_4901030|GENSCAN_predicted_CDS_4|2106_bp
atggaccggctgcagtggctcactgtaatcccagcactttgggaggctgaggcctgctgt
acttctggttatctgcactggaagtttccatgggtctgcctgaaactcagtcctggtcct
cacttcagcaactgtgaaggtggcacctctgcctacgttgcggtcctcgccagcagcctg
tgcgtcaccaggactctgcctgtcaccccacgtttcatccggcagcacaccctttcccca
aaggttttggtcatcatcactggggtagtgctgctggcatctagtgagcagaatccagag
atgctgctaaacacctgcaatgcacaagacagtcccacgacaaagaacagtccaaggcca
ggggcgagaggaggactcatcagtcatccttggaaccaatgtggctgtcagtgtgctctg
tggctggggcatttgtgcttggattgtccaagggttgtcgtctttcctttggatgtttgt
tctcagaggctgtggcaagccgaggcagcagccagcagtgttgcccctcacagccttgga
aatgaacttctcttgcatctgaagacctacaacttgtactatgaaggccagaatttacag
ctccggcaccgggaggaagaagacgagttcattgtggaggggctcctgaacatctcctgg
ggcctgcgccggcccattcgcctgcagatgcaggatgacaacgaacgcattcgaccccct
ccatcctcctcctcctggcactctggctgtaacctgggggctcagggaaccactctgaag
cccctgactgtgcccaaagttcagatctcagaggtggatgccccgccggagggtgaccag
atgccaagctccacagactccaggggcctgaagcccctgcaggaggacaccccacagctg
atgcgcacacgcagtgatgttggggtgcgtcgccgtggcaatgtgaggacgcctagtgac
cagcggcgaatcagacgccaccgcttctccatcaacggccatttctacaaccataagaca
tccgtgttcacaccagcctatggctctgtcaccaacgtccgcatcaacagcaccatgacc
accccacagattgagaattcagcagaggagtttgccttgtacgtggtccatacgagtggt
gagaaacagaagctgaaggccaccgattacccgctgattgcccgaatcctccagggccca
tgtgagcagatctccaaagtgttcctaatggagaaggaccaggtggaggaagtcacctac
gacgtggcccagtatataaagttcgagatgccggtacttaaaagcttcattcagaagctc
caggaggaagaagatcgggaagtaaagaagctgatgcgcaaggacagcagcaccagccct
gtggggcatggagtggaagcccagaagggcttctgcaagctgcacagaactgggtcacta
agcccctctaagctgcccttgatgccacctaccctcacctgcagagccatctatggggtc
aacagcaccgagagtggggtatcttacaacatccttccctgggacaacccaggggttctg
agaagctcacccgtgatgcgatttttccggtacacaaggcaagaacaccgggatacagaa
agccctctgtccttgcgacaaggaagagaggacaaaacccggagccaaagcttccttcag
ctggcccgcaccatgccctggacagctctgagacatgcagagaacaatggagccctggag
agcctggggccggcttcctctggacaatcttcatcctcacctgatgtttttgctgaagtc
actttgctctcatggaaaggtgttggcagttgtgccttgggggagaagtgtattcttatt
gatgaaaatgacagtaatattggaactgagaccaagaagaattgtcacgagaatgaaaac
attgggaatggattattgcatcaagctcttagtgtcttcttactcaacaccaaaatgagc
tacagtgacagcagagatcagatgctgaaattacctttcgagccagtttcaccaatactt
ggttgtagtcatccattaagtaatccagacaagcttgagagaaatgatgtcattgacata
agttga
>gi568815578r:4684276_4901030|GENSCAN_predicted_peptide_5|216_aa
MASQTGGAATCLQGSPTGIHKCLTFRMDVAEGNSERLATNQRGRALERDEKGSIKEAQTP
EKSSCVCVGLTRSAIPKQLTYGTQTRSLAPGHQEIDQEVKAKPIATGMSQASTTRGQTKR
LMAQEATGLKGEMGKDILGRGSSKFKAMRSLARPENSVWSSVGGTEGGSCRGMQVGQGDR
EQESECTVRQGRGTPPTPRHHTVAPCSMGLDIPAVS
>gi568815578r:4684276_4901030|GENSCAN_predicted_CDS_5|651_bp
atggcctcgcagaccggaggagcagccacatgtttacagggaagccccactggcatccac
aagtgcctgacctttaggatggatgttgcagaaggaaattcagagcgcttagccactaat
cagagaggcagggcgctggagcgggacgaaaaaggaagcataaaggaagcacagactcca
gaaaaaagcagctgtgtgtgcgtgggtctgactcgctcagccattcctaagcagctcacg
tatggaacgcagacacgaagcctggctccggggcaccaggaaatagaccaagaagtcaaa
gcaaaaccaatagcaactgggatgtctcaggcttctaccaccagaggccagaccaagagg
ctcatggcccaggaagcaacaggtcttaagggtgagatggggaaggacattctaggcaga
ggcagcagcaagttcaaagccatgaggagcctggctcgaccagagaacagcgtgtggtcc
agcgtgggtggaacagaaggtgggagctgcaggggaatgcaagtgggccagggggataga
gaacaggaatctgaatgcacagtcagacagggccgtggaacacccccaacccctaggcac
catactgtggccccatgctctatggggctggatatacctgctgtcagctga
>gi568815578r:4684276_4901030|GENSCAN_predicted_peptide_6|510_aa
MAGAGVGVLALTTGPTLELPGLPLFQASAFAFLAPARAILSLDKWKCNTTDVSVANGTAE
LLHTEHIWYPRIREIQGAIIMSSLIEVVIGLLGLPGALLKYIGPLTITPTVALIGLSGFQ
AAGERAGKHWGIAMLTIFLVLLFSQYARNVKFPLPIYKSKKGWTAYKLQLFKMFPIILAI
LVSWLLCFIFTVTDVFPPDSTKYGFYARTDARQGVLLVAPWFKVPYPFQWGLPTVSAAGV
IGMLSAVVASIIESIGDYYACARLSCAPPPPIHAINRGIFVEGLSCVLDGIFGTGNGSTS
SSPNIGVLGITKVGSRRVIQCGAALMLALGMIGKFSALFASLPDPVLGALFCTLFGMITA
VGLSNLQFIDLNSSRNLFVLGFSIFFGLVLPSYLRQNPLVTGITGIDQVLNVLLTTAMFV
GGCVAFILDNTIPGTPEERGIRKWKKGVGKGNKSLDGMESYNLPFGMNIIKKYRCFSYLP
ISPTFVGYTWKGLRKSDNSRSSDEDSQATG
>gi568815578r:4684276_4901030|GENSCAN_predicted_CDS_6|1533_bp
atggcaggggccggtgttggggtgttggcactgaccacggggcccaccctggagctgcct
gggttacccctgtttcaggccagtgcttttgcatttttggcccctgctcgagccatcctg
tctttagataaatggaaatgtaacaccacagatgtttcagttgccaatggaacagcagag
ctgttgcacacagaacacatctggtatccccggatccgagagatccagggggccatcatc
atgtcctcactgatagaagtagtcatcggcctcctcggcctgcctggggctctactgaag
tacatcggtcccttgaccattacacccacggtggccctaattggcctctctggtttccag
gcagcgggggagagagccgggaagcactggggcattgccatgctgacaatattcctagta
ttactgttttctcaatacgccagaaatgttaaatttcctctcccgatttataaatccaag
aaaggatggactgcgtacaagttacagctgttcaaaatgttccctatcatcctggccatc
ctggtatcctggctgctctgcttcatcttcacggtgacagatgtcttccctcccgacagc
acaaagtatggcttctatgctcgcacagatgccaggcaaggcgtgcttctggtagccccg
tggtttaaggttccatacccatttcagtggggactgcccaccgtgtctgcggccggtgtc
atcggcatgctcagtgccgtggtcgccagcatcatcgagtctattggtgactactacgcc
tgtgcacggctgtcctgtgccccacccccccccatccacgcaataaacaggggaattttc
gtggaaggcctctcctgtgttcttgatggcatatttggtactgggaatggctctacttca
tccagtcccaacattggagttttgggaattacaaaggtcggcagccgccgcgtgatacag
tgcggagcagccctcatgctcgctctgggcatgatcgggaagttcagcgccctctttgcg
tcccttccggatcctgtgctgggagccctgttctgcacgctctttggaatgatcacagct
gttggcctctctaacctgcagttcattgatttaaattcttcccggaacctctttgtgctt
ggattttcgatcttctttgggctcgtccttccaagttacctcagacagaaccctctggtc
acagggataacaggaatcgatcaagtgttgaacgtccttctcacaactgctatgtttgta
gggggctgtgtggcttttatcctggataacaccatcccaggcactccagaggaaagagga
atccggaaatggaagaagggtgtgggcaaagggaacaaatcactcgacggcatggagtcg
tacaatttgccatttggcatgaacattataaaaaaatacagatgcttcagctacttaccc
atcagcccaacctttgtgggctacacatggaaaggcctcaggaagagcgacaacagccgg
agttcagatgaagactcccaggccacgggatag
>gi568815578r:4684276_4901030|GENSCAN_predicted_peptide_7|135_aa
HYLTCFSGTIAVPFLLADAMCVGYDQWATSQLIGTIFFCVGITTLLQTTFGCRALLLDSL
CAPHPVPSFYFHTLLRVRGLAIVGRNGDIEAHGLQHEGQRHQGDTVHPPGDLVGALEEVV
FETGPAGSKGSAQEG
>gi568815578r:4684276_4901030|GENSCAN_predicted_CDS_7|408_bp
cactacctgacatgcttcagcggcacgatcgcagtgcccttcctgttggccgatgccatg
tgtgtggggtacgaccagtgggccaccagccagctcattgggaccattttcttctgtgtg
ggaatcactactttgctacagacaacgtttggatgcagggcactgctcttggactcactt
tgtgccccacacccagtgccatctttctacttccacaccctcctcagggtgcggggcctg
gccattgtggggcgcaatggtgacatcgaagcacatggcttgcagcacgaagggcagcgc
catcagggagacacagtgcatcctccaggggacttggtgggagccctggaggaggtggtg
tttgagacaggccctgcaggctccaagggctctgcccaggagggctga