GENSCAN 1.0	Date run:  6-Nov-116	Time: 15:43:48

Sequence gi568815589r:114690455_114906012 : 215558 bp : 41.97% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +    842   1104  263  0  2   81   49   179 0.921   8.00
 1.02 PlyA +   1923   1928    6                               1.05

 2.03 PlyA -   2072   2067    6                               1.05
 2.02 Term -  10076   9816  261  2  0   99   48   177 0.432   9.34
 2.01 Init -  20664  20599   66  0  0   73   27    97 0.065   3.22
 2.00 Prom -  25451  25412   40                              -4.75

 3.10 PlyA -  25537  25532    6                               1.05
 3.09 Term -  30199  29701  499  0  1   54   38   176 0.174   2.21
 3.08 Intr -  30775  30477  299  1  2   27   53   152 0.240   0.35
 3.07 Intr -  31331  31166  166  1  1    0   64   122 0.083   0.04
 3.06 Intr -  48340  48231  110  1  2   69   71    80 0.256   2.56
 3.05 Intr -  55758  55461  298  2  1  102   36   116 0.607   3.75
 3.04 Intr -  63290  63185  106  0  1   81   95    75 0.582   5.85
 3.03 Intr -  66200  66139   62  1  2   86   72    83 0.493   3.96
 3.02 Intr -  66593  66504   90  1  0   70   46    83 0.388   0.49
 3.01 Init -  72795  72716   80  0  2   99   64    71 0.673   6.38
 3.00 Prom -  81442  81403   40                              -5.25

 4.00 Prom +  81485  81524   40                              -5.55
 4.01 Init +  86310  86414  105  2  0   98   43   109 0.440   7.67
 4.02 Intr +  90771  90935  165  2  0   33   86   137 0.501   7.24
 4.03 Term +  93881  93895   15  1  0  113   50     0 0.366  -4.04
 4.04 PlyA +  94503  94508    6                               1.05

 5.03 PlyA -  94537  94532    6                               1.05
 5.02 Term - 100452  99998  455  1  2  100   49   354 0.987  27.03
 5.01 Init - 102076 101953  124  1  1   89  115     8 0.827   3.98
 5.00 Prom - 109803 109764   40                              -3.85

 6.12 PlyA - 109944 109939    6                               1.05
 6.11 Term - 112394 112212  183  2  0   70   42   115 0.778   1.66
 6.10 Intr - 115413 115349   65  1  2  135  115    43 0.617   9.22
 6.09 Intr - 115590 115436  155  2  2   53    1   198 0.622   6.39
 6.08 Intr - 119001 118800  202  1  1   38   61   168 0.725   6.52
 6.07 Intr - 121021 120923   99  2  0   68   95    91 0.730   6.96
 6.06 Intr - 123287 123049  239  1  2   63   85    77 0.438   1.24
 6.05 Intr - 127191 127118   74  0  2   99   66    76 0.744   3.79
 6.04 Intr - 128782 128566  217  0  1   50   68   162 0.817   7.98
 6.03 Intr - 137687 137595   93  1  0   88   73    54 0.266   2.06
 6.02 Intr - 145398 144953  446  0  2   62   93   194 0.821   8.57
 6.01 Init - 149501 149406   96  2  0   97   92    79 0.545   9.56
 6.00 Prom - 154059 154020   40                              -6.55

 7.03 PlyA - 154785 154780    6                               1.05
 7.02 Term - 155200 154978  223  0  1   85   54   157 0.026   7.41
 7.01 Init - 175188 175115   74  0  2   64   58   111 0.351   6.29
 7.00 Prom - 175764 175725   40                              -7.45

 8.00 Prom + 176845 176884   40                              -8.35
 8.01 Sngl + 180265 180729  465  0  0   77   48   324 0.874  23.19
 8.02 PlyA + 181149 181154    6                               1.05

 9.00 Prom + 181458 181497   40                              -1.45
 9.01 Init + 185175 185225   51  2  0   68   99    44 0.438   4.72
 9.02 Term + 185607 185690   84  2  0  114   42    40 0.219  -1.33
 9.03 PlyA + 187721 187726    6                               1.05

10.05 PlyA - 187749 187744    6                               1.05
10.04 Term - 193019 192909  111  2  0   59   47   104 0.366   0.98
10.03 Intr - 194644 194483  162  1  0   16   27   149 0.399   0.85
10.02 Intr - 198387 198262  126  0  0  110   74    44 0.788   5.16
10.01 Init - 199679 199602   78  2  0  -11   65   101 0.508  -0.99
10.00 Prom - 200848 200809   40                              -2.15

11.03 PlyA - 201841 201836    6                               1.05
11.02 Term - 213871 213477  395  2  2   96   47   198 0.812  10.71
11.01 Intr - 215445 215374   72  1  0  102   86    28 0.712   2.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:114690455_114906012|GENSCAN_predicted_peptide_1|87_aa
XKLHSAIICCGAKLESITLRLLGQHWKGLRFRRGGARPFLRLSPPIDGPRNSVTVQVISQ
EHLVGIPLGAVFCASLLEEKEEGAAAV

>gi568815589r:114690455_114906012|GENSCAN_predicted_CDS_1|264_bp
ntgaaacttcacagtgccattatttgctgtggagcgaagctagagagcatcactctgagg
cttcttgggcagcattggaaagggcttaggttccgaagaggtggtgcccgtccctttcta
aggctctctcccccaatagatggcccaaggaacagtgttacagtgcaggtcatctctcag
gagcacttggtgggcattcctcttggtgctgtgttttgtgcttcccttttggaagagaag
gaggaaggagccgccgcagtgtag

>gi568815589r:114690455_114906012|GENSCAN_predicted_peptide_2|108_aa
MAADDLLTSYTEVLAVGIEKQMEGAHEQMGAGTREKKAGTPSHSFQQEQALYRPHGCIKA
YYNSLLALPSRMGCLQPLSPRGHVTSSVALLPHRIEQLPPANKGKGPV

>gi568815589r:114690455_114906012|GENSCAN_predicted_CDS_2|327_bp
atggcagcagatgacctgttgacatcctacacagaggtcctggcagtgggaatagagaaa
cagatggagggagcacatgagcagatgggtgcaggaaccagagaaaaaaaggctggaacc
cccagccattctttccagcaggagcaggccctgtataggcctcatggctgcatcaaagca
tattacaattctcttttagctctgccatccaggatggggtgcctgcaacccctgagcccc
agagggcatgttaccagctcagtggcccttttgcctcatcgtatagagcagctgccccct
gccaacaagggcaaagggccagtgtga

>gi568815589r:114690455_114906012|GENSCAN_predicted_peptide_3|569_aa
MARKKREHKINKSDLMGVCLSFHQGNRAWCLDAGPFVEAVYGSLKGSLGIALYPVPWSSD
SKGFRSEIIGPIQCVEEENPGSHMLKMTEALSVGFPESLHGAECPWPLLLEFRKSAPRMD
AGIVWMLLTPWRSSQDQDCSPPNKWYHIWLCSSASVLLKVLPLLCTTLLSLIDMSDPAKK
HPPATQHKPPLLGCSLHLEEGLPFQILVQTSRGQGPSLCHETPNLHVCGSYPYIPEPLTQ
LSHAVSSSRVHVDIAVNTTTELLPLRGFKLSGGVDNKQANLCEQIVTDRAVSRGNRTTAI
AGRQQTYILMGGGGERTRRESSSYQNPPTKGRDEPSKWTRRMRECRAWFPKPHGGQSKVI
GLDSVRKDNKPTRLDLVSGKGKDKEMLLSQDCPEVSPSPDLLVQVTKQPCDKLATGSKTS
SQSPGAFQITWGYIPHWKWIPWHQVYKYRCHVSELLGVMPSPGARQNWVHVLPLVPVSCV
ALGKSGKLPEPPCPHPYSKVMTAPTHRLFDITEHVLGTEAGPSQVQLCCSHSQWAGSELT
ELGAASPYHGNNPNPGSLTRVIATLWAPV

>gi568815589r:114690455_114906012|GENSCAN_predicted_CDS_3|1710_bp
atggcacgcaaaaagagagaacacaaaatcaacaagtcagatctcatgggcgtctgctta
tcgttccaccagggcaacagggcctggtgcttagatgcaggtccttttgtagaggcagtt
tatggttctctgaagggctctctggggattgccctctacccagtcccctggtcatcggat
tcaaaaggcttcagaagtgagataattggcccaattcaatgtgttgaagaggagaaccct
ggaagccacatgctgaagatgacagaggctctttcagtcggattccctgaatcactgcat
ggagcagagtgcccctggcccctgctgttggagtttaggaagagtgcccccaggatggat
gcaggaattgtctggatgcttctcactccatggaggagcagccaggaccaagactgctcc
ccgcccaacaagtggtaccacatctggctctgcagctctgcctctgtgcttctgaaagtg
ctgcctcttctctgcaccaccctcctctccctaattgacatgagtgacccagcaaagaag
catcctccagcgacccagcacaaacctcctctgcttgggtgctcccttcatcttgaggaa
ggcctccccttccagatattagtccagacaagcaggggccaaggtccttctctctgccat
gaaacaccaaaccttcatgtttgtggttcatatccctacatccctgagcccttgacccag
ctcagccatgctgtcagcagcagcagagtccatgtagacatcgctgtaaatacgactaca
gagttgctgcccttgcgaggctttaagcttagtgggggcgtggataacaagcaagcaaac
ctgtgtgagcagattgtgactgacagggcagtgagcaggggaaacagaacaacagccatc
gcagggaggcagcaaacttacattctaatgggaggaggaggggaaagaacaaggagagaa
tcttctagttaccaaaatccacccacaaaaggaagagatgagccatctaagtggaccagg
agaatgcgcgagtgtagagcctggtttccaaaaccccatgggggacaatcaaaagtcatt
ggattagattctgttagaaaagataataaaccaactagactagatctggtgtccgggaaa
gggaaggacaaggagatgttgctgagccaagattgcccagaagtgtctccatctcctgac
ctcttggtacaggtcaccaagcaaccctgtgacaagttggctacagggagcaaaacatcc
tcccaaagccctggagcattccagatcacatgggggtacattcctcactggaaatggatc
ccttggcatcaggtttataaatatagatgccatgtctctgagttgcttggtgtgatgcca
agtcctggagccagacagaactgggttcatgttctacctctagtccctgtaagctgtgtt
gccctgggcaagtcagggaagctccctgaaccgccatgtccccatccatacagtaaggtg
atgacagctcccacccacagactctttgacatcactgagcatgtgcttggcacagaggca
ggacccagtcaagtgcagctgtgttgctctcattctcagtgggctggttctgaactcaca
gagctgggggccgccagcccttaccatggcaacaacccgaaccctggcagcctcaccagg
gtaattgccactttgtgggcaccagtctga

>gi568815589r:114690455_114906012|GENSCAN_predicted_peptide_4|94_aa
MVEGEGEADTSSYGGAGERPRAKGEVLHTFKQPDLSKLGTIGLKGNEIHMEFIKYLSSGA
ATAELTGKSFGDGASWFPGVDAQAEEEGLWGCSM

>gi568815589r:114690455_114906012|GENSCAN_predicted_CDS_4|285_bp
atggtggaaggcgaaggcgaagctgacacatcttcatatggtggagcaggagagagaccg
agagcgaagggggaggtgctacacactttcaaacaaccagatctctcaaaattggggact
attgggctgaaaggtaatgagatacacatggagttcatcaagtatctttcctccggagca
gccacagcagagcttactgggaaaagctttggtgatggtgcttcatggttccctggggta
gatgcacaggctgaagaggaggggctttggggatgttcaatgtga

>gi568815589r:114690455_114906012|GENSCAN_predicted_peptide_5|192_aa
MQLTKGRLHFSHPLSHTKHISPFVTDAPLRADGDKPRAHLTVVRQTPTQHFKNQFPALHW
EHELGLAFTKNRMNYTNKFLLIPESGDYFIYSQVTFRGMTSECSEIRQAGRPNKPDSITV
VITKVTDSYPEPTQLLMGTKSVCEVGSNWFQPIYLGAMFSLQEGDKLMVNVSDISLVDYT
KEDKTFFGAFLL

>gi568815589r:114690455_114906012|GENSCAN_predicted_CDS_5|579_bp
atgcaactcacaaagggccgtcttcatttcagtcaccctttgtctcatacaaagcacatt
tctccttttgttacagatgcacctcttagagcagacggagataagccaagggcacacctg
acagttgtgagacaaactcccacacagcactttaaaaatcagttcccagctctgcactgg
gaacatgaactaggcctggccttcaccaagaaccgaatgaactataccaacaaattcctg
ctgatcccagagtcgggagactacttcatttactcccaggtcacattccgtgggatgacc
tctgagtgcagtgaaatcagacaagcaggccgaccaaacaagccagactccatcactgtg
gtcatcaccaaggtaacagacagctaccctgagccaacccagctcctcatggggaccaag
tctgtatgcgaagtaggtagcaactggttccagcccatctacctcggagccatgttctcc
ttgcaagaaggggacaagctaatggtgaacgtcagtgacatctctttggtggattacaca
aaagaagataaaaccttctttggagccttcttactatag

>gi568815589r:114690455_114906012|GENSCAN_predicted_peptide_6|622_aa
MGDRGAMSSRAAGSCQRVHTNRAHQQGSRMPWVHSGWQQVNATLGQSFQRKEQAAIVDVS
QPSLMIPPGTKNTEATRVWSRSPANHSHPTEEWPDCKKKKQTENNNIDKNDLTKTPFKGQ
QPQKSKVDKPTNMWKIQHKNAENSKSQSTSSPPHDHNTSPARAQNWAEAEMAELTEVGFK
RMGTAPTYKCGYEVDEIMYSSWHCELPTIIKIGENGNPERKSGLCLEEHLDGEEEDPANC
KLAVWNLNSKDVSVSFLLFVQGELLGQLFSNLFQGERGQSVVPWPSTDTQCLLNEELSTN
STSGSLIASPKHPASPLLKPYLLLFPHICPWTYLMAPAQDPVGVLALTPAAYTFQREQST
VPATRNLGSTSAPALPLVSPFWVCFKIEGLCHCDDGATVKLLVPHTYLLLNICEGFFRVY
ALVSKWVSQTSGCSQHSKQDDLFKVKSEPATSPPITFQRTPTPLTTTSLPIKLSEEEAKS
YLDPGGHTRSQRCLQEQQEHGRGSGTELWGNSQCGNAARARQLQAQGQEQQRTLGSHLLP
GLTTYLLVSQLRAQGEACVQFQAAIFVLFQVPGGHTMDQRKKEISNTMEKTITTTIPVTE
PHPSPIEELQMANLMPPTESPI

>gi568815589r:114690455_114906012|GENSCAN_predicted_CDS_6|1869_bp
atgggtgacaggggagccatgagcagcagggcagcagggagctgccagcgagtgcacacc
aacagggcacaccaacagggtagcaggatgccatgggtacattcaggctggcaacaggtc
aatgctaccctgggacagagcttccagaggaaggaacaggctgccattgttgatgtttca
cagccttcactgatgatacctccaggtacaaaaaacactgaggcaactagggtctggagc
agatccccagcaaaccacagccaccctacagaagagtggcctgactgtaaaaagaaaaaa
caaacagaaaacaacaacatcgacaaaaatgacctcacaaaaaccccattcaaaggtcaa
caacctcaaaaatctaaggtagataagcccacaaatatgtggaagattcaacacaaaaat
gctgaaaactcaaaaagccagagtacctcttctccaccacatgaccacaacacatctcca
gcaagagcacagaactgggctgaggctgagatggctgaattgacagaagtaggcttcaaa
agaatggggacagcacctacctacaagtgtggttatgaggttgatgagataatgtattcc
agctggcattgtgaactgccaaccattataaaaataggagaaaatggaaatccagaaagg
aaaagcggcttgtgtctcgaggaacatttggatggagaggaagaagaccctgcaaattgc
aagctggctgtctggaaccttaacagtaaagatgtttctgtatccttcttgctctttgtt
cagggagagttgctggggcaactattttctaacctctttcaaggggaaagaggccagtca
gtggtgccctggccttccacagatactcagtgtctgttgaatgaagaactgagcaccaac
tccacatcaggctctctgattgccagccctaaacaccctgctagccctcttctaaagcca
tatctcctgctcttcccccacatttgcccctggacctatctcatggctcctgctcaagac
ccggtaggcgtcttggctctgacccctgctgcctacaccttccagagagaacagagcaca
gtcccagcaactaggaacctgggctcaacttcagcaccagccctgccacttgtgtcaccc
ttctgggtctgtttcaaaattgaaggtttgtgccactgtgatgacggtgctacggttaag
ctccttgttcctcatacatacctcctgttgaacatatgtgagggtttctttcgggtatat
gccttggtctccaaatgggtctcccagacttcaggctgttctcaacacagcaaacaggat
gatcttttcaaagtaaaatcagagcctgccacctctccgcccatcaccttccaaagaacc
cctactccactcaccaccacctctcttcccatcaaactcagtgaagaagaagccaagtcc
tacctggacccaggaggacacacacggagtcagaggtgcctccaggagcagcaggagcat
ggccgaggatctgggactgagctttggggaaacagccagtgtggaaatgctgccagagca
cggcagctgcaggcccaaggccaggagcagcagcgcacgctgggctctcacctgctgcct
ggactcaccacatacctgcttgtcagccagctccgggcccagggagaggcctgtgtgcag
ttccaggcagccatatttgtgctgttccaagtgccaggtggccacaccatggatcaaagg
aagaaggagatcagtaataccatggagaaaactattaccacaactataccagttactgag
cctcacccctctcccattgaagaacttcagatggcaaatcttatgccacccacagagtca
cccatctag

>gi568815589r:114690455_114906012|GENSCAN_predicted_peptide_7|98_aa
MSDGFLRSSPEGDAGAMLLVQPAEPIISIFHLWLFLQQLGLSGGHLELPDSAAAVVVVAP
WTVKGSASGRRRKRAAVHVPIVPFFVFMCTQCLAPTYE

>gi568815589r:114690455_114906012|GENSCAN_predicted_CDS_7|297_bp
atgagtgacggcttcctgaggtcctcaccagaaggagatgctggtgccatgcttcttgta
cagcctgcagaaccaattatcagcatcttccatctctggctgttcctgcagcaactgggt
ctcagtggcggccatcttgaacttcctgactccgctgccgctgtggtggtggtggctccc
tggaccgtgaaaggcagtgctagtggaagaagaaggaaaagagctgccgtacatgtgcct
attgttcccttctttgtgttcatgtgtactcaatgtttagctcccacttatgaatga

>gi568815589r:114690455_114906012|GENSCAN_predicted_peptide_8|154_aa
MGRNQHKKAENSKNQNASSPPKDHNSSPARKQNWTENEFDKSTEVGFRRWIINSSKLKEC
VLTQCKEAKNLEKRLDELLTRITSLEKNLNDLMELQNTAPELREAYTSINSQIDQEEERI
SEIEDQLNEIKREDKIREKRMKGVNKASKKYGTM

>gi568815589r:114690455_114906012|GENSCAN_predicted_CDS_8|465_bp
atggggagaaaccagcacaaaaaggctgaaaattccaaaaaccagaatgcctcttctcct
ccaaaggatcacaactcctcgccagcaaggaaacaaaactggacagagaatgagtttgac
aaatcgacagaagtaggcttcagaaggtggataataaactcctccaagctaaaggagtgt
gttctaacccaatgcaaggaagctaagaaccttgaaaaaaggttagacgaattgctaact
agaataaccagtttagagaagaacctaaatgacctgatggagctgcaaaacacagcacca
gaacttcgtgaagcatacacaagtatcaatagccaaatcgatcaagaggaagaaaggata
tcagagattgaagatcaacttaatgaaataaagcgagaagacaagattagagaaaaaaga
atgaaaggagtgaacaaagcctccaagaaatatgggactatgtga

>gi568815589r:114690455_114906012|GENSCAN_predicted_peptide_9|44_aa
MISFDFMSHIKVTLMQEDQHHIEAAKAWGLHPLKPWSELYCGSF

>gi568815589r:114690455_114906012|GENSCAN_predicted_CDS_9|135_bp
atgatctcctttgacttcatgtctcacatcaaggttacgctgatgcaagaggatcaacac
cacatagaagcagccaaggcttggggcttacaccctctgaagccatggtctgagctgtac
tgtggctccttttga

>gi568815589r:114690455_114906012|GENSCAN_predicted_peptide_10|158_aa
MFVQELEALQAAGGKASQACILPFRVHPEKLSTPRLLQGGGGVTSAIQSCFFYLLSASFS
NLKLEPGTKTPETSDQQLPGVGKVDSSMCMQTAQPKGRIKGEVTQDPGSMPTYKAKVKHK
TKPWKQGEEQPVCNFRQGVTKTKLKQLTMDTSKEFRRT

>gi568815589r:114690455_114906012|GENSCAN_predicted_CDS_10|477_bp
atgtttgttcaagaacttgaagctctacaagcagctggtggcaaagccagccaggcctgc
atccttcccttcagggtgcacccagagaaactctccacaccacggctgctgcagggtggg
ggaggggtgacatcagcaattcagagctgttttttctatctcctgagtgcctctttcagc
aatttgaagttagaaccaggtactaaaacacctgaaactagtgatcagcagcttcctgga
gttgggaaagtggactcaagcatgtgcatgcagacagcccaacccaagggaagaatcaaa
ggagaagtgacgcaagaccccggaagtatgccaacatataaagccaaagttaaacataaa
accaaaccttggaaacaaggtgaagaacagcctgtatgcaacttcagacagggggttacc
aaaacaaagctgaagcagttgaccatggacacctcaaaagaatttcggagaacctga

>gi568815589r:114690455_114906012|GENSCAN_predicted_peptide_11|155_aa
XNCSEDLLCILKRAPFKKSWAYLQVAKHLNKTKLSWNKDGILHGVRYQDGNLVIQFPGLY
FIICQLQFLVQCPNNSVDLKLELLINKHIKKQALVTVCESGMQTKHVYQNLSQFLLDYLQ
VNTTISVNVDTFQYIDTSTFPLENVLSIFLYSNSD

>gi568815589r:114690455_114906012|GENSCAN_predicted_CDS_11|468_bp
ngaaattgctcagaagacctcttatgtatcctgaaaagggctccattcaagaagtcatgg
gcctacctccaagtggcaaagcatctaaacaaaaccaagttgtcttggaacaaagatggc
attctccatggagtcagatatcaggatgggaatctggtgatccaattccctggtttgtac
ttcatcatttgccaactgcagtttcttgtacaatgcccaaataattctgtcgatctgaag
ttggagcttctcatcaacaagcatatcaaaaaacaggccctggtgacagtgtgtgagtct
ggaatgcaaacgaaacacgtataccagaatctctctcaattcttgctggattacctgcag
gtcaacaccaccatatcagtcaatgtggatacattccagtacatagatacaagcaccttt
cctcttgagaatgtgttgtccatcttcttatacagtaattcagactga