GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:26:05 Sequence gi568815578f:36478775_36691428 : 212654 bp : 48.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 17931 18292 362 1 2 111 30 275 0.764 18.94 1.02 Intr + 20814 20902 89 2 2 39 115 89 0.920 5.47 1.03 Intr + 21425 21837 413 2 2 95 98 451 0.519 40.63 1.04 Intr + 45476 45567 92 0 2 87 97 79 0.834 8.41 1.05 Intr + 47077 47232 156 0 0 95 105 268 0.998 29.41 1.06 Term + 48039 48257 219 2 0 59 53 310 0.996 21.64 1.07 PlyA + 49840 49845 6 1.05 2.00 Prom + 53660 53699 40 -5.76 2.01 Init + 62844 62912 69 2 0 51 42 54 0.134 -3.04 2.02 Intr + 65809 65896 88 0 1 65 65 77 0.634 2.64 2.03 Intr + 66085 66294 210 2 0 97 80 433 0.877 42.28 2.04 Intr + 69258 69419 162 1 0 61 69 382 0.998 33.55 2.05 Term + 70303 70475 173 2 2 129 47 404 0.999 38.39 2.06 PlyA + 75179 75184 6 1.05 3.00 Prom + 83067 83106 40 -8.36 3.01 Init + 100001 100192 192 1 0 101 80 313 0.931 30.77 3.02 Term + 112136 112657 522 1 0 60 48 372 0.452 24.68 3.03 PlyA + 115144 115149 6 1.05 4.00 Prom + 115674 115713 40 -4.66 4.01 Init + 127178 127291 114 1 0 70 102 119 0.934 11.75 4.02 Intr + 128941 129044 104 0 2 109 110 48 0.999 8.07 4.03 Intr + 130827 130956 130 0 1 70 80 143 0.280 12.40 4.04 Term + 133236 133277 42 2 0 100 45 24 0.150 -3.54 4.05 PlyA + 133759 133764 6 1.05 5.20 PlyA - 134089 134084 6 1.05 5.19 Term - 135212 135092 121 1 1 107 47 104 0.997 6.15 5.18 Intr - 135663 135531 133 1 1 27 103 117 0.665 6.80 5.17 Intr - 136600 136451 150 2 0 128 60 70 0.653 8.33 5.16 Intr - 153924 153821 104 2 2 118 105 129 0.945 17.42 5.15 Intr - 154855 154769 87 0 0 91 92 79 0.995 7.69 5.14 Intr - 155815 155716 100 2 1 72 80 117 0.992 8.47 5.13 Intr - 161627 161593 35 1 2 52 64 34 0.109 -4.73 5.12 Intr - 162604 162471 134 1 2 108 96 82 0.388 10.44 5.11 Intr - 174927 174758 170 1 2 91 22 104 0.183 3.77 5.10 Intr - 176065 176027 39 2 0 85 82 50 0.656 2.30 5.09 Intr - 177637 177586 52 1 1 91 79 60 0.992 3.88 5.08 Intr - 177758 177723 36 2 0 124 110 9 0.913 5.16 5.07 Intr - 186323 186272 52 1 1 106 103 -4 0.968 1.81 5.06 Intr - 186527 186462 66 1 0 78 106 44 0.881 3.22 5.05 Intr - 187618 187515 104 1 2 81 15 133 0.820 4.27 5.04 Intr - 192623 192567 57 2 0 73 111 26 0.009 2.48 5.03 Intr - 202128 202042 87 0 0 81 106 48 0.502 6.07 5.02 Intr - 203804 203744 61 1 1 78 92 41 0.741 2.24 5.01 Init - 205685 205639 47 2 2 89 100 26 0.718 4.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 142294 142551 258 0 0 81 48 230 0.846 13.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:36478775_36691428|GENSCAN_predicted_peptide_1|443_aa XSSCLVAYKKTPPPVPPRTTSKPFISVTVQSSTESAQDTYLDSQDHKSEVTSQSGLSNSS DSLDSSTRPPSVTRGGVAPAPEAPEPPPKHAALKSEQGTLTSSESHPEAAPKRKLSSIGI QVDCIQPVPKEEPSPATKFQSIGVQVEDDWRSSVPSHSMSSRRDTDSDTQDANDSSCKSS ERSLPDCTPHPNSISIDAGPRQAPKIAQIKRNLSYGDNSDPALEASSLPPPDPWLETSSS SPAEPAQPGACRRDGYWFLKLLQAETERLEGWCCQMDKETKENNLSEEVLGKVLSAVGSA QLLMSQKFQQFRGLCEQNLNPDANPRPTAQDLAGFWDLLQLSIEDISMKFDELYHLKANS WQLVETPEKRKEEKKPPPPVPKKPAKSKPAVSRDKASDASDKQRQEARKRLLAAKRAASV RQNSATESADSIEIYVPEAQTRL >gi568815578f:36478775_36691428|GENSCAN_predicted_CDS_1|1332_bp ngttcatcatgcctagtggcgtataagaagaccccgccaccggtccctccacgcaccact tcaaagccgttcatctcagtcacagtccagagcagtactgagtctgcccaggacacctac ctggacagccaggaccacaagagcgaggtgactagccagtcgggcctgagcaactcgtcg gacagcctggacagcagtacccgaccgcccagcgtgacacggggtggagtcgccccagcc cctgaggccccagagccacccccaaaacatgcagctctgaaaagtgaacaagggacgctg accagctctgagtcccaccccgaggccgcccccaaaaggaaactgtcatcgataggaata caagttgactgcattcagccagtgccaaaagaggagcccagtcccgctaccaaattccag tccatcggggttcaggtagaggacgactggcgaagcagcgtcccctctcacagtatgtcc tcccgacgggacacagactcggatacccaggatgccaatgactcaagctgtaagtcatct gagaggagcctcccggactgtacccctcaccccaactccatcagcatcgatgccggtccc cggcaggcccccaagattgcccagatcaagcgcaacctctcctatggagacaacagcgac cctgccctagaggcgtcctcgctgcccccacccgacccctggctcgagacctcctccagc tccccagcagagccggcacagccaggggcctgccgccgagacggctactggttcctaaag ctactgcaggcagaaacagagcggctggaaggctggtgctgccagatggacaaggagacc aaagagaacaacctctctgaagaagtcttaggaaaagtcctcagtgctgtgggcagtgcc cagctactgatgtcccagaaattccagcagttccggggcctctgtgagcaaaacttgaac cctgatgccaacccacgccccacagcccaggacctggcagggttctgggacctgctacag ctgtccatcgaggatatcagcatgaagttcgatgaactctaccacctcaaggccaacagc tggcagctggtggagacccccgagaagaggaaggaagagaagaaaccaccccctccggtc ccaaagaagccagccaaatccaagccggcagtgagccgcgacaaggcctcagacgccagc gacaagcagcgccaggaggcccgcaagagactcctggcggccaagcgggcagcttctgtg cggcagaactcagccaccgagagcgcagacagcatcgagatttatgtcccggaggcccag accaggctctga >gi568815578f:36478775_36691428|GENSCAN_predicted_peptide_2|233_aa MRGGPGPTRLSRGAKAGPTDWDVELSGTESVNPYCWALLMSPVPDEDSRAQRGKPHPPEA KMSSKRAKAKTTKKRPQRATSNVFAMFDQSQIQEFKEAFNMIDQNRDGFIDKEDLHDMLA SLGKNPTDEYLEGMMSEAPGPINFTMFLTMFGEKLNGTDPEDVIRNAFACFDEEASGFIH EDHLRELLTTMGDRFTDEEVDEMYREAPIDKKGNFNYVEFTRILKHGAKDKDD >gi568815578f:36478775_36691428|GENSCAN_predicted_CDS_2|702_bp atgagaggtggcccgggcccgacccggctgtcacggggggcaaaggcgggccccacggac tgggacgtggagctctcgggcactgagtcagttaatccctactgctgggcactcctcatg tctccagtcccagatgaagactctcgggctcagagagggaagccccacccaccagaagcc aagatgtccagcaagcgggccaaagccaagaccaccaagaagcggccacagcgggccaca tccaatgtcttcgcaatgtttgaccagtcccagatccaggagtttaaggaggctttcaac atgattgaccagaaccgtgatggcttcattgacaaggaggacctgcacgacatgctggcc tcgctggggaagaaccccacagacgaatacctggagggcatgatgagcgaggccccgggg cccatcaacttcaccatgttcctcaccatgtttggggagaagctgaacggcacggacccc gaggatgtgattcgcaacgcctttgcctgcttcgacgaggaagcctcaggtttcatccat gaggaccacctccgggagctgctcaccaccatgggtgaccgcttcacagatgaggaagtg gacgagatgtaccgggaggcacccattgataagaaaggcaacttcaactacgtggagttc acccgcatcctcaaacatggcgccaaggataaagacgactag >gi568815578f:36478775_36691428|GENSCAN_predicted_peptide_3|237_aa MSDSDLGEDEGLLSLAGKRKRRGNLPKESVKILRDWLYLHRYNAYPSEQEKLSLSGQTNL SVLQICNWFINARRRLLPDMLRKDGKDPNQFTISRRGGKASDVALPRGSSPSVLAVSVPA PTNVLSLSVCSMPLHSGQGEKPAAPFPRGELESPKPLVTPGSTLTLLTRAEAGSPTGGLF NTPPPTPPEQDKEDFSSFQLLVEVALQRAAEMELQKQQDPSLPLLHTPIPLVSENPQ >gi568815578f:36478775_36691428|GENSCAN_predicted_CDS_3|714_bp atgtcggacagtgatctaggtgaggacgaaggcctcctctccctggcgggcaaaaggaag cgcagggggaacctgcccaaggagtcggtgaagatcctccgggactggctgtacttgcac cgctacaacgcctacccctcagagcaggagaagctgagcctttctggacagaccaacctg tcagtgctgcaaatatgtaactggttcatcaatgcccggcggcggcttctcccagacatg cttcggaaggatggcaaagaccctaatcagtttaccatttcccgccgcgggggtaaggcc tcagatgtggccctcccccgtggcagcagcccctcagtgctggctgtgtctgtcccagcc cccaccaatgtgctctccctgtctgtgtgctccatgccgcttcactcaggccagggggaa aagccagcagcccctttcccacgtggggagctggagtctcccaagcccctggtgacccct ggtagcacacttactctgctgaccagggctgaggctggaagccccacaggtggactcttc aacacgccaccacccacacccccagagcaggacaaagaggacttcagcagcttccagctg ctggtggaggtggcgctacagagggctgctgagatggagcttcagaagcagcaggaccca tcactcccattactgcacactcccatccctttagtctctgaaaatccccagtag >gi568815578f:36478775_36691428|GENSCAN_predicted_peptide_4|129_aa MSGGRRKEEPPQPQLANGALKVSVWSKVLRSDAAWEDKDEFLDVIYWFRQIIAVVLGVIW GVLPLRGFLGIAGFCLINAGVLYLYFSNYLQIDEEEYGGTWELTKEGFMTSFALFMVIWI IFYTAIHYD >gi568815578f:36478775_36691428|GENSCAN_predicted_CDS_4|390_bp atgagcggcgggcggcggaaggaggagccgcctcagccgcagctggccaacggggccctc aaagtctccgtctggagtaaggtgctgcggagcgacgcggcctgggaggataaggatgaa tttttagatgtgatctactggttccgacagatcattgctgtggtcctgggtgtcatttgg ggagttttgccattacgagggttcttgggaatagcaggattctgcctgatcaatgcagga gtcctgtacctctacttcagcaattacctacagattgatgaggaagaatatggtggcacg tgggagctcacgaaggaagggtttatgacctcttttgccttgttcatggtcatttggatc atcttttacactgccatccattatgactga >gi568815578f:36478775_36691428|GENSCAN_predicted_peptide_5|544_aa MDELAEMLPPVLTHLSLKSIIGIGVGAGAYILSRFALNHPELVEGLVLINVDPCAKGWID WAASKLSGLTTNVVDIILAHHFGQEELQANLDLIQTYRMHIAQDINQDNLQLFLNSYNGR RDLEIERPILGQNDNKSKTLKCSTLLVVGDNSPAVEAVMADCGGLPQVVQPGKLTEAFKY FLQGMGYIPYVQLSHLSTESVPSASMTRLARSRTHSTSSSLGSGESPFSRSVTSNQSDGT QESCESPDVLDRHQTMEISLDDVLLSALLRNNGKSAQQKKISAKPKLEFLCPRPGTCDHG SRKFCYTVLVDPRERSKATAVALGSFPAGGPAELSLRLGEPLTIVSEDGDWWTVLSEVSG REYNIPSVHVAKVSHGWLYEGLSREKAEELLLLPGNPGGAFLIRESQTRRGSYSLSVRLS RPASWDRIRHYRIHCLDNGWLYISPRLTFPSLQALVDHYSELADDICCLLKEPCVLQRAG PLPGKDIPLPVTVQRTPLNWKELDSSLLFSEAATGEESLLSEGLRESLSFYISLNDEAVS LDDA >gi568815578f:36478775_36691428|GENSCAN_predicted_CDS_5|1635_bp atggatgagctggctgaaatgctgcctcctgttcttacccacctaagcctgaaaagcatc attggaattggagttggagctggagcttacatcctcagcagatttgcactcaaccatcca gagcttgtggaaggccttgtgctcattaatgttgacccttgcgctaaaggctggattgac tgggcagcttccaaactctctggcctgacaaccaatgttgtggacattattttggctcat cactttgggcaggaagagttacaggccaacctggacctgatccaaacctacagaatgcat attgcccaagacatcaaccaagacaacctgcagctcttcttgaattcctacaatggacgc agagacctggagatcgaaagacccatactgggccaaaatgataacaaatcaaaaacatta aagtgttctactttactggtggtaggggacaattcgcctgcagttgaggctgtgatggcg gactgtgggggactgccccaggtagttcagcctgggaagctcaccgaggccttcaagtac tttttgcagggaatgggctacatcccgtatgtgcagctcagtcacctgagcaccgagtca gtaccatctgccagcatgactcggctcgcccgatcacgaacccactcaacctcgagtagc ctcggctctggagaaagtcccttcagccggtctgtcaccagcaatcagtcagatggaact caagaatcctgtgagtcccctgatgtcctggacagacaccagaccatggagatttccctc gatgatgtgcttctgagtgctctgctgaggaacaatgggaagtctgcccagcagaagaaa atctctgccaagcccaagcttgagttcctctgtccaaggccagggacctgtgaccatgga agcagaaagttctgctatactgtcctagtggacccgagagagagaagcaaggccacagcc gtggccctgggcagtttcccggcaggtggcccggccgagctgtcgctgagactcggggag ccattgaccatcgtctctgaggatggagactggtggacggtgctgtctgaagtctcaggc agagagtataacatccccagcgtccacgtggccaaagtctcccatgggtggctgtatgag ggcctgagcagggagaaagcagaggaactgctgttgttacctgggaaccctggaggggcc ttcctcatccgggagagccagaccaggagaggctcttactctctgtcagtccgcctcagc cgccctgcatcctgggaccggatcagacactacaggatccactgccttgacaatggctgg ctgtacatctcaccgcgcctcaccttcccctcactccaggccctggtggaccattactct gagctggcggatgacatctgctgcctactcaaggagccctgtgtcctgcagagggctggc ccgctccctggcaaggatatacccctacctgtgactgtgcagaggacaccactcaactgg aaagagctggacagctccctcctgttttctgaagctgccacaggggaggagtctcttctc agtgagggtctccgggagtccctcagcttctacatcagcctgaatgacgaggctgtctct ttggatgatgcctag