GENSCAN 1.0	Date run: 5-Nov-116	Time: 14:45:21

Sequence gi568815578r:36513869_36741335 : 227467 bp : 47.83% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  10382  10473   92  0  2   87   97    79 0.923   8.41
 1.02 Intr +  11983  12138  156  0  0   95  105   268 0.998  29.41
 1.03 Term +  12945  13163  219  2  0   59   53   310 0.996  21.64
 1.04 PlyA +  14746  14751    6                               1.05

 2.00 Prom +  18566  18605   40                              -5.76
 2.01 Init +  27750  27818   69  2  0   51   42    54 0.134  -3.04
 2.02 Intr +  30715  30802   88  0  1   65   65    77 0.634   2.64
 2.03 Intr +  30991  31200  210  2  0   97   80   433 0.877  42.28
 2.04 Intr +  34164  34325  162  1  0   61   69   382 0.998  33.55
 2.05 Term +  35209  35381  173  2  2  129   47   404 0.999  38.39
 2.06 PlyA +  40085  40090    6                               1.05

 3.00 Prom +  47973  48012   40                              -8.36
 3.01 Init +  64907  65098  192  1  0  101   80   313 0.931  30.77
 3.02 Term +  77042  77563  522  1  0   60   48   372 0.452  24.68
 3.03 PlyA +  80050  80055    6                               1.05

 4.00 Prom +  80580  80619   40                              -4.66
 4.01 Init +  92084  92197  114  1  0   70  102   119 0.934  11.75
 4.02 Intr +  93847  93950  104  0  2  109  110    48 0.999   8.07
 4.03 Intr +  95733  95862  130  0  1   70   80   143 0.280  12.40
 4.04 Term +  98142  98183   42  2  0  100   45    24 0.150  -3.54
 4.05 PlyA +  98665  98670    6                               1.05

 5.20 PlyA -  98995  98990    6                               1.05
 5.19 Term - 100118  99998  121  1  1  107   47   104 0.997   6.15
 5.18 Intr - 100569 100437  133  1  1   27  103   117 0.665   6.80
 5.17 Intr - 101506 101357  150  2  0  128   60    70 0.653   8.33
 5.16 Intr - 118830 118727  104  2  2  118  105   129 0.945  17.42
 5.15 Intr - 119761 119675   87  0  0   91   92    79 0.995   7.69
 5.14 Intr - 120721 120622  100  2  1   72   80   117 0.992   8.47
 5.13 Intr - 126533 126499   35  1  2   52   64    34 0.109  -4.73
 5.12 Intr - 127510 127377  134  1  2  108   96    82 0.388  10.44
 5.11 Intr - 139833 139664  170  1  2   91   22   104 0.183   3.77
 5.10 Intr - 140971 140933   39  2  0   85   82    50 0.656   2.30
 5.09 Intr - 142543 142492   52  1  1   91   79    60 0.992   3.88
 5.08 Intr - 142664 142629   36  2  0  124  110     9 0.913   5.16
 5.07 Intr - 151229 151178   52  1  1  106  103    -4 0.968   1.81
 5.06 Intr - 151433 151368   66  1  0   78  106    44 0.881   3.22
 5.05 Intr - 152524 152421  104  1  2   81   15   133 0.820   4.27
 5.04 Intr - 157529 157473   57  2  0   73  111    26 0.009   2.48
 5.03 Intr - 167034 166948   87  0  0   81  106    48 0.478   6.07
 5.02 Intr - 168710 168650   61  1  1   78   92    41 0.709   2.24
 5.01 Init - 170591 170545   47  2  2   89  100    26 0.848   4.21
 5.00 Prom - 177139 177100   40                              -5.96

 6.04 PlyA - 177995 177990    6                               1.05
 6.03 Term - 180261 180199   63  0  0  108   39    86 0.966   3.59
 6.02 Intr - 193139 193104   36  2  0  110  110    36 0.276   6.46
 6.01 Init - 207867 207811   57  0  0   60   98    49 0.700   4.41
 6.00 Prom - 223249 223210   40                              -2.36

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 107200 107457  258  0  0   81   48   230 0.846  13.37
S.002 Sngl + 176593 176838  246  0  0   62   43   163 0.810   4.28

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:36513869_36741335|GENSCAN_predicted_peptide_1|155_aa
XLGKVLSAVGSAQLLMSQKFQQFRGLCEQNLNPDANPRPTAQDLAGFWDLLQLSIEDISM
KFDELYHLKANSWQLVETPEKRKEEKKPPPPVPKKPAKSKPAVSRDKASDASDKQRQEAR
KRLLAAKRAASVRQNSATESADSIEIYVPEAQTRL

>gi568815578r:36513869_36741335|GENSCAN_predicted_CDS_1|468_bp
ntcttaggaaaagtcctcagtgctgtgggcagtgcccagctactgatgtcccagaaattc
cagcagttccggggcctctgtgagcaaaacttgaaccctgatgccaacccacgccccaca
gcccaggacctggcagggttctgggacctgctacagctgtccatcgaggatatcagcatg
aagttcgatgaactctaccacctcaaggccaacagctggcagctggtggagacccccgag
aagaggaaggaagagaagaaaccaccccctccggtcccaaagaagccagccaaatccaag
ccggcagtgagccgcgacaaggcctcagacgccagcgacaagcagcgccaggaggcccgc
aagagactcctggcggccaagcgggcagcttctgtgcggcagaactcagccaccgagagc
gcagacagcatcgagatttatgtcccggaggcccagaccaggctctga

>gi568815578r:36513869_36741335|GENSCAN_predicted_peptide_2|233_aa
MRGGPGPTRLSRGAKAGPTDWDVELSGTESVNPYCWALLMSPVPDEDSRAQRGKPHPPEA
KMSSKRAKAKTTKKRPQRATSNVFAMFDQSQIQEFKEAFNMIDQNRDGFIDKEDLHDMLA
SLGKNPTDEYLEGMMSEAPGPINFTMFLTMFGEKLNGTDPEDVIRNAFACFDEEASGFIH
EDHLRELLTTMGDRFTDEEVDEMYREAPIDKKGNFNYVEFTRILKHGAKDKDD

>gi568815578r:36513869_36741335|GENSCAN_predicted_CDS_2|702_bp
atgagaggtggcccgggcccgacccggctgtcacggggggcaaaggcgggccccacggac
tgggacgtggagctctcgggcactgagtcagttaatccctactgctgggcactcctcatg
tctccagtcccagatgaagactctcgggctcagagagggaagccccacccaccagaagcc
aagatgtccagcaagcgggccaaagccaagaccaccaagaagcggccacagcgggccaca
tccaatgtcttcgcaatgtttgaccagtcccagatccaggagtttaaggaggctttcaac
atgattgaccagaaccgtgatggcttcattgacaaggaggacctgcacgacatgctggcc
tcgctggggaagaaccccacagacgaatacctggagggcatgatgagcgaggccccgggg
cccatcaacttcaccatgttcctcaccatgtttggggagaagctgaacggcacggacccc
gaggatgtgattcgcaacgcctttgcctgcttcgacgaggaagcctcaggtttcatccat
gaggaccacctccgggagctgctcaccaccatgggtgaccgcttcacagatgaggaagtg
gacgagatgtaccgggaggcacccattgataagaaaggcaacttcaactacgtggagttc
acccgcatcctcaaacatggcgccaaggataaagacgactag

>gi568815578r:36513869_36741335|GENSCAN_predicted_peptide_3|237_aa
MSDSDLGEDEGLLSLAGKRKRRGNLPKESVKILRDWLYLHRYNAYPSEQEKLSLSGQTNL
SVLQICNWFINARRRLLPDMLRKDGKDPNQFTISRRGGKASDVALPRGSSPSVLAVSVPA
PTNVLSLSVCSMPLHSGQGEKPAAPFPRGELESPKPLVTPGSTLTLLTRAEAGSPTGGLF
NTPPPTPPEQDKEDFSSFQLLVEVALQRAAEMELQKQQDPSLPLLHTPIPLVSENPQ

>gi568815578r:36513869_36741335|GENSCAN_predicted_CDS_3|714_bp
atgtcggacagtgatctaggtgaggacgaaggcctcctctccctggcgggcaaaaggaag
cgcagggggaacctgcccaaggagtcggtgaagatcctccgggactggctgtacttgcac
cgctacaacgcctacccctcagagcaggagaagctgagcctttctggacagaccaacctg
tcagtgctgcaaatatgtaactggttcatcaatgcccggcggcggcttctcccagacatg
cttcggaaggatggcaaagaccctaatcagtttaccatttcccgccgcgggggtaaggcc
tcagatgtggccctcccccgtggcagcagcccctcagtgctggctgtgtctgtcccagcc
cccaccaatgtgctctccctgtctgtgtgctccatgccgcttcactcaggccagggggaa
aagccagcagcccctttcccacgtggggagctggagtctcccaagcccctggtgacccct
ggtagcacacttactctgctgaccagggctgaggctggaagccccacaggtggactcttc
aacacgccaccacccacacccccagagcaggacaaagaggacttcagcagcttccagctg
ctggtggaggtggcgctacagagggctgctgagatggagcttcagaagcagcaggaccca
tcactcccattactgcacactcccatccctttagtctctgaaaatccccagtag

>gi568815578r:36513869_36741335|GENSCAN_predicted_peptide_4|129_aa
MSGGRRKEEPPQPQLANGALKVSVWSKVLRSDAAWEDKDEFLDVIYWFRQIIAVVLGVIW
GVLPLRGFLGIAGFCLINAGVLYLYFSNYLQIDEEEYGGTWELTKEGFMTSFALFMVIWI
IFYTAIHYD

>gi568815578r:36513869_36741335|GENSCAN_predicted_CDS_4|390_bp
atgagcggcgggcggcggaaggaggagccgcctcagccgcagctggccaacggggccctc
aaagtctccgtctggagtaaggtgctgcggagcgacgcggcctgggaggataaggatgaa
tttttagatgtgatctactggttccgacagatcattgctgtggtcctgggtgtcatttgg
ggagttttgccattacgagggttcttgggaatagcaggattctgcctgatcaatgcagga
gtcctgtacctctacttcagcaattacctacagattgatgaggaagaatatggtggcacg
tgggagctcacgaaggaagggtttatgacctcttttgccttgttcatggtcatttggatc
atcttttacactgccatccattatgactga

>gi568815578r:36513869_36741335|GENSCAN_predicted_peptide_5|544_aa
MDELAEMLPPVLTHLSLKSIIGIGVGAGAYILSRFALNHPELVEGLVLINVDPCAKGWID
WAASKLSGLTTNVVDIILAHHFGQEELQANLDLIQTYRMHIAQDINQDNLQLFLNSYNGR
RDLEIERPILGQNDNKSKTLKCSTLLVVGDNSPAVEAVMADCGGLPQVVQPGKLTEAFKY
FLQGMGYIPYVQLSHLSTESVPSASMTRLARSRTHSTSSSLGSGESPFSRSVTSNQSDGT
QESCESPDVLDRHQTMEISLDDVLLSALLRNNGKSAQQKKISAKPKLEFLCPRPGTCDHG
SRKFCYTVLVDPRERSKATAVALGSFPAGGPAELSLRLGEPLTIVSEDGDWWTVLSEVSG
REYNIPSVHVAKVSHGWLYEGLSREKAEELLLLPGNPGGAFLIRESQTRRGSYSLSVRLS
RPASWDRIRHYRIHCLDNGWLYISPRLTFPSLQALVDHYSELADDICCLLKEPCVLQRAG
PLPGKDIPLPVTVQRTPLNWKELDSSLLFSEAATGEESLLSEGLRESLSFYISLNDEAVS
LDDA

>gi568815578r:36513869_36741335|GENSCAN_predicted_CDS_5|1635_bp
atggatgagctggctgaaatgctgcctcctgttcttacccacctaagcctgaaaagcatc
attggaattggagttggagctggagcttacatcctcagcagatttgcactcaaccatcca
gagcttgtggaaggccttgtgctcattaatgttgacccttgcgctaaaggctggattgac
tgggcagcttccaaactctctggcctgacaaccaatgttgtggacattattttggctcat
cactttgggcaggaagagttacaggccaacctggacctgatccaaacctacagaatgcat
attgcccaagacatcaaccaagacaacctgcagctcttcttgaattcctacaatggacgc
agagacctggagatcgaaagacccatactgggccaaaatgataacaaatcaaaaacatta
aagtgttctactttactggtggtaggggacaattcgcctgcagttgaggctgtgatggcg
gactgtgggggactgccccaggtagttcagcctgggaagctcaccgaggccttcaagtac
tttttgcagggaatgggctacatcccgtatgtgcagctcagtcacctgagcaccgagtca
gtaccatctgccagcatgactcggctcgcccgatcacgaacccactcaacctcgagtagc
ctcggctctggagaaagtcccttcagccggtctgtcaccagcaatcagtcagatggaact
caagaatcctgtgagtcccctgatgtcctggacagacaccagaccatggagatttccctc
gatgatgtgcttctgagtgctctgctgaggaacaatgggaagtctgcccagcagaagaaa
atctctgccaagcccaagcttgagttcctctgtccaaggccagggacctgtgaccatgga
agcagaaagttctgctatactgtcctagtggacccgagagagagaagcaaggccacagcc
gtggccctgggcagtttcccggcaggtggcccggccgagctgtcgctgagactcggggag
ccattgaccatcgtctctgaggatggagactggtggacggtgctgtctgaagtctcaggc
agagagtataacatccccagcgtccacgtggccaaagtctcccatgggtggctgtatgag
ggcctgagcagggagaaagcagaggaactgctgttgttacctgggaaccctggaggggcc
ttcctcatccgggagagccagaccaggagaggctcttactctctgtcagtccgcctcagc
cgccctgcatcctgggaccggatcagacactacaggatccactgccttgacaatggctgg
ctgtacatctcaccgcgcctcaccttcccctcactccaggccctggtggaccattactct
gagctggcggatgacatctgctgcctactcaaggagccctgtgtcctgcagagggctggc
ccgctccctggcaaggatatacccctacctgtgactgtgcagaggacaccactcaactgg
aaagagctggacagctccctcctgttttctgaagctgccacaggggaggagtctcttctc
agtgagggtctccgggagtccctcagcttctacatcagcctgaatgacgaggctgtctct
ttggatgatgcctag

>gi568815578r:36513869_36741335|GENSCAN_predicted_peptide_6|51_aa
MDELQDVQLTEIKPLLNDKNGTRNFQDFDCQPTQQDNKDEDLYDDPLPLNE

>gi568815578r:36513869_36741335|GENSCAN_predicted_CDS_6|156_bp
atggatgaacttcaggatgttcagctcacagagatcaaaccacttctaaatgataagaat
ggtacaagaaacttccaggactttgactgtcagcctactcaacaagataacaaggatgaa
gacctttatgatgatccacttccacttaatgaatag