GENSCAN 1.0	Date run: 8-Nov-116	Time: 16:43:47

Sequence gi568815593r:115516212_115725792 : 209581 bp : 38.26% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -     38     33    6                               1.05
 1.02 Term -   9406   8097 1310  2  2  109   44   845 0.658  72.95
 1.01 Init -  27282  26739  544  0  1   41  113   495 0.991  42.40
 1.00 Prom -  28982  28943   40                              -6.85

 2.00 Prom +  29107  29146   40                              -5.85
 2.01 Init +  29163  29316  154  2  1   91   48    33 0.090  -0.16
 2.02 Intr +  33040  33176  137  2  2   18  111    70 0.089   1.67
 2.03 Intr +  36077  36148   72  1  0  109   38    49 0.076   0.68
 2.04 Intr +  48542  48917  376  1  1  -56   32   329 0.364   6.56
 2.05 Term +  49308  49498  191  1  2   44   50   124 0.527   0.83
 2.06 PlyA +  49956  49961    6                               1.05

 3.08 PlyA -  50520  50515    6                               1.05
 3.07 Term -  64986  64338  649  2  1   42   50   531 0.532  37.26
 3.06 Intr -  70780  70674  107  1  2   42   73    78 0.172  -0.21
 3.05 Intr -  72147  71976  172  2  1   56   62   111 0.215   4.32
 3.04 Intr -  86226  86108  119  0  2   97   72    91 0.018   6.74
 3.03 Intr - 100234 100061  174  1  0   70   68   175 0.615  12.91
 3.02 Intr - 104469 104224  246  0  0  118   99   200 0.998  20.83
 3.01 Init - 109581 109390  192  0  0   71   69   287 0.984  22.11
 3.00 Prom - 111586 111547   40                              -6.55

 4.00 Prom + 120303 120342   40                              -3.55
 4.01 Init + 121687 121740   54  0  0   60   96    14 0.123   0.74
 4.02 Intr + 130365 130457   93  2  0   56   78    67 0.217   1.74
 4.03 Intr + 135960 136148  189  2  0  108   33    74 0.177   2.76
 4.04 Intr + 136818 137011  194  2  2   19   10   184 0.640   1.07
 4.05 Intr + 137165 137446  282  2  0   79   92   119 0.586   7.01
 4.06 Term + 141305 141479  175  2  1   45   38   157 0.597   2.65
 4.07 PlyA + 142107 142112    6                               1.05

 5.00 Prom + 148443 148482   40                              -6.65
 5.01 Sngl + 159341 159817  477  1  0   88   38   401 0.959  30.94
 5.02 PlyA + 160019 160024    6                              -0.45

 6.00 Prom + 160191 160230   40                              -8.25
 6.01 Init + 161646 161896  251  2  2   59   72   180 0.107  10.48
 6.02 Intr + 162478 163110  633  1  0    2   39   294 0.004   6.22
 6.03 Intr + 168715 168892  178  1  1   72   80    69 0.053   3.50
 6.04 Term + 170305 170460  156  0  0   37   33   128 0.314  -0.85
 6.05 PlyA + 170666 170671    6                               1.05

 7.06 PlyA - 170864 170859    6                               1.05
 7.05 Term - 184577 184449  129  2  0   90   47    83 0.193   1.60
 7.04 Intr - 195887 195682  206  0  2   64   27   116 0.329   1.10
 7.03 Intr - 196462 196353  110  0  2  -25   57   123 0.042  -3.09
 7.02 Intr - 201118 200990  129  0  0   53  100   129 0.584   9.49
 7.01 Intr - 201698 201491  208  0  1   55   83   143 0.903   7.71

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:115516212_115725792|GENSCAN_predicted_peptide_1|617_aa
MDLKTAVFNAARDGKLRLLTKLLASKSKEEVSSLISEKTNGATPLLMAARYGHLDMVEFL
LEQCSASIEVGGSVNFDGETIEGAPPLWAASAAGHLKVVQSLLNHGASVNNTTLTNSTPL
RAACFDGHLEIVKYLVEHKADLEVSNRHGHTCLMISCYKGHKEIAQYLLEKGADVNRKSV
KGNTALHDCAESGSLDIMKMLLMYCAKMEKDGYGMTPLLSASVTGHTNIVDFLTHHAQTS
KTERINALELLGATFVDKKRDLLGALKYWKKAMNMRYSDRTNIISKPVPQTLIMAYDYAK
EVNSAEELEGLIADPDEMRMQALLIRERILGPSHPDTSYYIRYRGAVYADSGNFKRCINL
WKYALDMQQSNLDPLSPMTASSLLSFAELFSFMLQDRAKGLLGTTVTFDDLMGILCKSVL
EIERAIKQTQCPADPLQLNKALSIILHLICLLEKVPCTLEQDHFKKQTIYRFLKLHPRGK
NNFSPLHLAVDKNTTCVGRYPVCKFPSLQVTAILIECGADVNVRDSDDNSPLHIAALNNH
PDIMNLLIKSGAHFDATNLHKQTASDLLDEKEIAKNLIQPINHTTLQCLAARVIVNHRIY
YKGHIPEKLETFVSLHR
>gi568815593r:115516212_115725792|GENSCAN_predicted_CDS_1|1854_bp
atggatctaaagacagcagtatttaacgcagctcgggatggcaaactccggcttctcacc
aaattgttggcaagcaaatccaaagaggaggtttcctccttgatctctgaaaaaacaaat
ggggccacgccactcttgatggccgccaggtatgggcaccttgacatggtggaattcctc
ctagagcaatgcagtgcctccatagaagttgggggctccgtcaattttgatggcgaaacc
attgagggggctccccctttatgggccgcttctgcagcaggacatctgaaggtggtccag
tccttgttaaatcatggagcatctgtcaacaacacgactttaaccaattcaactcctctt
cgagctgcgtgtttcgatggccatttggaaatagtgaagtaccttgtagaacacaaagct
gatttggaagtgtcaaaccgacatgggcatacgtgcttgatgatttcatgttacaaagga
cataaagagattgctcagtatttacttgaaaagggggcagatgttaatagaaaaagtgtc
aaaggtaatactgcattgcatgattgtgcagaatctggaagtttggacatcatgaagatg
cttcttatgtattgtgccaagatggaaaaggatggttatggaatgactccccttctctca
gcaagtgtgactggtcacacaaatattgtggattttctgacacaccatgcacagaccagc
aagacagaacgtattaatgctctagagcttctgggagctacatttgtagacaaaaaaaga
gatctgcttggggctttgaaatactggaaaaaggcaatgaacatgaggtacagtgatagg
actaatattattagtaaaccagtgccacagacactaataatggcttatgattatgccaag
gaagtgaacagtgcagaagagctagaaggtcttattgctgatcctgatgagatgagaatg
caggcactattaatcagagaacgtattcttggtccttctcatcctgatacctcttactat
attagatatagaggcgctgtctatgcagactctggaaatttcaaacgatgcatcaaccta
tggaagtatgctttggatatgcagcagagcaatttggatcctttaagcccaatgaccgcc
agcagcttattatcttttgcagaactattctcctttatgctacaggatagggctaaaggc
ctgctgggtactactgttacatttgatgatcttatgggcatactttgcaaaagcgtcctt
gaaatagagcgagctatcaaacaaactcagtgtccagctgacccattacagttaaataag
gccctttctattattttgcacttaatttgcttgttagagaaagttccttgtactctagaa
caagaccatttcaaaaagcagactatatacaggtttcttaagctgcatccaaggggaaag
aataacttcagccctcttcatctggctgtggacaagaatactacatgtgtagggcggtac
cctgtttgtaaatttccatctctacaagttactgcaatactgatagaatgtggtgctgat
gtgaacgtcagagactcggatgacaacagtcccctgcatatcgctgctcttaacaaccat
ccagacatcatgaatctccttattaaatcaggtgcacattttgatgccacaaacttgcac
aaacaaactgctagtgacttgctggatgagaaggaaatagctaaaaatttaatccagcct
ataaatcataccacattgcagtgtcttgctgctcgtgtcatagtgaatcatagaatatat
tataaagggcatatcccagaaaagctagagacttttgtttcccttcatagatga
>gi568815593r:115516212_115725792|GENSCAN_predicted_peptide_2|309_aa
MGETAFLIQLSPTRSLPQYVGIMGAKIQIWVRTSITLTHVKSLGLVFRPYFAITVFLKAT
ECYWGITGRLIFKGSVGGAKDWKRGKGPETASELPIQVLPTQAQGVPLLPFSEEASASER
KIPENVEVTLELGNRQRLEEFGGLRRRQEDEGSLELPRDWLNGCDQNADSDMDSEIQAYQ
VSDGNEELIGNWSKGYSYALAKNLAILCPCPRDLWKFELENNNLGYLAEEISKKQSIQDA
HGCFKQPLAQRTPDIVHAAASEGARCKPWQLPCGVKPVGTQNARVKAWQLLPRFQRMYEK
AWCLGRSRL
>gi568815593r:115516212_115725792|GENSCAN_predicted_CDS_2|930_bp
atgggggaaacagccttcttgattcaattatctcccactaggtccctcccacaatatgtg
ggaatcatgggagctaaaattcaaatttgggtgcggaccagtattaccctcacacacgtg
aagtctctcggactagtcttccgtccttactttgcaattactgtcttcctgaaagccaca
gaatgctactggggcataacaggaagactcatttttaaggggtctgtagggggagccaaa
gactggaaaaggggaaagggaccagagacagcctctgaattacccattcaggtccttcct
acacaggcacagggtgttccactgctgccattctctgaagaggccagtgcctcagaaaga
aagatacctgaaaatgtggaagtgactttggaactgggtaacaggcagaggttggaagag
tttggagggcttagaagaagacaggaagatgaaggaagtttggaacttcctagagactgg
ttaaatggttgtgaccaaaatgctgatagtgatatggacagtgaaatccaggcttatcag
gtctcagatggaaatgaggaacttattgggaactggagcaaaggttactcttatgcctta
gcaaagaacttggctatattgtgtccatgtcctagggatctgtggaaatttgaacttgag
aacaataacctagggtatctggcagaagaaatttctaagaagcaaagtattcaagatgct
catggctgcttcaaacaacccttggctcaaaggaccccagatatagttcatgctgctgct
tcagagggtgcacgatgtaagccttggcagcttccatgtggtgttaagcctgtgggtaca
cagaatgcaagagtgaaggcttggcagcttctgcctagatttcagaggatgtatgaaaaa
gcctggtgtctaggcagaagccggctgtag
>gi568815593r:115516212_115725792|GENSCAN_predicted_peptide_3|552_aa
MPRPGSAQRWAAVAGRWGCRLLALLLLVPGPGGASEITFELPDNAKQCFYEDIAQGTKCT
LEFQVITGGHYDVDCRLEDPDGKVLYKEMKKQYDSFTFTASKNGTYKFCFSNEFSTFTHK
TVYFDFQVGEDPPLFPSENRVSALTQMESACVSIHEALKSVIDYQTHFRLREAQGRSRAE
DLNTRVAYWSVGEALILLVVSIGQRHVPGGACHGKAAVRPRAAPRGRRGGRQQQSGLTSG
DLQGHYPVTQHTQELSGLFLFRKGVVVERQSLPSAQEDKTLILRRFHTYISPSLLANSLQ
FKMSTCLCLARVKRPVLQMALLSAFMAPDILNKAPVQHSVDTSPGYHESDSKKSEDLSLC
NVAEHSNTTEGPTGKQEGAQSVEEMFEEEAEEEVFLKFVILHAEDDTDEALRVQNLLQDD
FGIKPGIIFAEMPCGRQHLQNLDDAVNGSAWTILLLTENFLRDTWCNFQFYTSLMNSVNR
QHKYNSVIPMRPLNNPLPRERTPFALQTINALEEESRGFPTQVERIFQESVYKTQQTIWK
ETRNMVQRQFIA
>gi568815593r:115516212_115725792|GENSCAN_predicted_CDS_3|1659_bp
atgccgcggccggggtccgcgcagcgctgggcggccgtcgcgggccgttgggggtgcagg
ctgctcgcactgctgctactggtgcctggacccggcggcgcctctgagatcaccttcgag
cttcctgacaacgccaagcagtgcttctacgaggacatcgctcagggcaccaagtgcacc
ctggagttccaggtgattactggtggtcactatgatgtagattgtcgattagaagatcct
gatggtaaagtgttatacaaagagatgaagaaacagtatgatagttttaccttcacagcc
tccaaaaatgggacatacaaattttgcttcagcaatgaattttctactttcacacataaa
actgtatattttgattttcaagttggagaagacccacctttgtttcctagtgagaaccga
gtcagtgctcttacccagatggaatctgcctgtgtttcaattcacgaagctctgaagtct
gtcatcgattatcagactcatttccgtttaagagaagctcaaggccgaagccgagcagag
gatctaaatacaagagtggcctattggtcagtaggagaagccctcattcttctggtggtt
agcatagggcagcgccacgtcccgggtggggcctgccacggcaaagcagcagtccggcct
cgagcggcccctcgggggcggcggggtgggcgccaacagcagtcaggcctgacaagcggc
gacctccaaggacattatcctgtgacccagcacactcaagagttaagtggcctattcctc
ttcagaaaaggagtggtggtggagagacagagcttacccagtgcccaagaggataagacc
ttaattctaagacgcttccacacgtatatttcaccctctttactagcaaattcccttcag
ttcaaaatgtccacctgtctttgtcttgcaagggtgaagcgtcccgtgctgcagatggcc
ctcctgagtgctttcatggcacctgatattctgaataaagcacctgtgcagcacagtgtg
gatacaagtccaggatatcatgagtcagattccaagaagtctgaagatctatccttgtgt
aatgttgctgagcacagcaatacaacagaggggccaacaggaaagcaggagggagctcag
agcgtggaagagatgtttgaagaagaagctgaagaagaggtgttcctcaaatttgtgata
ttgcatgcagaagatgacacagatgaagccctcagagtccagaatctgctacaagatgac
tttggtatcaaacccggaataatctttgctgagatgccatgtggcagacagcatttacag
aatttagatgatgctgtaaatgggtctgcatggacaatcttattactgactgaaaacttt
ttaagagatacttggtgtaatttccagttctatacgtccctaatgaactccgttaacagg
cagcataaatacaactctgttatacccatgcggcccctgaacaatccccttccccgagaa
aggactccctttgccctccaaaccatcaatgccttagaggaagaaagtcgtggatttcct
acacaagtagaaagaatttttcaggagtctgtgtataagacacaacaaactatatggaaa
gagacaagaaatatggtacaaagacaatttattgcctga
>gi568815593r:115516212_115725792|GENSCAN_predicted_peptide_4|328_aa
MAREKGEASELCCDSSERENEQIDQQKLGFHRVQHTKGGGQHGSPREKEAAPHQAEPGPN
SSSASAPPLYNPPIISPPHTRSGLQFRSTTSPPSPAQQFPLEEVARAKGIVKAPRTITDA
ELCATLTVEGKAVLFLINMEATNSTLPSFQGPVSLASITVVGIDGQASKPLKTPQLCSKS
PSHPHLVSPHLNPQVWDTSTPSLATDHVPLTIPLKPNHPYPAQRQYPIPQQALRGLKPDI
TCLLQYGLLKPINSPYNSHILPVQKPDESYRKQSKILKILDWIVQYLKSTGGMGYGQSLT
YSSITVTQTVTGDDTGGQGFGKSSSGKT
>gi568815593r:115516212_115725792|GENSCAN_predicted_CDS_4|987_bp
atggcacgagaaaaaggagaggcttctgaactctgttgtgatagtagtgagagggaaaat
gaacagattgatcagcagaaattgggtttccatcgggtccaacacaccaagggagggggc
caacatgggagcccacgagagaaggaggctgctcctcaccaggccgagccaggtcccaat
tcttcctcagcctctgctcccccactgtataatcctcctatcatctcccctcctcacacc
cggtctggcttacagtttcgttccacgactagccctccctcacctgcccaacagtttcct
cttgaagaggtggctagagctaaaggcatagtcaaggccccccggacaatcactgatgcc
gagctttgcgcaactcttacagtggagggtaaggccgtcctcttcttaattaatatggag
gctaccaactccacattaccttcttttcaagggcctgtttcccttgcctccataactgtt
gtgggtattgacggccaggcttctaaacctcttaaaactccccaactctgttcgaagtct
ccttcacatcctcaccttgtatctccccaccttaatccacaagtatgggatacctctact
ccctctttggcaaccgatcatgtgccccttaccatcccattaaaacctaatcacccttac
ccagctcaacgccaatatcccatcccacaacaggctttaagaggattaaagcctgatatc
acttgcctgctacagtatggccttttaaagcctataaattctccttacaattctcacatt
ttacctgtccaaaaaccggacgagtcttacagaaaacaaagtaaaatccttaaaatcctg
gactggattgtgcaatatttgaagagtacagggggaatggggtatggacaatccttgaca
tatagtagtatcacagttactcaaactgtcactggtgatgacacaggaggtcaaggtttt
gggaaatcaagtagcggtaaaacttga
>gi568815593r:115516212_115725792|GENSCAN_predicted_peptide_5|158_aa
MGKKQSRKTGNSKKQSTSPPPKDHSSSAAMEQSWTENDFDELREEGFSRSNYEIQEEIQT
QGKEVKNFEKNLDECITRITNTEKCLKELMELKAKARELCEECRRLRSRRDQLEERVSAM
EDEMNEMKREGKFREKRIKRNKQSLQEIWDYVKDQIYV
>gi568815593r:115516212_115725792|GENSCAN_predicted_CDS_5|477_bp
atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcacctctcctcct
ccaaaggatcacagttcctcagcagcaatggaacaaagctggacagagaatgactttgat
gagttgagagaagaaggcttcagccgatcaaactacgagatacaggaggaaattcaaacc
caaggcaaagaagttaaaaactttgaaaaaaatttagatgaatgtataactagaataacc
aatacagagaagtgcttaaaggagctgatggagctgaaagccaaggcacgagaactatgt
gaagaatgcagaaggctcaggagccgacgtgatcaactggaagaaagggtatcagcgatg
gaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaagaga
aacaaacaaagcctccaagaaatatgggactatgtgaaagaccaaatctacgtctga
>gi568815593r:115516212_115725792|GENSCAN_predicted_peptide_6|405_aa
MTTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIV
SIIISLPTKKSPGPDGFTAGFYQRAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIV
SAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIHL
TRDVKDHFKENYKPLLNEIKEDKNKWKNIPCSWAGRINIVKMAILPKVIYRVNAIPIKLP
MTFFTELEKTTLKFLWNQKIACITKSILSQKNKAGGITLPDFKLYYKATVTKTAWVQLLH
CLPPLRLGHLRIPGTFCHNVIAKVWGMALSAFYLPGLFQLLIHYGNSNCFVPTQNLGEVP
KSSGLEVPTTLGWRLTRSGKAFAARRLLPLLPTSRYSPSLTPVNH
>gi568815593r:115516212_115725792|GENSCAN_predicted_CDS_6|1218_bp
atgaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacacctctac
gcaaataaactagaaaatctagaagaaatggataaattcctcgacacatacactctccca
agactaaaccaggaagaagttgaatctctgaatagaccaataacaggatctgaaattgtg
tcaataatcattagcttaccaaccaaaaagagtccaggaccagatggattcacagccgga
ttctaccagagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagag
gaagtcaaattatccctgtttgcagacgacatgattgtatacctagaaaaccccattgtc
tcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatc
aatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatc
atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccacctt
acaagggacgtgaaggaccacttcaaggagaactacaaaccactgctcaatgaaattaaa
gaggataaaaacaaatggaagaacattccatgctcatgggcaggaagaatcaatatcgtg
aaaatggccatactgcccaaggtaatttatagagtcaatgccatccccatcaagctacca
atgactttcttcacagaattggaaaaaactactttaaagttcttatggaaccaaaaaata
gcctgcatcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcaccctacct
gacttcaaactatactataaggctacagtaaccaaaacagcatgggtccagctgctccac
tgcctgccacctctgagactggggcatctcaggatccctggcaccttttgccacaatgtc
attgcaaaggtttggggtatggctctttctgccttctatcttccaggactttttcagctt
ctgattcactacggcaactctaactgttttgttccaacacagaaccttggcgaggttcca
aagtcttcagggttggaggttccaacaactctgggctggaggttgactaggtccgggaaa
gcttttgcagcccgaagacttctgcctctcttacctacttctcggtattctccatcccta
acccctgtcaaccactaa
>gi568815593r:115516212_115725792|GENSCAN_predicted_peptide_7|260_aa
XLPKFLYEKQSQPEPPGSGVPSHPERRLGPLEKVPSQKVFFLLPYRLDQFSKQFLCQRHY
VAENGRASWDVDRTCDLPLAMECGRGDRMLLQCDYVPLAGRLILGTLWQTLKKDLLNGFD
QNADNDIVNDIQAEVVSDGGEELVGNWSKAAPAMAKRGQGTAQVVASEGAGPNPWQLPHG
VEPAVAQKLRTEVWKPPPRFQRMYGNICMFRHMFAVGVNIIPTVWRNGPEVALCWNVTAS
LTWGPVLMAQHPTALGGPIL
>gi568815593r:115516212_115725792|GENSCAN_predicted_CDS_7|783_bp
naacttccaaaatttctctatgaaaagcagagccaaccagaaccacctggaagtggggtg
cccagccatcctgagagaagacttggtccactagagaaagtgccctcacagaaagtcttc
ttcctcctcccttatcgcctggaccaattttctaaacaatttctttgccagcgacattat
gttgcagaaaatggacgagctagctgggatgtggacaggacttgtgacttacctctagca
atggaatgtggcagaggtgatagaatgttacttcagtgtgattatgtccctcttgctggc
agacttattctaggaactctgtggcagaccttaaagaaagatttgttgaatggctttgac
caaaatgctgataatgatattgtcaatgacatccaggctgaggtggtctcagatggaggt
gaggaacttgttggcaactggagtaaagctgctccagccatggctaaaaggggccaaggt
acagctcaggttgtggcttcagagggtgcaggtcccaacccttggcagcttccacatggt
gttgagcctgcagttgcacagaagttgagaactgaggtttggaaacctccacctagattt
cagaggatgtatggaaacatctgtatgttcaggcacatgtttgctgtgggggtgaatatc
atccccacagtttggagaaatggccctgaagtagctctctgctggaatgtcacagcttca
ctgacctggggtcctgtgctcatggctcagcaccccacagctcttggtggccccattctt
tga