GENSCAN 1.0    Date run: 4-Nov-116    Time: 04:30:08

Sequence gi568815577r:7719236_7919622 : 200387 bp : 44.60% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -  24597  24592    6                               1.05
 1.03 Term -  25791  25592  200  1  2   78   52   164 0.935   9.36
 1.02 Intr -  26104  25976  129  2  0   74   75   119 0.964   9.87
 1.01 Init -  28922  28868   55  2  1   35  119    73 0.904   6.55
 1.00 Prom -  30203  30164   40                              -5.76

 2.06 PlyA -  33520  33515    6                               1.05
 2.05 Term -  35408  35321   88  1  1   51   47    65 0.089  -4.17
 2.04 Intr -  41953  41827  127  2  1   59  113    96 0.826   8.94
 2.03 Intr -  42367  42269   99  2  0  128   28    50 0.438   3.08
 2.02 Intr -  44197  44085  113  0  2   65   59    79 0.822   2.72
 2.01 Init -  51313  50961  353  1  2  110   34   130 0.759   6.44
 2.00 Prom -  53774  53735   40                              -4.16

 3.00 Prom +  54479  54518   40                              -3.56
 3.01 Init +  54532  54552   21  0  0   95   24    42 0.598  -3.79
 3.02 Intr +  55039  55159  121  0  1  119   59    -4 0.629  -0.13
 3.03 Intr +  56357  57102  746  0  2    9   98   479 0.420  32.17
 3.04 Intr +  67763  67796   34  1  1   93   50   106 0.121   5.10
 3.05 Term +  68341  68480  140  2  2  -16   39   155 0.129  -1.67
 3.06 PlyA +  68767  68772    6                              -0.45

 4.04 PlyA -  68779  68774    6                              -0.45
 4.03 Term -  70383  69971  413  1  2   73   54   365 0.951  27.00
 4.02 Intr -  77082  76902  181  0  1   88   36    73 0.221   1.54
 4.01 Init -  86719  86579  141  1  0   71    2   136 0.011   3.43
 4.00 Prom -  93604  93565   40                              -6.16

 5.02 PlyA -  94655  94650    6                               1.05
 5.01 Sngl - 100387  99998  390  1  0   70   46   583 0.796  48.32
 5.00 Prom - 105471 105432   40                              -3.96

 6.00 Prom + 108006 108045   40                              -1.76
 6.01 Init + 108715 108722    8  0  2   58   87     0 0.073  -2.58
 6.02 Intr + 115007 115114  108  2  0  117   86    52 0.308   7.30
 6.03 Intr + 119710 119844  135  1  0  -52  105   141 0.333   1.48
 6.04 Intr + 120646 120761  116  1  2   50   66    63 0.666   0.39
 6.05 Term + 123557 123795  239  0  2   29   48   184 0.682   4.83
 6.06 PlyA + 123929 123934    6                               1.05

 7.02 PlyA - 124431 124426    6                               1.05
 7.01 Sngl - 199032 198694  339  0  0   29   48   211 0.505   7.23

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  64803  64865   63  2  0   91   80    26 0.856   3.25

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815577r:7719236_7919622|GENSCAN_predicted_peptide_1|127_aa
MTHNPKKTSALDEEPTPSDSPGTNTVGLNQSWAPPGPRASRNTLLNGLTAPDGQVQQGRP
AAEAPAPGRAQTTTSTVLPGKTTGPEMHRYVLRRSGPVPPTRVHYSVRRRSGPGQKTWGA
LLTAQAQ

>gi568815577r:7719236_7919622|GENSCAN_predicted_CDS_1|384_bp
atgacacataaccccaagaagacttcagctctggatgaagaacctacaccctctgactct
ccggggaccaacaccgtaggcttgaaccaatcctgggccccgcccgggccacgtgccagc
cggaacacgctgctcaacgggttgacggcgcccgacggccaggtccagcaggggcgcccc
gcagcggaggctccagcgcctggccgcgcacaaaccacgacttctaccgtcctgccgggg
aaaactacaggtcccgaaatgcaccgctacgtgctcaggcgcagtgggccggtgccgccg
acgagagtgcactactccgtgcgcaggcgcagtgggcccgggcagaagacctggggcgcg
ctgctcactgcgcaggcgcagtga

>gi568815577r:7719236_7919622|GENSCAN_predicted_peptide_2|259_aa
MPRFASPLLRNVIIRSQFDGIKRKQCLQYLKTLRTLQYDGFKTVYFGETNIPESLVTGED
ISDGYFIQTPTWCIVHAAGSQGWVPWKYRVFLRDELCIKQEDSLFSEFCDVVRKAYGNHS
LIKYWSTFCVPGWVRQGQEYQGKQAWTLKVQVWCGDSPPSILWLHSDTDEICMGELPHGV
MVRVFPAVDCVKRLNLFFTPEVITMNITSSRVASHVGRCGCRSVWRMKSTRPLLLRQQPD
PVKGAGARRPGSELYFTTD

>gi568815577r:7719236_7919622|GENSCAN_predicted_CDS_2|780_bp
atgcctcgctttgcaagccctcttttaagaaatgtcattatcagaagtcaatttgatggc
atcaagaggaagcaatgcctccaatatctgaaaaccctgagaacactgcaatatgatgga
tttaagaccgtatattttggggaaaccaatatcccagaaagtctcgtaactggggaagat
attagtgatggatatttcatacaaaccccaacttggtgtattgtgcatgctgcgggtagt
caaggatgggtgccttggaaatatcgggtgttcctaagagacgagctgtgtatcaaacaa
gaagacagcctcttctctgagttctgtgatgtggtgaggaaggcctatggaaatcattca
ctcatcaaatactggagcaccttctgtgtgcctggctgggtgcgtcagggccaggagtac
caaggaaaacaagcctggaccctcaaggtccaggtctggtgtggagactctccaccgtcc
atcctctggcttcactctgacacagatgagatctgcatgggggagctccctcatggtgtg
atggtgcgtgtgtttccggcagtggactgtgtgaaacggctgaacctcttcttcacgcct
gaggtcatcaccatgaacataacatcatcacgtgtggcatcccatgtgggacgctgtggc
tgccgcagtgtctggcgcatgaagagcacacggcccttgcttctgaggcagcagccagat
ccagtgaaaggagctggagcccggaggcctgggtcagagctctactttaccaccgactga

>gi568815577r:7719236_7919622|GENSCAN_predicted_peptide_3|353_aa
MKWGVALESPASRDFPSPLPTPAEAANCLGERSGLPGATSPTLECLEDNKPTENKPTENK
PTDNKPTDNKPTDNKPAENKPTENKPTDNKPTENKPAENRPRDNKLTDNKPTENKPTENK
PTDNKPTDNKPTENKLTENKPTDYKPTENKPADNKPKENKPTDNKPTDNKSAENKPADNK
PTENKPTDNKPTDNKPTENKPTENKPAENKPTDNKPTSTNPQQQIYNNKPTESKPTGNQP
NYVCCASRQTMLPPSVYEHVCVRLSHRLQTCFQNNASSEMSRPHEVTKPLLCSAPGDDDN
DDDDNDDASSPPPLMPTSTTTTYATTVSIITTTNNITNSNTAININHHHHQNH

>gi568815577r:7719236_7919622|GENSCAN_predicted_CDS_3|1062_bp
atgaagtggggggtggcactggagtctccagcctcccgggacttcccctccccgctgccc
actccagcagaggctgccaactgcctgggagagagaagtgggcttcctggggccacctcc
ccaactttggagtgtttggaagacaacaaacccacagagaacaaacccacagagaacaaa
cccacagacaacaaacccacagacaacaagcccacagacaacaagcccgcagaaaacaaa
cccacagaaaacaaacccacagacaacaaacccacagaaaacaaacccgcagagaacaga
cccagagacaacaaactcacagacaacaaacccacagagaacaaacccacagagaacaaa
cccacagacaacaaacccacagacaacaaacccacagagaacaaactcacagagaacaaa
cccacagactacaaacccacagagaacaaacccgcagacaacaaacccaaagagaacaaa
cccacagacaacaaacccacagacaacaaatccgcagaaaacaaacccgcagacaacaaa
cccacagagaacaaacccacagacaacaaacccacagacaacaaacccacagaaaacaaa
cccacagaaaacaaacccgcagagaacaaacccacagacaacaaacccacatcaacaaac
ccacaacaacaaatctacaacaacaaacccacagagagcaagcccacagggaaccagcca
aattatgtctgctgtgcatctcggcagacgatgctgccaccgtctgtgtatgagcatgtg
tgtgtcagactttcccatcgtctccaaacttgttttcagaataatgcttccagtgaaatg
agtcggccacatgaggtcacaaagcccctactctgttcagcacctggggatgatgacaat
gatgatgacgacaatgatgatgcatcatcaccaccaccattaatgccaacatcaaccacc
accacctacgccaccaccgttagcatcataaccaccaccaataacatcaccaacagcaac
actgccatcaacataaaccatcaccaccaccaaaaccattag

>gi568815577r:7719236_7919622|GENSCAN_predicted_peptide_4|244_aa
MGKDFTSKTPKAMAAKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWDQGHKILLILHLL
HHLHLTDEETEAHKDEGAGPGSCSKDVAGARALTPSPYHCHEPLIITGAKWTPHEASNQT
QASTLLGLLLGDHTEGRNDTNSTRALKVPDGTSAAWYILTIIGIYAVIFVFRLASNILRK
NDKSLEDVYYSNLTSELKMTGLQGKVAKCSTLSISNRAVLQPCQAHLGAKGGSSGPQTAT
PETP

>gi568815577r:7719236_7919622|GENSCAN_predicted_CDS_4|735_bp
atgggcaaggacttcacgtctaaaacaccaaaagcaatggcagcaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg
aacaggcaacctacagaatgggaccaaggacataagatcctgctcatccttcacctcctt
caccaccttcacctcacagatgaggaaactgaggctcacaaagatgagggggctggccca
gggtcatgcagcaaggatgtggctggggccagagccctgactcccagcccttatcactgc
cacgagccccttattatcacgggggccaagtggactccccatgaagcctccaaccagacc
caggccagcaccctcctggggctcctgctgggtgaccacacagaggggaggaatgacacc
aactccaccagggctctgaaggtgccagacggaaccagcgctgcctggtatatactcacc
atcatcggcatctacgcggtgattttcgtcttccggctggccagcaacatcctcagaaag
aatgacaagtccttagaagatgtttattactcaaatctgacctctgaactcaaaatgaca
gggctgcagggcaaggtcgccaagtgctccaccctgtctatcagcaacagagctgtgctg
cagccctgccaggcccacctgggggcaaagggcggaagcagcgggccccaaaccgcaacc
ccagagaccccctga

>gi568815577r:7719236_7919622|GENSCAN_predicted_peptide_5|129_aa
MILSNTTAVTPFLTKLWQETVQQGGNMSGLARRSPRSGDGKLEALYVLMVLGFFGFFTLG
IMLSYIRSKKLEHSNDPFNVYIESNAWQEKDKAYVQARVLESYRSCYVVENHLAIEQPNT
HLPETKPSP

>gi568815577r:7719236_7919622|GENSCAN_predicted_CDS_5|390_bp
atgatcctgtctaacaccacagcggtgacgccctttctgaccaagctgtggcaggagaca
gttcagcagggtggcaacatgtcgggcctggcccgcaggtccccccgcagcggtgacggc
aagctggaggccctctacgtcctcatggtactgggattcttcggcttcttcaccctgggc
atcatgctgagctacatccgctccaagaagctggagcactcgaacgacccattcaacgtc
tacatcgagtccaatgcctggcaagagaaggacaaggcctatgtccaggcccgggtcctg
gagagctacaggtcgtgctatgtcgttgaaaaccatctggccatagaacaacccaacaca
caccttcctgagacgaagccttccccatga

>gi568815577r:7719236_7919622|GENSCAN_predicted_peptide_6|201_aa
MGKHFFTGLASVSSFTQHRLMPATERDPGLFQISDPQQRQLGSGADPEQTTAALRKSDLT
DKRKTNKQTESTTDNINQENPIQRDTDGAGSRYPQQTNAGTENQTLHDLIYKWELNDKNT
WPQTDRYSMLKISANIVEFSSTINQLDLIHLKRHPTTAEYTFFSSSYGTLTKIDHILRHK
IHLANLEESKSSYVCPQTTME

>gi568815577r:7719236_7919622|GENSCAN_predicted_CDS_6|606_bp
atgggaaaacacttcttcacgggtctagcctctgtttccagctttacacagcacaggctg
atgcctgccactgaacgggatccaggactttttcaaatctcagatccccagcaaaggcaa
ctagggtctggagcagaccctgagcaaaccacagcagccctgcggaagagtgacctgact
gataaaagaaaaacaaacaaacaaacagaaagcaccaccgacaacatcaaccaggaaaac
cccattcaaagggacaccgatggagctggaagccgttatcctcagcaaactaatgcagga
acagaaaaccaaacactgcatgatctcatttataagtgggagctgaatgacaagaacaca
tggccacaaacagacagatacagcatgctaaaaatcagtgcgaacatagttgaattcagc
agcaccatcaatcaactggatctgattcatctaaaacgtcatccaacaacagcagaatac
acattcttctcaagctcatatggaacactcaccaagatagaccacattctgcgtcataaa
atacacttggcaaatttagaagaatcaaaatcatcctatgtctgccctcagaccacaatg
gaatga

>gi568815577r:7719236_7919622|GENSCAN_predicted_peptide_7|112_aa
MEWNRVQCTDIVWNGMEGNAMISCGMEWNGMEWNGMEWNGMDLNGMDSNAMDSNGMDSNA
MDWRAIDSNEWKRLEWNGKEQNEMELDGTEWNGMEWNRMECTELEWTRMEWT

>gi568815577r:7719236_7919622|GENSCAN_predicted_CDS_7|339_bp
atggaatggaatagagtgcaatgcacagatatcgtgtggaatggaatggagggcaatgca
atgatatcgtgtggaatggaatggaatggaatggaatggaatggaatggaatggaatgga
atggacttgaatggaatggactcgaatgcaatggactcgaatggaatggattccaatgca
atggactggagggcaattgactcgaatgaatggaaacgactggaatggaatggaaaggag
cagaatgaaatggaattggatggaactgaatggaatggaatggaatggaatcgaatggaa
tgcactgaattggaatggactcgaatggaatggacttga