GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:46:31 Sequence gi568815590f:122851511_123054021 : 202511 bp : 44.68% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 779 774 6 1.05 1.02 Term - 5149 5066 84 1 0 68 54 80 0.229 0.25 1.01 Init - 10821 10807 15 0 0 121 105 37 0.832 9.12 1.00 Prom - 19821 19782 40 -6.36 2.00 Prom + 20169 20208 40 -7.16 2.01 Init + 20575 20654 80 0 2 95 66 102 0.608 9.23 2.02 Term + 33052 33187 136 1 1 69 39 104 0.145 1.09 2.03 PlyA + 35034 35039 6 1.05 3.00 Prom + 38752 38791 40 -4.46 3.01 Init + 46461 46523 63 2 0 83 91 87 0.989 9.65 3.02 Term + 64696 64824 129 0 0 88 54 116 0.886 6.38 3.03 PlyA + 67138 67143 6 1.05 4.00 Prom + 74938 74977 40 -3.56 4.01 Init + 76995 77034 40 2 1 71 86 32 0.639 1.65 4.02 Intr + 77334 77384 51 1 0 98 113 -18 0.544 0.58 4.03 Intr + 78645 78766 122 1 2 83 49 105 0.006 6.31 4.04 Intr + 84715 84840 126 0 0 51 59 104 0.007 4.68 4.05 Term + 94659 94754 96 2 0 46 48 76 0.068 -2.63 4.06 PlyA + 96457 96462 6 1.05 5.00 Prom + 97964 98003 40 -4.26 5.01 Sngl + 100001 102514 2514 1 0 102 42 2709 0.983 260.40 5.02 PlyA + 103860 103865 6 1.05 6.08 PlyA - 106480 106475 6 1.05 6.07 Term - 107705 107608 98 0 2 79 48 37 0.117 -3.07 6.06 Intr - 110135 110038 98 1 2 75 74 62 0.069 3.15 6.05 Intr - 119159 118993 167 2 2 47 73 127 0.139 5.86 6.04 Intr - 123129 123002 128 1 2 30 24 100 0.005 -1.80 6.03 Intr - 134694 134556 139 0 1 64 58 107 0.238 5.34 6.02 Intr - 139439 139345 95 0 2 64 5 129 0.089 1.88 6.01 Init - 145909 145894 16 1 1 94 74 20 0.401 0.60 6.00 Prom - 147385 147346 40 -2.56 7.05 PlyA - 148140 148135 6 1.05 7.04 Term - 164075 163937 139 1 1 89 49 121 0.858 5.74 7.03 Intr - 169989 169937 53 0 2 74 99 28 0.739 0.21 7.02 Intr - 171269 171174 96 2 0 90 75 30 0.642 2.11 7.01 Init - 190612 190460 153 1 0 67 80 200 0.984 17.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 78645 78811 167 1 2 83 38 139 0.961 6.48 S.002 Sngl - 115653 115345 309 0 0 74 32 170 0.826 5.90 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:122851511_123054021|GENSCAN_predicted_peptide_1|32_aa MAVDLASGNSGSNTAAGRCERDEYTFDYGSCF >gi568815590f:122851511_123054021|GENSCAN_predicted_CDS_1|99_bp atggctgtggacctggcctcagggaactctggctccaacacggctgctggcagatgtgag agagatgaatacacctttgattacggctcctgtttctga >gi568815590f:122851511_123054021|GENSCAN_predicted_peptide_2|71_aa MTNGNVPHPSGLQAKKKDAQTADGGDRSQSSWVNQVEPECKSCSNSRKYRGRPLAMEAWS SPRRRGDGGNW >gi568815590f:122851511_123054021|GENSCAN_predicted_CDS_2|216_bp atgaccaatgggaatgtgccgcatccttcaggtctgcaggcgaagaaaaaagatgcacag acagctgatggaggtgacaggagccagagctcttgggtcaatcaggtagaacctgagtgc aaaagctgcagcaactcccgcaagtatcgtgggcgtccgctggccatggaagcctggtct tcaccgaggaggaggggtgatggaggaaactggtag >gi568815590f:122851511_123054021|GENSCAN_predicted_peptide_3|63_aa MVVRSIRVNETLQDIKMEISKQGSKDWQRGENKKRRLIPGCIPAGQGDPACTSSSRILLS HRG >gi568815590f:122851511_123054021|GENSCAN_predicted_CDS_3|192_bp atggtggtccgctccattcgagtgaatgaaacacttcaggacattaaaatggaaatcagc aagcaaggctccaaagactggcaacgaggcgagaacaagaagaggaggctcattcctggc tgcatccctgctgggcagggtgacccggcctgcacatcttccagccgcatccttttgagc catcgaggctga >gi568815590f:122851511_123054021|GENSCAN_predicted_peptide_4|144_aa MGNQHGLLLKEEMGVGKEMRKPGDHLDLGQGLRSSCKRMHVEMFCGLQCTLKTQGILITI SSITAVITGAEGGVEDRDDVIRFVFRNMMFSHGISCEVWGSKQSAAKNKALKFAYQTRRL CSKLQVPDFMVLVLTLLVVHGLSK >gi568815590f:122851511_123054021|GENSCAN_predicted_CDS_4|435_bp atggggaaccagcatggattattactgaaagaagagatgggtgtgggaaaggaaatgagg aagccaggtgatcatcttgacttgggacaagggctaagaagctcctgcaagaggatgcat gtagaaatgttttgtgggctgcaatgcacactcaagacccaagggatacttattaccatc agcagcatcaccgcggtcattactggtgcagagggaggtgttgaagacagggatgacgtg atcagattcgtgtttagaaacatgatgttcagccatggcatcagttgcgaggtatggggc agcaagcaatctgcggctaaaaataaagccttgaagtttgcatatcagacccgaagactt tgctcaaaactgcaagtaccagacttcatggtgctggtcctcaccttgctggtggttcac gggctttcaaaatga >gi568815590f:122851511_123054021|GENSCAN_predicted_peptide_5|837_aa MASKRKSTTPCMVRTSQVVEQDVPEEVDRAKEKGIGTPQPDVAKDSWAAELENSSKENEV IEVKSMGESQSKKLQGGYECKYCPYSTQNLNEFTEHVDMQHPNVILNPLYVCAECNFTTK KYDSLSDHNSKFHPGEANFKLKLIKRNNQTVLEQSIETTNHVVSITTSGPGTGDSDSGIS VSKTPIMKPGKPKADAKKVPKKPEEITPENHVEGTARLVTDTAEILSRLGGVELLQDTLG HVMPSVQLPPNINLVPKVPVPLNTTKYNSALDTNATMINSFNKFPYPTQAELSWLTAASK HPEEHIRIWFATQRLKHGISWSPEEVEEARKKMFNGTIQSVPPTITVLPAQLAPTKVTQP ILQTALPCQILGQTSLVLTQVTSGSTTVSCSPITLAVAGVTNHGQKRPLVTPQAAPEPKR PHIAQVPEPPPKVANPPLTPASDRKKTKEQIAHLKASFLQSQFPDDAEVYRLIEVTGLAR SEIKKWFSDHRYRCQRGIVHITSESLAKDQLAIAASRHGRTYHAYPDFAPQKFKEKTQGQ VKILEDSFLKSSFPTQAELDRLRVETKLSRREIDSWFSERRKLRDSMEQAVLDSMGSGKK GQDVGAPNGALSRLDQLSGAQLTSSLPSPSPAIAKSQEQVHLLRSTFARTQWPTPQEYDQ LAAKTGLVRTEIVRWFKENRCLLKTGTVKWMEQYQHQPMADDHGYDAVARKATKPMAESP KNGGDVVPQYYKDPKKLCEEDLEKLVTRVKVGSEPAKDCLPAKPSEATSDRSEGSSRDGQ GSDENEESSVVDYVEVTVGEEDAISDRSDSWSQAAAEGVSELAESDSDCVPAEAGQA >gi568815590f:122851511_123054021|GENSCAN_predicted_CDS_5|2514_bp atggctagcaaacgaaaatctacaactccatgcatggttcggacatcacaagtagtagaa caagatgtgcccgaggaagtagacagggccaaagagaaaggaatcggcacaccacagcct gacgtggccaaggacagttgggcagcagaacttgaaaactcttccaaagaaaacgaagtg atagaggtgaaatctatgggggaaagccagtccaaaaaactccaaggtggttatgagtgc aaatactgcccctactccacgcaaaacctgaacgagttcacggagcatgtcgacatgcag catcccaacgtgattctcaaccccctctacgtgtgtgcagaatgtaacttcacaaccaaa aagtacgactccctatccgaccacaactccaagttccatcccggggaggccaacttcaag ctgaagttaattaaacgcaataatcaaactgtcttggaacagtccatcgaaaccaccaac catgtcgtgtccatcaccaccagtggccctggaactggtgacagtgattctgggatctcg gtgagtaaaacccccatcatgaagcctggaaaaccaaaagcggatgccaagaaggtgccc aagaagcccgaggagatcacccccgagaaccacgtggaagggaccgcccgcctggtgaca gacacagctgagatcctctcgagactcggcggggtggagctcctccaagacacattagga cacgtcatgccttctgtacagctgccaccaaatatcaaccttgtgcccaaggtccctgtc ccactaaatactaccaaatacaactctgccctggatacaaatgccacgatgatcaactct ttcaacaagtttccttacccgacccaggctgagttgtcctggctgacagctgcctccaaa cacccagaggagcacatcagaatctggtttgccacccagcgcttaaagcatggcatcagc tggtccccagaagaggtggaggaggcccggaagaagatgttcaacggcaccatccagtca gtacccccgaccatcactgtgctgcccgcccagttggcccccacaaaggtgacgcagccc atcctccagacggctctaccgtgccagatcctcggccagactagcctggtgctgactcag gtgaccagcgggtcaacaaccgtctcttgctcccccatcacacttgccgtggcaggagtc accaaccatggccagaagagacccttggtgactccccaagctgcccccgaacccaagcgt ccacacatcgctcaggtgccagagcccccacccaaggtggccaaccccccgctcacacca gccagtgaccgcaagaagacaaaggagcagatagcacatctcaaggccagctttctccag agccagttccctgacgatgccgaggtttaccggctcatcgaggtgactggccttgccagg agcgagatcaagaagtggttcagtgaccaccgatatcggtgtcaaaggggcatcgtccac atcaccagcgaatcccttgccaaagaccagttggccatcgcggcctcccgacacggtcgc acgtatcatgcgtacccagactttgccccccagaagttcaaagagaaaacacagggtcag gttaaaatcttggaagacagctttttgaaaagttcttttcctacccaagcagaactggat cggctaagggtggagaccaagctgagcaggagagagatcgactcctggttctcggagagg cggaagcttcgagacagcatggaacaagctgtcttggattccatggggtctggcaaaaaa ggccaagatgtgggagcccccaatggtgctctgtctcgactcgaccagctctccggtgcc cagttaacaagttctctgcccagcccttcgccagcaattgcaaaaagtcaagaacaggtt catctcctgaggagcacgtttgcaagaacccagtggcctactccccaggagtacgaccag ttagcggccaagactggcctggtccgaactgagattgtgcgttggttcaaggagaacaga tgcttgctgaaaacgggaaccgtgaagtggatggagcagtaccagcaccagcccatggca gatgatcacggctacgatgccgtagcaaggaaagcaacaaaacccatggccgagagccca aagaacgggggtgatgtggttccacaatattacaaggaccccaaaaagctctgcgaagag gacttggagaagttggtgaccagggtaaaagtaggcagcgagccagcaaaagactgtttg ccagcaaagccctcagaggccacctcagaccggtcagagggcagcagccgggacggccag ggtagcgacgagaacgaggagtcgagcgttgtggattacgtggaggtgacggtcggggag gaggatgcgatctcagatagatcagatagctggagtcaggctgcggcagaaggtgtgtcg gaactggctgaatcagactccgactgcgtccctgcagaggctggccaggcctag >gi568815590f:122851511_123054021|GENSCAN_predicted_peptide_6|246_aa MTKVHALEDDWFLLCVLLKAVSVPDAMPVKQEPESGWPSMTTNMKERLLQRKTIHSPGLE VFEEILRGYNTRGTYANMSARFELQFAMSDPSRMENEFETGGIQVRTLTPSPREPDDNKE GEVEEKILGDKAQGGKTATDVLSPNPAKERWVSTGLQDGVERAVELEKAGIQDLPLPGYV LSTSSCHLVKKVPCFPFAFRHDRKFPEASPAMLNFIPNHQELETVDRNQEAAKVVAGALE SILDTG >gi568815590f:122851511_123054021|GENSCAN_predicted_CDS_6|741_bp atgacgaaggtccatgccttggaggatgactggtttctgctgtgtgtgctgctgaaggca gtcagtgtccctgatgccatgccagttaaacaggaacctgaatctggatggccttccatg accacaaacatgaaagaaagactcctacaaaggaagactatccactctcctggactggaa gtgtttgaagaaatcctcaggggatataacacccggggcacatatgccaatatgtcggcc cgttttgaactccagtttgcgatgtcagatccctccaggatggaaaatgaatttgaaacg ggaggtatccaagtcagaacccttacaccaagccctcgggagccagacgacaataaagaa ggcgaagtagaggaaaagatcctaggagacaaggcccaaggcgggaagacagctaccgat gttctttctcccaacccggctaaagaacgttgggtcagcacaggactgcaggatggggtg gaaagagctgtggagttggagaaagctgggatccaggatctgccacttcctggctacgtg ctgagcacttcttcctgccaccttgtgaagaaggtgccttgcttccccttcgccttccgc catgatcgtaagtttcctgaggcctccccagccatgctgaacttcatccctaatcatcag gagttggaaactgttgacagaaatcaggaggcagcgaaggtggtggctggagctttggag tcaattttagacactggctga >gi568815590f:122851511_123054021|GENSCAN_predicted_peptide_7|146_aa MSDIGDWFRSIPAITRYWFAATVAVPLVGKLGLISPAYLFLWPEAFLYRFQLLMIPLIMS VLYVWAQLNRDMIVSFWFGTRFKACYLPWVILGFNYIIGGSYRWLPSRRGGVSGFGVPPA SMRRAADQNGGGGRHNWGQGFRLGDQ >gi568815590f:122851511_123054021|GENSCAN_predicted_CDS_7|441_bp atgtcggacatcggagactggttcaggagcatcccggcgatcacgcgctattggttcgcc gccaccgtcgccgtgcccttggtcggcaaactcggcctcatcagcccggcctacctcttc ctctggcccgaagccttcctttatcgctttcagttgctgatgattcctctgatcatgtca gtactttatgtctgggcccagctgaacagagacatgattgtatcattttggtttggaaca cgatttaaggcctgctatttaccctgggttatccttggattcaactatatcatcggaggc tcgtaccgctggctgcccagtaggagaggaggagtatcaggatttggtgtgccccctgct agcatgaggcgagctgctgatcagaatggcggaggcgggagacacaactggggccagggc tttcgacttggagaccagtga