GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:59:06 Sequence gi568815595f:16170953_16403559 : 232607 bp : 43.95% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4200 4738 539 2 2 82 93 433 0.840 35.54 1.02 Intr + 12581 12619 39 2 0 68 121 15 0.064 0.14 1.03 Intr + 24808 24974 167 1 2 140 63 272 0.393 29.50 1.04 Intr + 29667 29871 205 1 1 93 93 144 0.633 13.66 1.05 Intr + 37551 37718 168 0 0 57 109 137 0.951 11.76 1.06 Intr + 40172 40289 118 2 1 88 95 78 0.521 8.97 1.07 Intr + 41617 41811 195 0 0 91 116 85 0.524 11.21 1.08 Intr + 48958 49062 105 0 0 91 94 69 0.995 8.21 1.09 Intr + 51663 51806 144 2 0 75 105 124 0.955 13.28 1.10 Term + 56402 56548 147 1 0 122 42 165 0.999 13.20 1.11 PlyA + 58065 58070 6 1.05 2.00 Prom + 61683 61722 40 -2.06 2.01 Init + 72318 72423 106 2 1 79 71 85 0.574 6.28 2.02 Term + 80644 80756 113 2 2 99 54 41 0.207 0.52 2.03 PlyA + 81175 81180 6 1.05 3.04 PlyA - 81202 81197 6 1.05 3.03 Term - 81550 81417 134 2 2 112 41 65 0.287 2.45 3.02 Intr - 88914 88839 76 0 1 49 98 9 0.054 -2.91 3.01 Init - 93924 93817 108 0 0 85 88 167 0.652 16.62 3.00 Prom - 99306 99267 40 -0.46 4.12 PlyA - 100657 100652 6 1.05 4.11 Term - 112745 112635 111 2 0 100 42 55 0.081 0.66 4.10 Intr - 125147 125086 62 0 2 97 87 28 0.006 2.05 4.09 Intr - 134162 134140 23 1 2 73 119 -15 0.004 -2.71 4.08 Intr - 144706 144673 34 2 1 80 111 48 0.228 3.58 4.07 Intr - 146280 145896 385 0 1 68 60 251 0.138 14.72 4.06 Intr - 152505 152424 82 2 1 66 116 27 0.611 2.94 4.05 Intr - 155924 155821 104 2 2 101 92 177 0.898 18.37 4.04 Intr - 167017 166952 66 1 0 99 80 15 0.127 0.90 4.03 Intr - 176733 176680 54 0 0 91 105 -11 0.099 0.08 4.02 Intr - 187095 186980 116 1 2 146 100 104 0.994 17.57 4.01 Init - 199325 199124 202 2 1 81 80 267 0.882 24.24 4.00 Prom - 199993 199954 40 -1.86 5.03 PlyA - 200445 200440 6 1.05 5.02 Term - 207150 206704 447 0 0 118 36 141 0.074 7.12 5.01 Init - 209226 209224 3 0 0 83 101 0 0.436 0.80 5.00 Prom - 212697 212658 40 -3.66 6.00 Prom + 214617 214656 40 -2.86 6.01 Init + 216939 217038 100 2 1 56 50 135 0.456 4.93 6.02 Intr + 217360 217384 25 2 1 93 98 23 0.235 0.88 6.03 Intr + 230442 230541 100 0 1 111 68 1 0.000 0.51 6.04 Intr + 231854 231958 105 1 0 27 55 103 0.000 1.31 6.05 Term + 232081 232146 66 0 0 57 55 79 0.000 -0.56 6.06 PlyA + 232550 232555 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 7735 7960 226 1 1 67 46 148 0.852 4.75 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:16170953_16403559|GENSCAN_predicted_peptide_1|608_aa MLLRKRYRHRPCRLQFLLLLLMLGCVLMMVAMLHPPHHTLHQTVTAQASKHSPEARYRLD FGESQDWVLEAEDEGEEYSPLEGLPPFISLREDQLLVAVALPQARRNQSQGRRGGSYRLI KQPRRQDKEAPKRDWGADEDGEVSEEEELTPFSLDPRGLQEALSARIPLQRALPEVRHPL EQIAIEQLSEVRRCLQQHPQDSLPTASVILCFHDEAWSTLLRTVHSILDTVPRAFLKEII LVDDLSQQGQLKSALSEYVARLEGVKLLRSNKRLGAIRARMLGATRATGDVLVFMDAHCE CHPGWLEPLLSRIAGDRSRVVSPVIDVIDWKTFQYYPSKDLQRGVLDWKLDFHWEPLPEH VRKALQSPISPIRSPVVPGEVVAMDRHYFQNTGAYDSLMSLRGGENLELSFKAWLCGGSV EILPCSRVGHIYQNQDSHSPLDQEATLRNRVRIAETWLGSFKETFYKHSPEAFSLSKLHN TGLGLCADCQAEGDILGCPMVLAPCSDSRQQQYLQHTSRKEIHFGSPQHLCFAVRQEQVI LQNCTEEGLAIHQQHWDFQENGMIVHILSGKCMEAVVQENNKDLYLRPCDGKARQQWRFD QINAVDER >gi568815595f:16170953_16403559|GENSCAN_predicted_CDS_1|1827_bp atgctcctaaggaagcgatacaggcacagaccatgcagactccagttcctcctgctgctc ctgatgctgggatgcgtcctgatgatggtggcgatgttgcaccctccccaccacaccctg caccagactgtcacagcccaagccagcaagcacagccctgaagccaggtaccgcctggac tttggggaatcccaggattgggtactggaagctgaggatgagggtgaagagtacagccct ctggagggcctgccaccctttatctcactgcgggaggatcagctgctggtggccgtggcc ttaccccaggccagaaggaaccagagccagggcaggagaggtgggagctaccgcctcatc aagcagccaaggaggcaggataaggaagccccaaagagggactggggggctgatgaggac ggggaggtgtctgaagaagaggagttgaccccgttcagcctggacccacgtggcctccag gaggcactcagtgcccgcatccccctccagagggctctgcccgaggtgcggcacccactt gaacaaatagctattgaacaactatcagaagtaagaaggtgtctgcagcagcaccctcag gacagcctgcccacagccagcgtcatcctctgtttccatgatgaggcctggtccactctc ctgcggactgtacacagcatcctcgacacagtgcccagggccttcctgaaggagatcatc ctcgtggacgacctcagccagcaaggacaactcaagtctgctctcagcgaatatgtggcc aggctggagggggtgaagttactcaggagcaacaagaggctgggtgccatcagggcccgg atgctgggggccaccagagccaccggggatgtgctcgtcttcatggatgcccactgcgag tgccacccaggctggctggagcccctcctcagcagaatagctggtgacaggagccgagtg gtatctccggtgatagatgtgattgactggaagactttccagtattacccctcaaaggac ctgcagcgtggggtgttggactggaagctggatttccactgggaacctttgccagagcat gtgaggaaggccctccagtcccccataagccccatcaggagccctgtggtgcccggagag gtggtggccatggacagacattacttccaaaacactggagcgtatgactctcttatgtcg ctgcgaggtggtgaaaacctcgaactgtctttcaaggcctggctctgtggtggctctgtt gaaatccttccctgctctcgggtaggacacatctaccaaaatcaggattcccattccccc ctcgaccaggaggccaccctgaggaacagggttcgcattgctgagacctggctggggtca ttcaaagaaaccttctacaagcatagcccagaggccttctccttgagcaagctccacaac actggacttgggctctgtgcagactgccaggcagaaggggacatcctgggctgtcccatg gtgttggctccttgcagtgacagccggcagcaacagtacctgcagcacaccagcaggaag gagattcactttggcagcccacagcacctgtgctttgctgtcaggcaggagcaggtgatt cttcagaactgcacggaggaaggcctggccatccaccagcagcactgggacttccaggag aatgggatgattgtccacattctttctgggaaatgcatggaagctgtggtgcaagaaaac aataaagatttgtacctgcgtccgtgtgatggaaaagcccgccagcagtggcgttttgac cagatcaatgctgtggatgaacgatga >gi568815595f:16170953_16403559|GENSCAN_predicted_peptide_2|72_aa MEVRPELGLISKEAQGCDTGHRNICNKSQPTKVPSAQKGSSPLEFAPSDKPGQLHPSIEI VTCFLKVTSEQS >gi568815595f:16170953_16403559|GENSCAN_predicted_CDS_2|219_bp atggaagttcggccagagctggggctgatatccaaggaggcccaaggatgcgacactgga caccgaaacatctgcaataagagccaaccaacaaaggttccctcagcccagaagggcagc agtcctctcgagtttgccccttcagacaaaccaggacagcttcatccaagtatagaaata gtcacctgctttctcaaagtcacatcagaacagagctga >gi568815595f:16170953_16403559|GENSCAN_predicted_peptide_3|105_aa MAVFHDEVEIEDFQYDEDSETYFYPCPCGDNFSITKVGVNVKFPWLVAVLLAYHSVVSLL QASSLIHFSGAQQFVMQTVSNQDSSSHFKSDMIRVTIAETLNFGP >gi568815595f:16170953_16403559|GENSCAN_predicted_CDS_3|318_bp atggcagtgtttcatgacgaggtggaaatcgaggacttccaatatgacgaggactcggag acgtatttctatccctgcccatgtggagataacttctccatcaccaaggtaggggttaac gtcaaatttccatggctggtagctgtgcttttggcatatcacagtgttgtgtcactacta caagcgagttccctgatacatttcagtggtgcccaacagtttgttatgcagactgtttca aatcaggacagcagctctcacttcaagtctgatatgatccgcgtgaccatagctgaaacc ttgaactttggaccttaa >gi568815595f:16170953_16403559|GENSCAN_predicted_peptide_4|412_aa MEIFTLFNKPKSHQKCRQYYPVTIPLHVSKNGQTVSGLDANWLEHMSDHFRKGGMLVNAV FYLGIVNDSLHGLTDGVFIFEAVSTEDSKTIQGYDAIVVEQWTVLEFPFHRESSFQRGGG VHTEGKLVALLGLTVGYHKLDFRAPEGVEVQTDYVPLLNSLAAYGWQLTCVLPTPVVKTT SEGSVSTKQIVFLQRPCLPQKIKKKESKFQWRFSREEMHNRQMRKSKGKLSARDKQQAEE NEKNLEDQSSKAGDMGNCVSGQQQEGGVSEEMKGPVQEDKGEQLSPGGLLCGVGVEGEAV QNGPASHSRALVGICTGHSNPGEDARDGDAEEVRELERDGKWNSDATSLGTLGPTGGPST ATKRNCVSIPSLQARAVEALKDSQTTTLLPPSADNSSLLFCQSQTDINTPHK >gi568815595f:16170953_16403559|GENSCAN_predicted_CDS_4|1239_bp atggagatcttcacccttttcaacaaaccgaagagccatcagaagtgccggcaatactac cctgtcaccattcctctccatgtctccaagaatggccagacagtgagcggtttggacgcc aactggttagagcacatgagcgaccacttccggaaaggaggcatgctggtgaacgcagtc ttctaccttggaatagtgaatgattccttacatggcttgacagatggagtattcatcttt gaagctgtttccacagaagatagcaaaaccatacagggctatgatgctattgtggttgaa caatggacagtcctggaattcccctttcacagagaaagctccttccagagaggaggtggt gtacacacagagggcaagctggtggcactgctgggcctcacagtaggttatcataaatta gacttcagggctccagagggtgtcgaagtgcagacagactacgtgcccctgctgaactcg ctggcggcctatggctggcagctcacctgtgtgctaccaactcccgtcgtcaagactacc agcgaggggagtgtatccaccaagcagattgtctttcttcagagaccttgtctacctcag aaaatcaagaagaaggaatcgaagtttcagtggcgattctccagagaagaaatgcacaac aggcagatgaggaaatcaaaaggtaaactcagtgccagagacaaacaacaagcagaagaa aatgagaagaacttagaagaccagtcttccaaagctggagacatgggaaactgtgtttca ggacagcagcaggagggtggagtctccgaggagatgaagggccctgtccaagaggacaag ggagaacagctgtcccctggtggcctgctgtgtggggtgggtgtggagggtgaggctgtg cagaatggtcctgccagccacagcagggccctggtggggatttgcactgggcactccaat cctggagaggatgccagggacggggatgctgaggaagtcagagagcttgaaagagatggc aagtggaactcagatgccaccagcctgggcactttggggccaactggtgggccttctaca gccacaaaacgcaactgtgtttccataccctctttgcaggcaagagctgttgaggccctc aaggactcacagactactactctcctcccaccttctgctgacaactcatcattacttttc tgccaatcacaaactgacatcaacactccccataagtaa >gi568815595f:16170953_16403559|GENSCAN_predicted_peptide_5|149_aa MIQEAASQGLKFVGVIPQYHSSVNSAGSSAPVSTANSTEDARDAKNARGDHASLENEKPG TGDVCSAPAGRNQSPEPSSGPRGEVPLAKQPSSPSGEGDGGELSPQGVSKTLDGPESNPL EVHEEPLSGSKYVSNGSFDPLKEQLTAFH >gi568815595f:16170953_16403559|GENSCAN_predicted_CDS_5|450_bp atgatccaggaggctgcaagccagggcctgaaattcgttggtgttatacctcagtaccat tcctctgtgaactcggcaggcagcagtgctccggtgtctactgccaacagcaccgaggat gccagagatgcaaaaaacgcacgtggggatcacgcgtcactggagaatgagaaaccgggg actggggatgtgtgcagtgctccggctgggagaaaccaaagcccagagcccagctcaggc cccagaggggaggtgcccctcgccaagcagcccagctcaccctccggagagggagatggt ggagaactttcaccacagggggtgagcaagacactggatggaccggagagcaaccccttg gaggtgcatgaagagccactctcagggagtaagtatgtcagcaatgggagttttgaccca cttaaagagcagctaacagcttttcattaa >gi568815595f:16170953_16403559|GENSCAN_predicted_peptide_6|131_aa MLFLGLTTTFLLVFFLHAGLYTALTAARLVSERSSGTVPGTQHIYLSPWQEGLCLFYTMT TALAKLTRGIQWPNPDSTLIEWDQPPLNGLHCKGSVGQCKMNLVLKEIREAGRGNFHQQN LGGKGLEPILH >gi568815595f:16170953_16403559|GENSCAN_predicted_CDS_6|396_bp atgctcttccttggcctcaccaccacctttctgctggttttcttcctgcatgctggcctg tatacagccctcacagcagccaggctggtatctgaaagatcatctggaactgttcctggt acacaacacatttacttaagcccatggcaggaaggcctctgcctgttttatactatgaca actgctcttgcaaaacttacccgtggcatccagtggccaaacccggacagtacactcatt gagtgggaccagccgcccctcaacggtcttcattgcaaaggctcagtgggccagtgcaag atgaacctagttcttaaggagatcagggaggcaggacgaggcaatttccatcagcagaac ctgggcggcaagggcctggagcccatccttcactga