GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:49:52 Sequence gi568815589f:123912349_124132704 : 220356 bp : 48.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1755 1860 106 2 1 71 22 132 0.471 5.28 1.02 Intr + 57775 57893 119 2 2 104 81 12 0.012 2.28 1.03 Term + 64329 64439 111 2 0 118 55 49 0.065 3.16 1.04 PlyA + 64718 64723 6 1.05 2.00 Prom + 67439 67478 40 -4.26 2.01 Init + 77970 78027 58 2 1 15 84 49 0.079 -1.33 2.02 Intr + 85764 85970 207 1 0 83 82 66 0.450 4.55 2.03 Intr + 87817 87901 85 2 1 97 116 15 0.087 4.08 2.04 Intr + 100015 100120 106 1 1 61 78 254 0.089 21.92 2.05 Intr + 101613 101815 203 2 2 117 68 422 0.999 41.18 2.06 Intr + 102774 103177 404 0 2 98 105 558 0.970 52.57 2.07 Intr + 108751 108956 206 2 2 97 76 377 0.815 36.32 2.08 Term + 120072 120359 288 2 0 143 39 198 0.999 15.78 2.09 PlyA + 120931 120936 6 1.05 3.13 PlyA - 124496 124491 6 1.05 3.12 Term - 126110 126028 83 0 2 64 44 75 0.538 -1.44 3.11 Intr - 126732 126561 172 0 1 88 70 84 0.872 6.12 3.10 Intr - 130235 130120 116 0 2 82 94 99 0.856 10.07 3.09 Intr - 140284 140170 115 1 1 55 90 13 0.005 -1.78 3.08 Intr - 148885 148754 132 1 0 91 94 6 0.071 2.34 3.07 Intr - 149540 149436 105 2 0 114 78 -19 0.049 0.11 3.06 Intr - 166435 166349 87 1 0 114 92 11 0.328 4.27 3.05 Intr - 167485 167437 49 0 1 118 54 28 0.304 1.08 3.04 Intr - 171950 171878 73 0 1 87 111 13 0.040 1.96 3.03 Intr - 180064 180022 43 1 1 90 81 42 0.260 1.51 3.02 Intr - 181460 181291 170 0 2 65 65 82 0.064 3.27 3.01 Init - 193694 193664 31 2 1 68 71 48 0.149 1.00 3.00 Prom - 198634 198595 40 -5.36 4.00 Prom + 211218 211257 40 -3.26 4.01 Init + 213181 213361 181 0 1 84 60 181 0.866 12.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100120 120 1 0 93 78 235 0.910 23.19 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:123912349_124132704|GENSCAN_predicted_peptide_1|111_aa MHFAKKHKKGQKTQANKMKTIKAFVKLKVLKGTCVDLSKLSSPVNLLTPPEPSPTLSHMQ STAMPITFLPYVVQLSSGSGRSSSTRTPGEAACDPNLSQSVHSTAVATVIG >gi568815589f:123912349_124132704|GENSCAN_predicted_CDS_1|336_bp atgcactttgccaagaaacacaagaagggtcaaaagacacaggccaataaaatgaagacc atcaaggcctttgtcaagctcaaggtcttgaagggcacctgtgttgatctgagtaaatta tcttcccctgtgaatttgctgacaccccctgagcccagccccactctttctcatatgcaa agcactgcgatgcccatcacctttcttccttatgttgtgcagctgtccagtggttctggt agaagcagttccaccaggactccaggggaagcggcatgtgacccaaacctaagccaatcc gtgcattccactgctgtggccacagtgattggctga >gi568815589f:123912349_124132704|GENSCAN_predicted_peptide_2|518_aa MGEDWKNMSHIVNSCYFGGASSRSPPHTGHSGFRAGLAPSCQERSPSTTSFSTPCPLPTN CNPGAPIDFRSQLRCHHTLGKAFLSPRPARHPRPPPPGPGPGPPAEPATPAAEDSPSLSG PEVHGVIDEMDRRAKSEAPAISSAIDRGDTETTMPSISSDRAALCAGCGGKISDRYYLLA VDKQWHMRCLKCCECKLNLESELTCFSKDGSIYCKEDYYRRFSVQRCARCHLGISASEMV MRARDLVYHLNCFTCTTCNKMLTTGDHFGMKDSLVYCRLHFEALLQGEYPAHFNHADVAA AAAAAAAAKSAGLGAAGANPLGLPYYNGVGTVQKGRPRKRKSPGPGADLAAYNAALSCNE NDAEHLDRDQPYPSSQKTKRMRTSFKHHQLRTMKSYFAINHNPDAKDLKQLAQKTGLTKR VLQVWFQNARAKFRRNLLRQENTGVDKSTDAALQTGTPSGPASELSNASLSPSSTPTTLT DLTSPTLPTVTSVLTSVPGNLEGHEPHSPSQTTLTNLF >gi568815589f:123912349_124132704|GENSCAN_predicted_CDS_2|1557_bp atgggagaagactggaagaatatgtctcatattgtcaacagctgctattttgggggagcc tcatcccgctccccaccccacaccggccactccggctttcgcgctggccttgctccctcg tgccaggaacgctctccctccaccacctcttttagcaccccctgcccacttcctaccaac tgcaatcctggtgccccaattgacttcaggtctcagcttcgatgtcatcacactttggga aaggccttcctgagccccagaccagcccggcacccccgccccccgccccccggccctggc ccggggccgcccgcagagccggccacacccgctgcggaggactcaccaagtctgtcgggc cccgaggtgcacggggtcatcgacgagatggaccgcagggccaagagcgaggctcccgcc atcagctccgccatcgaccgcggcgacaccgagacgaccatgccgtccatcagcagtgac cgcgccgcgctgtgcgccggctgcgggggcaagatctcggaccgctactacctgctggcg gtggacaagcagtggcacatgcgctgcctcaagtgctgcgagtgcaagctcaacctggag tcggagctcacctgtttcagcaaggacggtagcatctactgcaaggaagactactacagg cgcttctctgtgcagcgctgcgcccgctgccacctgggcatctcggcctcggagatggtg atgcgcgctcgggacttggtttatcacctcaactgcttcacgtgcaccacgtgtaacaag atgctgaccacgggcgaccacttcggcatgaaggacagcctggtctactgccgcttgcac ttcgaggcgctgctgcagggcgagtaccccgcacacttcaaccatgccgacgtggcagcg gcggccgctgcagccgcggcggccaagagcgcggggctgggcgcagcaggggccaaccct ctgggtcttccctactacaatggcgtgggcactgtgcagaaggggcggccgaggaaacgt aagagcccgggccccggtgcggatctggcggcctacaacgctgcgctaagctgcaacgaa aacgacgcagagcacctggaccgtgaccagccatacccgagcagccagaagaccaagcgc atgcgcacgtccttcaagcaccaccagcttcggaccatgaagtcttactttgccattaac cacaaccccgacgccaaggacttgaagcagctcgcgcaaaagacgggcctcaccaagcgg gtcctccaggtctggttccagaacgcccgagccaagttcaggcgcaacctcttacggcag gaaaacacgggcgtggacaagtcgacagacgcggcgctgcagacagggacgccatcgggc ccggcctcggagctctccaacgcctcgctcagcccctccagcacgcccaccaccctgaca gacttgactagccccaccctgccaactgtgacgtccgtcttaacttctgtgcctggcaac ctggagggccatgagcctcacagcccctcacaaacgactcttaccaaccttttctaa >gi568815589f:123912349_124132704|GENSCAN_predicted_peptide_3|391_aa MAPEKAMQQRATLAMKGFRWTPNVKGWPSPGVIPMTIQSDDSKKMNGPHTTKTTDGIHMD SFRSLMLAGKSKLNVTVNFVPDEKTEDERDRMTCSRTHSREVASPRCLCICEHVVYVCAV MVIQEHHVPALYSCSGGFPHAGIGFYSTRKYARCHRLSVCFASRVSFSLTAAWEGKEEPH PHLPDEETQELHQEEVGRKQAFGTPSSCTPHPRSPGADPTRLLPPRPGQHESSRACAGGS AELKVHSGSWEMDQVGFPAWKWHHSIDGEQGKLERFESWRQDPKMQPELCFLITPNPSQP MERRGGQVPTCKPLPLFELTLIIPKMKDVPVRIPEGLLKNAQVIVINHHHQVGTREWVEY RGYKRSGGEPVGECGNAAGVVLTSSSGQRLF >gi568815589f:123912349_124132704|GENSCAN_predicted_CDS_3|1176_bp atggctccagagaaggccatgcagcagagagccaccctggcaatgaagggattcaggtgg actccaaatgtgaaaggttggccttctccaggggtcattcccatgaccatacagagtgat gactccaaaaaaatgaatgggccccacaccaccaagaccacagatggaattcacatggac tccttccgatctctcatgctggctgggaagtccaaactcaatgtgacagtcaatttcgtt cctgatgagaaaacggaagatgagagagatcgaatgacttgttcaaggacacacagtcgg gaagtagcatctcccaggtgtctgtgtatctgtgagcatgtggtgtatgtgtgcgctgtc atggtgatccaggagcaccatgtgcctgccctatacagctgctctgggggcttcccccat gctggaattggtttctactccaccaggaagtatgctcggtgtcatcggctgagtgtgtgt tttgcatctcgtgtctcctttagtctaacagcagcgtgggaaggcaaagaggagcctcat ccccatcttccagatgaggaaactcaagagctgcaccaggaagaggtaggaagaaagcag gcctttggtacaccatctagctgcactcctcacccacggtctccgggtgcagacccaaca aggctgctcccacctcgcccggggcagcacgagagttccagggcctgtgccgggggctct gctgagcttaaagtccactccgggtcctgggagatggaccaggtgggttttcctgcctgg aagtggcatcactcaattgatggagagcagggcaagctggagcgctttgagagctggcgc caggatcctaaaatgcagccagagctttgtttcctcatcacacccaacccgagccagccc atggaacggcgcggggggcaggtgcccacatgtaagccccttcccctctttgagcttacc ctgatcatccctaagatgaaggatgtgcctgtccgaatccctgagggacttttaaaaaat gcacaggtcattgtcatcaatcatcatcatcaagttggaaccagggagtgggtggagtat agaggttacaagagaagtggtggcgagccagtaggggaatgtggaaacgccgctggagtg gttctgacgagcagctccggccagaggctcttctag >gi568815589f:123912349_124132704|GENSCAN_predicted_peptide_4|61_aa MCPPVSLGWGCPGQRPGPAFALKLALQEQLGFHLSPKDDTGAGGLLLLEAEPTQRKAFVL X >gi568815589f:123912349_124132704|GENSCAN_predicted_CDS_4|183_bp atgtgcccgccggtcagcctgggctggggctgccctggtcagaggcctgggcctgccttt gccctcaagctggctcttcaggagcagcttgggttccacttgagccccaaggatgatact ggagccgggggcctgctgctgctggaagcagagccaacgcaaagaaaagcctttgttctt gnn