GENSCAN 1.0 Date run: 3-Jul-120 Time: 20:34:43 Sequence gi568815575f:136548249_136759412 : 211164 bp : 40.27% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 341 619 279 0 0 97 66 208 0.334 16.03 1.02 Intr + 2520 2599 80 1 2 131 66 13 0.468 1.75 1.03 Intr + 16231 16332 102 0 0 52 44 105 0.054 1.95 1.04 Intr + 17244 17432 189 2 0 89 113 23 0.712 3.76 1.05 Term + 19023 19109 87 2 0 153 43 67 0.971 5.38 1.06 PlyA + 19482 19487 6 1.05 2.05 PlyA - 19642 19637 6 1.05 2.04 Term - 28178 28097 82 1 1 72 53 95 0.299 0.59 2.03 Intr - 29727 29615 113 0 2 64 74 128 0.606 7.26 2.02 Intr - 35924 35866 59 0 2 64 115 48 0.403 2.78 2.01 Init - 46350 46251 100 0 1 71 100 65 0.731 6.47 2.00 Prom - 59233 59194 40 -5.35 3.03 PlyA - 59678 59673 6 1.05 3.02 Term - 61826 61616 211 1 1 1 50 191 0.110 2.38 3.01 Init - 83149 83007 143 1 2 12 99 148 0.819 7.95 3.00 Prom - 86644 86605 40 -7.35 4.00 Prom + 89219 89258 40 -5.15 4.01 Init + 89664 89719 56 2 2 83 45 113 0.749 5.31 4.02 Intr + 99561 99777 217 0 1 73 -8 118 0.022 -1.72 4.03 Intr + 100106 100156 51 1 0 86 116 17 0.224 2.49 4.04 Intr + 102018 102149 132 2 0 82 121 117 0.998 14.32 4.05 Intr + 102411 102592 182 2 2 21 52 221 0.893 9.54 4.06 Intr + 109216 109284 69 1 0 87 56 84 0.341 2.48 4.07 Intr + 109653 109702 50 0 2 80 82 27 0.419 -1.29 4.08 Term + 110791 111167 377 2 2 95 47 328 0.886 23.32 4.09 PlyA + 112122 112127 6 1.05 5.19 PlyA - 112361 112356 6 1.05 5.18 Term - 119921 119781 141 2 0 90 41 131 0.878 5.55 5.17 Intr - 121288 121234 55 0 1 105 90 83 0.990 8.26 5.16 Intr - 123871 123772 100 2 1 104 87 165 0.993 16.25 5.15 Intr - 126848 126759 90 0 0 40 58 95 0.492 0.75 5.14 Intr - 128469 128376 94 0 1 92 53 78 0.901 3.32 5.13 Intr - 132628 132483 146 2 2 2 84 134 0.940 3.48 5.12 Intr - 133720 133642 79 1 1 72 82 57 0.953 1.71 5.11 Intr - 134596 134510 87 1 0 93 92 82 0.993 8.35 5.10 Intr - 137575 137429 147 1 0 70 94 88 0.967 7.11 5.09 Intr - 142500 142362 139 2 1 49 94 99 0.475 6.15 5.08 Intr - 158782 158660 123 0 0 105 99 -10 0.131 0.58 5.07 Intr - 160522 160427 96 0 0 70 106 77 0.929 5.91 5.06 Intr - 165122 165028 95 2 2 112 71 103 0.955 8.84 5.05 Intr - 183924 183854 71 1 2 85 103 47 0.581 3.88 5.04 Intr - 195538 195337 202 1 1 82 103 104 0.811 9.24 5.03 Intr - 197099 196975 125 0 2 120 103 74 0.999 11.48 5.02 Intr - 199344 199260 85 0 1 67 121 90 0.933 8.67 5.01 Init - 207437 207432 6 2 0 86 100 10 0.402 2.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:136548249_136759412|GENSCAN_predicted_peptide_1|245_aa XDSMSPNQWRYSSPWTKPQPEVPVTNRAANCNLHVPGPMAVNQFSPSLARRASVRPGELW HFSSLAGTSSLEPGYSHPFPARHLVPEPQPDGKHKKLYVSRGSASTSLPNESRYLGQPLM EPHPMAATSTGPWGVLPGYHRCPLKAQGLFSHLVGSTEFNPISHNPCALPLPSAQILSPP HTAAPWGWGRGSISDSRLSSLPSSSASFSDMKLKPGTVLLTRPHKPLTPLTPQLLLSGGP ADTRI >gi568815575f:136548249_136759412|GENSCAN_predicted_CDS_1|738_bp natgatagcatgtctccaaatcagtggcgttactcgtctccatggacaaagccacaacca gaagtacctgtcacaaaccgtgccgccaactgcaacttgcatgtgcctggtcccatggct gtgaatcagttctcaccgtccctggctaggagggcctctgttcggcctggggagctgtgg catttctcctccctggcgggcaccagctccttagagcctggctactctcatcccttcccc gctcggcacctggttccagagccccagcctgatgggaaacataagaaactatatgtatct cgtggatctgccagtaccagccttccaaatgaaagtaggtatctgggccagcctttgatg gagccacaccccatggctgccaccagcacaggcccatggggagtactgccaggttaccac cgatgtcctcttaaggcccaagggctcttcagtcatcttgtgggcagcacagagttcaat ccaatatctcacaatccctgtgctcttcctctcccaagtgcacagattctctctccaccc cacacagctgctccatggggatggggaaggggtagcatcagcgattcaagactgtcttcc ctaccctcttccagtgcctctttcagcgatatgaagttaaaaccaggtacggtgttactt accaggccccataaacccctcacacctttaactcctcagctcctgctttctggaggacct gctgatacaaggatatag >gi568815575f:136548249_136759412|GENSCAN_predicted_peptide_2|117_aa MAKGSSLKNKKMLTKGDMDLQKGRKNNGIGKNRSYEPEQHVAMLNTVGNGNTMTELAKDE DQNLSSLTPWHLAQNVAPSSEPGTERALNKRSIPEPTTLTDTGSQEAELFTQGHTVG >gi568815575f:136548249_136759412|GENSCAN_predicted_CDS_2|354_bp atggctaaaggaagttctctaaagaacaagaaaatgttaacaaaaggagatatggacctt cagaaaggaaggaagaataatggaattggtaaaaatagaagctacgaacctgaacagcat gttgctatgttgaatactgtaggcaatgggaacacaatgactgagcttgccaaggatgag gaccaaaacttgtcatctctgactccgtggcacctagctcagaacgtggcacctagctca gaacctggcaccgaaagggcactcaataaaaggtccatccccgagcccactactctaact gacactggctcccaggaagctgagctgtttacccaaggtcacacagttggatga >gi568815575f:136548249_136759412|GENSCAN_predicted_peptide_3|117_aa MKNEREPEKNSCVYICFDLPKAYRYVQEAKEPLQLHKKLWKAIKLMPRHGARSTVRGHVW TKVLRRSVSRKRRMEQVYCEEQRHQEKYHGPLEREPEIVAALLPVPGPTMVAVLGVP >gi568815575f:136548249_136759412|GENSCAN_predicted_CDS_3|354_bp atgaaaaatgaaagagaaccagagaaaaacagctgtgtatatatctgttttgatttacca aaagcctaccgctatgttcaagaagcgaaggagcccctacaacttcacaaaaagctgtgg aaagctataaagctgatgcccagacatggagctcggtcaaccgtgaggggccatgtatgg acaaaagttctgaggagatcagtcagtagaaagaggagaatggagcaggtatactgtgag gagcagagacaccaagagaaatatcatgggcctctggagagagaaccagaaattgtagct gccttgctacctgttccaggtcccactatggtggctgtcttgggagtaccatga >gi568815575f:136548249_136759412|GENSCAN_predicted_peptide_4|377_aa MRFLRLTVCPLLMNIFLLRFPLRISKIEIEISLLLVDEFVTFLETGEPKKLDSDRKILPL SVKKSMTFQGKNEYMEEETCFFFTYKKESLEMIGSALFAVYLHRRLDKIEDERNLHEDFV FMKTIQRCNTGERSLSLLNCEEIKSQFEGFVKLPLGKMMKSTKLGFCIAVSPHPNLDALA LGTQSLIPISQGSNSRQQLPWQQATSEGTPEDWVALSIKSANVAESDAVECCTTVSHAAK ILPYIPQQVLKLVLQWAEKGYYTMSNNLVTLENGKQLTVKRQGLYYIYAQVTFCSNREAS SQAPFIASLCLKSPGRFERILLRAANTHSSAKPCGQQSIHLGGVFELQPGASVFVNVTDP SQVSHGTGFTSFGLLKL >gi568815575f:136548249_136759412|GENSCAN_predicted_CDS_4|1134_bp atgcggttcctccgcttgactgtgtgtcctctgctgatgaacatctttctgctcaggttc ccgcttcgtattagtaagattgaaattgaaataagtctattgctggtggatgaatttgtc actttccttgaaactggtgaacccaaaaagttagacagtgataggaaaatactgccattg tctgttaagaagtctatgacatttcaaggcaagaatgaatatatggaagaagaaacttgt ttcttctttacttacaaaaaggaaagcctggaaatgattgggtcagcactttttgctgtg tatcttcatagaaggttggacaagatagaagatgaaaggaatcttcatgaagattttgta ttcatgaaaacgatacagagatgcaacacaggagaaagatccttatccttactgaactgt gaggagattaaaagccagtttgaaggctttgtgaagctaccactaggcaaaatgatgaag tccaccaagcttggtttttgcattgctgtgtctccccatccaaaccttgatgctctcgca ctggggacccagagtctgatccccatttcccagggaagcaatagccgtcaacagctgccg tggcagcaggccacaagtgaagggacacctgaagactgggtggctttgagcatcaaatca gctaatgtggccgaaagtgatgctgtcgagtgctgtacaaccgtaagccatgctgctaaa atcttgccctacatcccacagcaagtactaaaattagtgttacagtgggctgaaaaagga tactacaccatgagcaacaacttggtaaccctggaaaatgggaaacagctgaccgttaaa agacaaggactctattatatctatgcccaagtcaccttctgttccaatcgggaagcttcg agtcaagctccatttatagccagcctctgcctaaagtcccccggtagattcgagagaatc ttactcagagctgcaaatacccacagttccgccaaaccttgcgggcaacaatccattcac ttgggaggagtatttgaattgcaaccaggtgcttcggtgtttgtcaatgtgactgatcca agccaagtgagccatggcactggcttcacgtcctttggcttactcaaactctga >gi568815575f:136548249_136759412|GENSCAN_predicted_peptide_5|626_aa MKIFDPDDLYSGVNFSKVLSTLLAVNKATEDQLSERPCGRSSSLSAANTSQTNPQGAVSS TVSGLQRQSKTVEMTENGSHQLIVKARFNFKQTNEDELSVCKGDIIYVTRVEEGGWWEGT LNGRTGWFPSNYVREIKSSERPLSPKAVKGFETAPLTKNYYTVVLQNILDTEKEYAKELQ SLLVTYLRPLQSNNNLSTVEVTSLLGNFEEVCTFQQTLCQALEECSKFPENQHKVGGCLL SLMPHFKSMYLAYCANHPSAVNVLTQHSDELEQFMENQGASSPGILILTTNLSKPFMRLE KYVTLLQELERHMEGQCQDLRKRKQLELQILSEPIQAWEGEDIKNLGNVIFMSQVMVQYG ACEEKEERYLMLFSNVLIMLSASPRMSGFIYQGKIPIAGTVVTRLDEIEGNDCTFEITGN TVERIVVHCNNNQDFQEWLEQLNRLIRGPASCSSLSKTSSSSCSAHSESSKSPKTMKKFL HKRKTERKPSEEEYVIRKSTAALEEDAQILKVIEAYCTSANFQQGHGSSTRKDSIPQVLL PEEEKLIIEETRSNGQTIMEEKSLVDTVYALKDEVRELKQENKRMKQCLEEELKSRRDLE KLVRRLLKQTDECIRGESSSKTSILP >gi568815575f:136548249_136759412|GENSCAN_predicted_CDS_5|1881_bp atgaagatatttgatcctgatgacctttattcaggggtcaatttctccaaggtactgagt actcttttagctgtcaacaaagcaacagaagatcagctatcagaaagaccatgtggacgt tcctcttctcttagtgctgctaatacttctcagacaaacccacagggagcagtttctagc acagtttcagggctgcaaaggcagtcaaagacagtggagatgacggaaaatggaagtcat cagttgatagtaaaagcaagattcaactttaagcagactaatgaggatgaactgtcagtt tgtaagggggacatcatttacgtcacacgagttgaagaaggaggctggtgggaaggcaca ttaaatgggagaacaggctggttccccagtaattatgtccgtgaaattaaatccagtgag agacctctctccccaaaagccgtcaaaggatttgaaactgctccacttaccaagaattat tatactgtggtgttacagaacatcctggacactgaaaaagaatatgctaaagaacttcag tctcttcttgttacttacttaagacccctgcagtccaataacaatctgagtactgtggag gttacatctttactgggaaacttcgaggaagtatgcacatttcaacagacactctgccaa gccttggaagaatgttcaaagtttccagaaaaccagcacaaagtaggaggttgtctactg agtctcatgcctcattttaaatctatgtatctggcttactgtgcaaaccatccttcagct gtaaatgtgctcactcagcacagtgatgagttggaacaattcatggaaaatcaaggtgca tcgagcccaggtatcctcattttaacaacaaacctcagcaaaccattcatgcgactggag aaatatgttactctcttgcaagagttagaacggcatatggaggggcaatgtcaagatctg aggaagagaaaacagctggagttacagatactgtccgaacctattcaggcatgggaagga gaagatattaaaaacttgggaaatgtgatttttatgtcacaagtaatggtgcagtatgga gcatgtgaggaaaaagaggagcggtaccttatgttattttcaaatgtcctgataatgtta tctgcaagtcctcggatgagtggctttatctatcagggaaaaataccaatagcaggaacg gtggtgactagattagatgaaattgaagggaatgactgcacatttgaaatcactggtaac acagtggagagaattgtggtccattgtaacaacaaccaggacttccaggaatggttggag cagctgaacagactgatcagaggacctgcctcttgcagttcattatccaaaacctcatcg tcatcatgtagtgctcattctgagtctagtaaaagccctaaaacgatgaagaaatttctt cataaaaggaagactgagagaaaaccatcggaggaggaatatgtgattaggaaaagtaca gctgctctggaagaggatgctcaaatccttaaagtgatcgaagcctactgcaccagcgca aattttcaacaaggccatggctcaagtactcgaaaagattccattccacaagtcctactc cctgaggaagagaaactcatcattgaagaaaccagaagcaacggccagaccatcatggaa gaaaagagccttgttgatactgtttacgccttgaaggacgaggtcagagaactgaagcag gaaaataaaagaatgaagcaatgcctggaagaagaactgaaatcaagaagggacctagaa aagctggtgcggaggcttttgaagcaaacagatgagtgtattcgaggcgagtccagtagc aagacctcaattcttccataa