GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:02:06

Sequence gi568815590r:100604301_100821583 : 217283 bp : 43.67% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   1106   1222  117  1  0   65   48   166 0.184   8.84
 1.02 PlyA +   1682   1687    6                               1.05

 2.14 PlyA -   2312   2307    6                               1.05
 2.13 Term -   2778   2755   24  0  0  104   42     1 0.042  -4.68
 2.12 Intr -   4263   4194   70  2  1   79   94    51 0.067   3.98
 2.11 Intr -   7787   7700   88  0  1   81  103    32 0.244   3.03
 2.10 Intr -   8785   8695   91  1  1   76   99    97 0.984   9.17
 2.09 Intr -  10955  10848  108  2  0   13  111    59 0.647   1.18
 2.08 Intr -  13430  13320  111  2  0   90   99   112 0.968  13.08
 2.07 Intr -  26091  25958  134  1  2   81   86    53 0.139   4.76
 2.06 Intr -  31711  31597  115  1  1  102   80    53 0.309   5.92
 2.05 Intr -  40280  40034  247  1  1   88   75    66 0.155   2.76
 2.04 Intr -  41831  41663  169  0  1   75   82    25 0.157  -0.40
 2.03 Intr -  44801  44738   64  2  1   21   60    92 0.179  -2.01
 2.02 Intr -  45048  44974   75  0  0   82   99    99 0.610  10.11
 2.01 Init -  45214  45149   66  1  0   87   50   112 0.352   8.38
 2.00 Prom -  51215  51176   40                              -4.76

 3.20 PlyA -  54064  54059    6                               1.05
 3.19 Term - 100090  99998   93  1  0  137   39   104 0.997   8.23
 3.18 Intr - 100756 100626  131  2  2   81   55    77 0.935   4.11
 3.17 Intr - 101373 101289   85  0  1   76   92    10 0.770  -0.41
 3.16 Intr - 102505 102351  155  2  2   45   80    78 0.676   2.49
 3.15 Intr - 104923 104833   91  1  1  148   97    33 0.997   9.77
 3.14 Intr - 105431 105159  273  2  0  111   96   161 0.999  16.93
 3.13 Intr - 108157 108062   96  1  0  100  116   124 0.999  16.61
 3.12 Intr - 108489 108352  138  0  0   11   27   208 0.991   7.66
 3.11 Intr - 108881 108787   95  0  2   26  106   103 0.252   5.58
 3.10 Intr - 111301 111162  140  0  2   54   23   246 0.252  14.81
 3.09 Intr - 113588 113473  116  2  2   65   84   147 0.999  11.25
 3.08 Intr - 113980 113787  194  2  2   81  121   116 0.969  13.41
 3.07 Intr - 117319 117091  229  1  1   81  101   386 0.364  36.64
 3.06 Intr - 137585 137455  131  0  2  132  103    -9 0.685   5.41
 3.05 Intr - 137895 137857   39  1  0  118   74    41 0.374   3.90
 3.04 Intr - 155635 155591   45  2  0  131   78    39 0.658   5.58
 3.03 Intr - 156448 156430   19  1  1  101   64    -6 0.171  -5.52
 3.02 Intr - 157058 156921  138  2  0  131   72    24 0.311   5.76
 3.01 Init - 174749 174723   27  2  0  103   83    23 0.052   2.93
 3.00 Prom - 179212 179173   40                              -1.86

 4.05 PlyA - 179754 179749    6                               1.05
 4.04 Term - 204150 204033  118  2  1   57   40    86 0.560  -1.29
 4.03 Intr - 205400 205307   94  0  1   72   79    69 0.754   3.42
 4.02 Intr - 205676 205475  202  2  1   82  100    88 0.932   8.26
 4.01 Init - 207462 207415   48  0  0   60   70    61 0.504   2.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:100604301_100821583|GENSCAN_predicted_peptide_1|38_aa
LLYPDSGNNDDDDDDDKDYYDINYNFLETLPGNELRFV

>gi568815590r:100604301_100821583|GENSCAN_predicted_CDS_1|117_bp
ctcctctacccagattcagggaacaatgatgatgatgatgatgatgataaagattattat
gacatcaactacaattttttggagactttgccaggcaatgagctaaggtttgtatga

>gi568815590r:100604301_100821583|GENSCAN_predicted_peptide_2|453_aa
MKMHFCIPVSQQRSDALGGRYVLYSVHLDGFLFCRVRYSQLHGWNEQLDECMLRLLVAYS
CEMREPGAGGFSVKLQRMKGKFTLGPDSLFALNPLHSALTPGLQHLLSVMRVPDTGFCSS
KSPVSPVPSEAPSGSGSFLTLCCSVLLIFLLCELAGPTFSVPLDKQNQFDLCPCSCWHHP
AQIDLKKSVRETFNQLILTKLQMRPSCLRRVFGNCLPPFPPKYYLAMTTAMADERRDQLE
QYLQNVTMDPNVLRSDVFVEFLKLAQLVSLLFFNGAIIPLQDTLHEPSVQNTFDIATKKA
YLDIFLPNEQSIRIEIITSDTAERVLEHTFDEVHLQSEKCASSLPIMLACEKFRPKTLTS
SDQVVSHKIGLCRELLGYFGLFLIRFGKEGKLSVVKKLADFELPYVSLGSSEVENCKVGL
RKWYMAPSLDSVLMDCRVAVDLLYMQHFYHLPM

>gi568815590r:100604301_100821583|GENSCAN_predicted_CDS_2|1362_bp
atgaagatgcatttctgtatcccggtgtcccagcagcggtccgacgcgctggggggccgc
tacgtgctgtactccgtgcacctggacgggttcctcttctgcagggtgcgctacagccag
ctgcacggttggaacgaacagttggatgaatgcatgctgcggctcctggtcgcctacagc
tgtgagatgagggaaccgggcgcaggtggtttttcagtgaagcttcagaggatgaagggg
aagtttacccttggccccgatagtctttttgctctcaatccccttcattcagccctgacc
ccaggcctccaacatctactctctgtcatgcgagttccagacacaggattttgttccagc
aagtccccagtaagtcctgtccccagtgaggctccctcgggctctggcagctttttgacc
ctctgctgctcagtgctgctcatcttcctgctgtgtgaactggctggccccacattttct
gtgcctttagataaacaaaaccagtttgatttatgcccctgctcttgctggcaccaccct
gcccaaatagatttaaagaaatctgtgagggaaacatttaatcagctcattttgaccaag
ttacagatgaggccatcctgtctaaggcgggtctttggaaattgcctgccacccttccca
ccaaagtactatctggcaatgaccacagctatggctgatgagaggagggaccaactggaa
caatatttgcaaaatgtaaccatggacccaaacgtgttgagaagtgatgtcttcgttgag
tttttaaaactggcgcagctggtaagcttgctcttctttaatggggccatcattcctctg
caggacactcttcatgaaccaagtgtgcagaatacatttgacatcgccaccaagaaagct
tatctggacatatttctgcccaatgaacagagtattagaatcgaaattataacatcagac
actgctgaaagagtcctagagcatacatttgatgaagtacatttgcagtcagaaaaatgt
gcttcctctcttccaatcatgttggcttgtgagaaattcagacccaagacgttgacatca
tcagatcaggtggtgtcacacaaaattggactgtgtcgagagctcttgggctacttcggc
ctctttctcattcggtttggcaaggagggcaagctctctgttgtgaaaaaattggctgac
tttgaactcccttatgttagtcttggaagttctgaggtggaaaactgtaaggttggactc
cgaaagtggtatatggctccatccctcgactccgtgctgatggactgcagggtggcggta
gatttgctctacatgcagcatttctaccatctgccaatgtaa

>gi568815590r:100604301_100821583|GENSCAN_predicted_peptide_3|744_aa
MKPKTSEEVCSPLVWQYSLFPSHTLGNPVHIQALTTMTFSKSLSPTSTIFRVLGQNLYHR
PGCSHSSNMIDEDKILVDTMITKVFILVNYCPLELKIYCVHYFTPNLFLPSRQDCLWTTK
MACESALLNLVLQVAACGPAGSRAEMNPSAPSYPMASLYVGDLHPDVTEAMLYEKFSPAG
PILSIRVCRDMITRRSLGYAYVNFQQPADAERALDTMNFDVIKGKPVRIMWSQRDPSLRK
SGVGNIFIKNLDKSIDNKALYDTFSAFGNILSCKVVCDENGSKGYGFVHFETQEAAERAI
EKMNGMLLNDRKVFVGRFKSRKEREAELGARAKEFTNVYIKNFGEDMDDERLKDLFGKFG
PALSVKVMTDESGKSKGFGFVSFERHEDAQKAVDEMNGKELNGKQIYVGRAQKKVERQTE
LKRKFEQMKQDRITRYQGVNLYVKNLDDGIDDERLRKEFSPFGTITSAKVMMEGGRSKGF
GFVCFSSPEEATKAVTEMNGRIVATKPLYVALAQRKEERQAHLTNQYMQRMASVRAVPNP
VINPYQPAPPSGYFMAAIPQTQNRAAYYPPSQIAQLRPSPRWTAQGARPHPNTSTQTMGP
RPAAAAAAATPAVRTVPQYKYAAGVRNPQQHLNAQPQVTMQQPAVHVQGQEPLTASMLAS
APPQEQKQMLGERLFPLIQAMHPTLAGKITGMLLEIDNSELLHMLESPESLRSKVDEAVA
VLQAHQAKEAAQKAVNSATGVPTV

>gi568815590r:100604301_100821583|GENSCAN_predicted_CDS_3|2235_bp
atgaagcccaagacaagtgaggaggtgtgctcacctcttgtgtggcagtactccctcttc
ccatctcatactctgggcaatcctgttcatatccaagctctgactaccatgacgttttct
aaatctctgtctccaacttcaaccattttcagggtccttggacagaatctctaccacagg
ccaggctgctcccatagttctaacatgatagatgaggacaaaatcctagttgacaccatg
ataactaaagtcttcatcctggtcaactactgtcctctggaattgaaaatttattgcgtc
cactattttactcctaacttattcctgccatcccgtcaggactgcctttggacaaccaaa
atggcctgtgagagtgctttgctaaatctagttcttcaggtcgcggcctgtggccctgcg
ggcagccgtgccgagatgaaccccagtgcccccagctaccccatggcctcgctctacgtg
ggggacctccaccccgacgtgaccgaggcgatgctctacgagaagttcagcccggccggg
cccatcctctccatccgggtctgcagggacatgatcacccgccgctccttgggctacgcg
tatgtgaacttccagcagccggcggacgcggagcgtgctttggacaccatgaattttgat
gttataaagggcaagccagtacgcatcatgtggtctcagcgtgatccatcacttcgcaaa
agtggagtaggcaacatattcattaaaaatctggacaaatccattgataataaagcactg
tatgatacattttctgcttttggtaacatcctttcatgtaaggtggtttgtgatgaaaat
ggttccaagggctatggatttgtacactttgagacgcaggaagcagctgaaagagctatt
gaaaaaatgaatggaatgctcctaaatgatcgcaaagtatttgttggacgatttaagtct
cgtaaagaacgagaagctgaacttggagctagggcaaaagaattcaccaatgtttacatc
aagaattttggagaagacatggatgatgagcgccttaaggatctctttggcaagtttggg
cctgccttaagtgtgaaagtaatgactgatgaaagtggaaaatccaaaggatttggattt
gtaagctttgaaaggcatgaagatgcacagaaagctgtggatgagatgaacggaaaggag
ctcaatggaaaacaaatttatgttggtcgagctcagaaaaaggtggaacggcagacggaa
cttaagcgcaaatttgaacagatgaaacaagataggatcaccagataccagggtgttaat
ctttatgtgaaaaatcttgatgatggtattgatgatgaacgtctccggaaagagttttct
ccatttggtacaatcactagtgcaaaggttatgatggagggtggtcgcagcaaagggttt
ggttttgtatgtttctcctccccagaagaagccactaaagcagttacagaaatgaacggt
agaattgtggccacaaagccattgtatgtagctttagctcagcgcaaagaagagcgccag
gctcacctcactaaccagtatatgcagagaatggcaagtgtacgagctgttcccaaccct
gtaatcaacccctaccagccagcacctccttcaggttacttcatggcagctatcccacag
actcagaaccgtgctgcatactatcctcctagccaaattgctcaactaagaccaagtcct
cgctggactgctcagggtgccagacctcatcctaacacatcaacacagacaatgggtcca
cgtcctgcagctgcagccgctgcagctactcctgctgtccgcaccgttccacagtataaa
tatgctgcaggagttcgcaatcctcagcaacatcttaatgcacagccacaagttacaatg
caacagcctgctgttcatgtacaaggtcaggaacctttgactgcttccatgttggcatct
gcccctcctcaagagcaaaagcaaatgttgggtgaacggctgtttcctcttattcaagcc
atgcaccctactcttgctggtaaaatcactggcatgttgttggagattgataattcagaa
cttcttcatatgctcgagtctccagagtcactccgttctaaggttgatgaagctgtagct
gtactacaagcccaccaagctaaagaggctgcccagaaagcagttaacagtgccaccggt
gttccaactgtttaa

>gi568815590r:100604301_100821583|GENSCAN_predicted_peptide_4|153_aa
MGPKLYVVLSSPTNTSVAGKEGNKEASPSQLSQPKLGQGSEGGGDVRSAASPGRLLRFRG
LPGPLAVVVLLPNVFPVSSVFRKDPAIKGLKDLYQARPLFTGGDAAWKPAAARKQLGKSK
VLPWEKESRDKTGGNVKSQGTKSLPKLIFMEMG

>gi568815590r:100604301_100821583|GENSCAN_predicted_CDS_4|462_bp
atgggacccaagctctacgtggtgctgtccagtcccacaaacacatcagttgcggggaaa
gaggggaacaaagaagccagcccctcccagttgtctcagcctaagctaggacagggctcc
gaaggtgggggggatgtgcgtagcgctgcctctcctggccgcctcctgcgtttccgcggg
cttcccgggcccctcgctgtcgtagtattactgccaaacgtctttcctgtgtcgtctgtg
ttccgcaaagatcctgccatcaaaggtctcaaggatctctaccaagcacgcccccttttt
actggaggggacgcggcctggaagccagcggctgcccgtaaacaattagggaagagcaaa
gtgctcccctgggaaaaagaatccagggacaaaactggtggcaacgtgaaaagtcaaggg
acaaaaagtttgccgaagctgatcttcatggaaatgggatga