GENSCAN 1.0 Date run: 2-Nov-116 Time: 23:41:01 Sequence gi568815590f:22304711_22519695 : 214985 bp : 43.39% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 74 158 85 2 1 87 101 40 0.943 5.02 1.02 Intr + 1217 1306 90 1 0 80 53 105 0.754 6.39 1.03 Intr + 3223 3363 141 0 0 67 92 42 0.655 3.05 1.04 Intr + 6438 6590 153 2 0 37 83 135 0.607 8.17 1.05 Intr + 9618 9719 102 2 0 69 109 102 0.913 10.77 1.06 Intr + 10319 10435 117 1 0 111 105 63 0.997 10.96 1.07 Intr + 11535 11623 89 2 2 61 84 110 0.991 6.67 1.08 Intr + 13460 13565 106 2 1 66 78 110 0.992 8.02 1.09 Intr + 48249 48502 254 2 2 128 103 215 0.998 23.23 1.10 Intr + 49561 49668 108 1 0 112 64 32 0.857 2.60 1.11 Term + 50639 50795 157 2 1 89 42 132 0.991 6.11 1.12 PlyA + 52837 52842 6 1.05 2.06 PlyA - 52891 52886 6 1.05 2.05 Term - 62190 62036 155 1 2 95 43 129 0.979 7.18 2.04 Intr - 64441 64381 61 1 1 73 95 41 0.617 1.61 2.03 Intr - 65058 65033 26 1 2 75 85 34 0.492 -0.46 2.02 Intr - 66248 66132 117 0 0 116 117 -3 0.683 5.84 2.01 Init - 74194 74176 19 1 1 93 99 -1 0.366 1.78 2.00 Prom - 84235 84196 40 -4.06 3.00 Prom + 95951 95990 40 -4.16 3.01 Init + 100001 100270 270 1 0 104 103 376 0.982 35.67 3.02 Intr + 103600 103786 187 0 1 100 105 222 0.999 24.36 3.03 Intr + 105236 105405 170 0 2 116 46 116 0.209 9.87 3.04 Intr + 106085 106229 145 1 1 55 3 34 0.095 -8.54 3.05 Intr + 107327 107496 170 0 2 60 64 250 0.956 19.47 3.06 Intr + 110070 110192 123 2 0 101 78 56 0.989 6.68 3.07 Intr + 111059 111247 189 1 0 106 82 298 0.994 30.78 3.08 Intr + 111363 111570 208 2 1 68 87 276 0.965 24.15 3.09 Intr + 112941 113125 185 1 2 93 115 211 0.999 23.81 3.10 Term + 114842 114988 147 1 0 74 49 167 0.731 9.30 3.11 PlyA + 117946 117951 6 1.05 4.03 PlyA - 120453 120448 6 1.05 4.02 Term - 136678 136511 168 1 0 73 48 181 0.159 10.48 4.01 Init - 158627 158589 39 2 0 48 80 105 0.741 5.88 4.00 Prom - 161614 161575 40 -2.56 5.00 Prom + 164250 164289 40 -3.36 5.01 Init + 164649 164658 10 2 1 77 81 5 0.355 -0.70 5.02 Intr + 170244 170441 198 1 0 85 75 138 0.983 11.52 5.03 Intr + 170790 170914 125 1 2 74 83 78 0.925 6.20 5.04 Intr + 187835 188078 244 1 1 29 32 121 0.011 -2.33 5.05 Term + 188167 188405 239 2 2 64 38 192 0.752 8.13 5.06 PlyA + 188642 188647 6 1.05 6.00 Prom + 192303 192342 40 -4.06 6.01 Init + 193358 193402 45 1 0 72 101 17 0.112 2.08 6.02 Intr + 197517 197603 87 2 0 56 84 36 0.062 0.17 6.03 Intr + 205941 206073 133 2 1 75 93 45 0.949 3.92 6.04 Intr + 206376 206521 146 1 2 84 106 56 0.977 7.00 6.05 Intr + 208583 208722 140 1 2 98 115 58 0.609 8.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 48115 48197 83 2 2 69 22 72 0.837 -1.94 S.002 Init - 177901 177777 125 1 2 71 91 107 0.843 8.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:22304711_22519695|GENSCAN_predicted_peptide_1|467_aa XKNYGITVKEEDQPLLIHRPSERQDNHGMLLKGEILLLPELSFMTGIPEKMKKDFRAMKD LAQQINLSPKQHHSALECLLQRIAKNEAATNELMRWGLRLQKDVHKRAMDQARELVNMLE KIAGPIGMRMSPPAWVELKDDRIETYVRTIQSTLGAEGKIQMVVCIIMGPRDDLYGAIKK LCCVQSPVPSQVVNVRTIGQPTRLRSVAQKILLQINCKLGGELWGVDIPLKQLMVIGMDV YHDPSRGMRSVVGFVASINLTLTKWYSRVVFQMPHQEIVDSLKLCLVGSLKKFYEVNHCL PEKIVVYRDGVSDGQLKTVANYEIPQLQKCFEAFENYQPKMVVFVVQKKISTNLYLAAPQ NFVTPTPGTVVDHTITSCEWVDFYLLAHHVRQGCGIPTHYVCVLNTANLSPDHMQRLTFK LCHMYWNWPGTIRVPAPCKYAHKLAFLSGHILHHEPAIQLCENLFFL >gi568815590f:22304711_22519695|GENSCAN_predicted_CDS_1|1404_bp nncaaaaattatgggatcacagttaaggaagaggaccagccattgctgattcacaggccc agtgagagacaggataatcatgggatgctgctaaaaggggaaatcctgctgctgcctgag ctttcttttatgaccggaatcccagagaagatgaagaaggacttcagagccatgaaggat ttggctcagcaaatcaatctgagccccaagcaacaccatagtgctttggaatgcttgctg caaagaattgcaaagaacgaggcagccaccaatgaactgatgcgttgggggctccgtctg caaaaggatgtacataagagagcaatggaccaggctcgagaactggtcaacatgttggag aagatagccggccccattggcatgcgtatgagcccaccggcctgggttgaactaaaggat gaccgaatagagacttatgtcagaaccattcaatccacgttaggagctgaggggaagata cagatggttgtttgcatcatcatgggcccacgtgatgatctctatggggccatcaagaag ctgtgctgtgtgcagtccccagtgccctcccaggttgtcaatgttcgaaccattggtcag cccaccaggcttcggagtgtggcccagaagattttacttcagattaactgtaaattgggt ggtgagctctggggagtggatattcctctgaaacagttaatggtgatcgggatggatgtt taccatgaccccagtagaggcatgcgctccgtggttggcttcgtggcaagcatcaatctc accctcacaaaatggtattcccgggtggtgttccagatgccgcatcaggagattgtggac agcctgaagctatgcctcgtgggctccttaaaaaagttttatgaggtgaaccactgtcta ccagagaagattgtggtgtaccgtgatggagtgtctgatggccaactgaagacagttgcc aactatgagattcctcaactacagaagtgttttgaagcttttgagaattatcagcccaag atggtggtgtttgtagttcagaagaaaatcagtactaatctatatctggctgctcctcag aactttgtaactcccactcctggaactgtggtagatcatacaataacaagctgtgagtgg gtggatttctatcttcttgcccatcatgtacggcagggctgtggcattcctacgcattat gtctgtgttctcaacaccgcaaacctgagccctgatcatatgcagaggctgactttcaaa ctgtgccacatgtactggaattggcctggcaccatcagagttccagctccttgcaagtat gcccacaagctagctttcctgtcaggacacatcttgcatcatgaaccagccatccagctg tgcgagaacctgttcttcctgtga >gi568815590f:22304711_22519695|GENSCAN_predicted_peptide_2|125_aa MGHRSTGGHEALAGDSRWGGLWTPYLLAFHQPTSLEVREAPLQTPGKITTSCKLSYLQTN PSKRTTSLSPGYHTVRKGTSEGDAPLDGVSSAYFHQPKEPKFPGRVPNKPYYIHFSDLMP HRFPQ >gi568815590f:22304711_22519695|GENSCAN_predicted_CDS_2|378_bp atgggacacagatccacaggtgggcatgaagccctggctggcgacagtcgatggggcgga ctctggactccttatctcctggcattccaccagcccaccagcctggaggtcagagaagcc ccactgcagacacccggcaaaatcacgacatcctgcaagctgtcttacctgcaaaccaac cccagcaagaggaccactagcctatctcctggctaccacacagttcggaaaggtacttca gaaggggacgcgcccttagatggtgtttcttctgcatatttccaccagccaaaagaacca aaatttcctggtcgagttccaaataaaccatattatattcacttttcagacttgatgccg caccgcttcccacaatga >gi568815590f:22304711_22519695|GENSCAN_predicted_peptide_3|597_aa MKLLLLHPAFQSCLLLTLLGLWRTTPEAHASSLGAPAISAASFLQDLIHRYGEGDSLTLQ QLKALLNHLDVGVGRGNVTQHVQGHRNLSTCFSSGDLFTAHNFSEQSRIGSSELQEFCPT ILQQLDSRACTSENQENEENEQTEEGRPSAVEVWGFGFLSVSLINLASLLGVLVLPCTEK AFFSRVLTYFIALSIGTLLSNALFQLIPERSYKNKAQVDSLPTFLAQAGMLLWRVRIRRR VVDPIRESWMLPFTKIPLWGYGLLCVTVISLCSLLGASVVPFMKKTFYKRLLLYFIALAI GTLYSNALFQLIPEAFGFNPLEDYYVSKSAVVFGGFYLFFFTEKILKILLKQKNEHHHGH SHYASESLPSKKDQEEGVMEKLQNGDLDHMIPQHCSSELDGKAPMVDEKVIVGSLSVQDL QASQSACYWLKGVRYSDIGTLAWMITLSDGLHNFIDGLAIGASFTVSVFQGISTSVAILC EEFPHELGDFVILLNAGMSIQQALFFNFLSACCCYLGLAFGILAGSHFSANWIFALAGGM FLYISLADMFPEMNEVCQEDERKGSILIPFIIQNLGLLTGFTIMVVLTMYSGQIQIG >gi568815590f:22304711_22519695|GENSCAN_predicted_CDS_3|1794_bp atgaagctgctgctgctgcacccggccttccagagctgcctcctgctgaccctgcttggc ttatggagaaccacccctgaggctcacgcttcatccctgggtgcaccagctatcagcgct gcctccttcctgcaggatctaatacatcggtatggcgagggtgacagcctcactctgcag cagctgaaggccctactcaaccacctggatgtgggagtgggccggggtaatgtcacccag cacgtgcaaggacacaggaacctctccacgtgctttagttctggagacctcttcactgcc cacaatttcagcgagcagtcgcggattgggagcagcgagctccaggagttctgccccacc atcctccagcagctggattcccgggcctgcacctcggagaaccaggaaaacgaggagaat gagcagacggaggaggggcggccaagcgctgttgaagtgtggggctttggttttctcagt gtctcactgattaacctggcctctctcctgggagtcctcgtcctgccctgcacagagaaa gcgtttttcagccgtgtgctcacttacttcatcgccctgtccattggaacgctgctgtct aacgcgctattccagctcatcccagagaggagctataaaaataaggcccaagttgacagt ctgcccacttttttagctcaagcagggatgctgctgtggagggtgaggatacgcagacgt gtggttgaccccattagggagtcatggatgctgcccttcacaaagatcccattgtgggga tacggtctcctctgtgtgaccgtcatctccctctgctccctcctgggggccagcgtggtg cccttcatgaagaagaccttttacaagaggctgctgctctacttcatagctctggcgatt ggaaccctctactccaacgccctcttccagctcatcccggaggcatttggtttcaaccct ctggaagattattatgtctccaagtctgcagtggtgtttgggggcttttatcttttcttt ttcacagagaagatcttgaagattcttcttaagcagaaaaatgagcatcatcatggacac agccattatgcctctgagtcgcttccctccaagaaggaccaggaggagggggtgatggag aagctgcagaacggggacctggaccacatgattcctcagcactgcagcagtgagctggac ggcaaggcgcccatggtggacgagaaggtcattgtgggctcgctctctgtgcaggacctg caggcttcccagagtgcttgctactggctgaaaggtgtccgctactctgatatcggcact ctggcctggatgatcactctgagcgacggcctccataatttcatcgatggcctggccatc ggtgcttccttcactgtgtcagttttccaaggcatcagcacctcggtggccatcctctgt gaggagttcccacatgagctaggagactttgtcatcctgctcaacgctgggatgagcatc caacaagctctcttcttcaacttcctttctgcctgctgctgctacctgggtctggccttt ggcatcctggccggcagccacttctctgccaactggatttttgcgctagctggaggaatg ttcttgtatatttctctggctgatatgttccctgagatgaatgaggtctgtcaagaggat gaaaggaagggcagcatcttgattccatttatcatccagaacctgggcctcctgactgga ttcaccatcatggtggtcctcaccatgtattcaggacagatccagattgggtag >gi568815590f:22304711_22519695|GENSCAN_predicted_peptide_4|68_aa MHTADIIAMDKQWAPGARRTCAYAAAAAFSSGSLSCPEPSASRARPRLLSGVRAPEGRGH RSAGLRRH >gi568815590f:22304711_22519695|GENSCAN_predicted_CDS_4|207_bp atgcatactgccgacatcattgccatggacaaacagtgggctccaggagcccgccggacg tgcgcctacgccgcggccgccgccttctcctcgggcagccttagctgcccggagccgtca gccagccgtgcccggccccggctcctcagcggtgtccgagcaccagaagggcgcggccac cgcagcgccggcctccgacgccactga >gi568815590f:22304711_22519695|GENSCAN_predicted_peptide_5|271_aa MKRTVPFPPTQRLTFKEVFENGKPKVDVLKNHLVKEGRLEEEVALKIINDGAAILRQEKT MIEVDAPITVCGDIHGQFFDLMKLFEVGGSPSNTRYLFLGDYVDRGYFSIELVVHPRPPS TNPSPPRWPLICSEKMKQTVMSQERLAKLQAQVRIGGKEMVHRKKKAVHRTATADDKKLQ FSLKKLEVNNVSGHAETKQLMEMLPSILNQLGAHCLTSLRRLAEALPKQSVNGKAPLATG EDDDEVPALVENFDEASKMENFDEASKNEAN >gi568815590f:22304711_22519695|GENSCAN_predicted_CDS_5|816_bp atgaagagaactgtcccctttcctccaacccaacggcttactttcaaggaagtatttgag aatgggaaacctaaagttgatgttttaaaaaaccatttggtaaaggaaggacgactggaa gaggaagtagccttaaagataatcaatgatggggctgccatcctgaggcaagagaagact atgatagaagtagatgctccaatcacagtatgtggtgatattcatggacaattctttgac ctaatgaagttatttgaagttggaggatcacctagtaacacacgctacctctttctgggt gactatgtggacagaggctatttcagtatagagctggtggtccacccgagacccccaagc accaaccctagccccccacgttggccccttatctgctctgagaagatgaaacaaacagtt atgagccaggaaagacttgccaaactgcaggcacaagtgcgcattggtgggaaagaaatg gttcacagaaagaagaaggcggttcatagaacagccacagcagatgataaaaaacttcag ttctccttaaagaagttagaggtaaacaatgtctctggccatgctgagacaaagcagctg atggaaatgctacccagcatcttaaaccagcttggtgcacactgtctgactagtttaagg agactggctgaagctctgcccaaacagtctgtgaatggaaaagcaccacttgctactgga gaggatgacgatgaagttccagctcttgtggagaattttgatgaggcttccaagatggag aattttgatgaggcttccaagaatgaggcaaactga >gi568815590f:22304711_22519695|GENSCAN_predicted_peptide_6|184_aa MNAGILQTISPSNRNIYSQKDSKIWMLKDVHCSVNCNNNNNNKKLRYKEISTVGRLGSHP GIEASCIKNVNVCCKTTFRGAQSYSFPPGRIKYSEQVYDACMETFDCLPLAALLNQQFLC VHGGMSPEITSLDDIRKLDRFTEPPAFGPVCDLLWSDPSEDYGNEKTLEHYTHNTVRGCS YFYS >gi568815590f:22304711_22519695|GENSCAN_predicted_CDS_6|552_bp atgaatgcaggcatcttacagactatttcaccttcaaacaggaatatatactcacaaaaa gatagtaaaatatggatgcttaaggatgttcactgcagtgttaattgtaacaacaacaac aacaacaaaaagctcaggtacaaggaaatatccacagtgggaagattagggagccatcct ggtatagaagcctcgtgcatcaaaaatgtcaatgtgtgttgtaaaactaccttcagagga gcccaaagctatagcttcccaccaggtcgaatcaaatattcggaacaggtgtatgatgcc tgtatggagacatttgactgtcttcctcttgctgccctcttaaaccagcagtttctctgt gtacatggaggaatgtcacctgaaattacttctttagatgacattaggaaattagacagg tttacggaacctcccgcctttggacctgtgtgtgacctgctttggtctgatccctcagag gattatggcaatgagaagaccttggagcactatacccacaacactgtccgagggtgctct tatttctacagn