GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:36:17 Sequence gi568815588r:5399001_5599438 : 200438 bp : 46.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 2087 1844 244 2 1 35 55 211 0.192 9.57 1.01 Init - 5792 5790 3 2 0 67 115 0 0.231 0.60 1.00 Prom - 7709 7670 40 -4.76 2.00 Prom + 10341 10380 40 -5.86 2.01 Init + 11616 11732 117 2 0 81 101 26 0.370 3.40 2.02 Intr + 13618 13820 203 0 2 69 105 125 0.433 10.38 2.03 Intr + 27655 27721 67 1 1 64 97 14 0.209 -1.19 2.04 Intr + 30170 30229 60 1 0 81 121 66 0.439 8.13 2.05 Intr + 47703 47858 156 2 0 -4 94 92 0.006 0.81 2.06 Intr + 53358 53525 168 2 0 61 69 80 0.823 3.54 2.07 Intr + 53858 53920 63 1 0 79 86 38 0.753 1.61 2.08 Intr + 54250 54346 97 0 1 115 72 14 0.995 2.08 2.09 Intr + 54484 54560 77 2 2 92 71 87 0.879 6.63 2.10 Intr + 55265 55522 258 1 0 67 87 118 0.949 7.26 2.11 Intr + 55948 56118 171 0 0 84 53 224 0.939 18.64 2.12 Intr + 57087 57273 187 2 1 59 93 129 0.898 9.76 2.13 Term + 57588 57994 407 1 2 114 44 416 0.998 35.15 2.14 PlyA + 61379 61384 6 1.05 3.00 Prom + 61727 61766 40 -2.56 3.01 Init + 66676 66745 70 0 1 84 97 89 0.953 10.61 3.02 Intr + 69390 69592 203 1 2 19 4 163 0.219 0.00 3.03 Intr + 70380 70396 17 2 2 131 103 10 0.231 1.14 3.04 Term + 81397 81550 154 1 1 131 48 91 0.189 6.79 3.05 PlyA + 84045 84050 6 -0.45 4.00 Prom + 85257 85296 40 -0.96 4.01 Init + 86218 86359 142 0 1 81 53 118 0.395 6.31 4.02 Term + 92465 92649 185 0 2 98 44 82 0.315 2.51 4.03 PlyA + 95285 95290 6 1.05 5.02 PlyA - 96543 96538 6 1.05 5.01 Sngl - 100438 99998 441 1 0 100 54 970 0.996 91.05 5.00 Prom - 102322 102283 40 -8.66 6.10 PlyA - 102539 102534 6 1.05 6.09 Term - 103160 103104 57 2 0 65 54 86 0.351 0.59 6.08 Intr - 104778 104720 59 1 2 103 77 -5 0.162 -1.50 6.07 Intr - 109639 109525 115 1 1 -11 94 131 0.534 3.82 6.06 Intr - 110493 110441 53 1 2 73 48 72 0.695 0.33 6.05 Intr - 112834 112621 214 1 1 40 81 91 0.415 1.89 6.04 Intr - 113234 112950 285 2 0 66 40 166 0.371 7.14 6.03 Intr - 114026 113975 52 1 1 41 71 47 0.182 -2.79 6.02 Intr - 114147 114114 34 1 1 75 100 45 0.162 1.68 6.01 Init - 122985 122916 70 0 1 68 93 99 0.486 9.61 6.00 Prom - 123247 123208 40 -10.55 7.00 Prom + 123282 123321 40 -7.86 7.01 Init + 126086 126531 446 1 2 77 61 1178 0.295 109.49 7.02 Intr + 129984 130267 284 0 2 76 73 55 0.230 -0.14 7.03 Intr + 130850 131032 183 0 0 61 64 106 0.308 5.26 7.04 Intr + 133801 133831 31 2 1 122 82 -3 0.150 -0.31 7.05 Intr + 139791 139817 27 0 0 136 88 1 0.055 2.13 7.06 Intr + 151267 151393 127 1 1 103 59 51 0.016 4.38 7.07 Intr + 152914 153044 131 0 2 89 82 1 0.017 -0.91 7.08 Intr + 161547 161692 146 0 2 37 89 71 0.033 2.03 7.09 Intr + 163533 163707 175 1 1 104 64 60 0.260 4.20 7.10 Intr + 164342 164485 144 2 0 66 48 108 0.214 4.00 7.11 Intr + 175132 175210 79 1 1 40 -3 94 0.051 -4.95 7.12 Intr + 175736 175954 219 1 0 100 37 120 0.457 6.60 7.13 Term + 177869 178018 150 1 0 68 47 85 0.632 0.31 7.14 PlyA + 180692 180697 6 1.05 8.04 PlyA - 181022 181017 6 1.05 8.03 Term - 182776 182756 21 1 0 130 49 -3 0.355 -1.69 8.02 Intr - 184905 184711 195 0 0 99 87 45 0.065 5.11 8.01 Init - 192042 191881 162 0 0 37 84 124 0.194 4.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 47766 47858 93 2 0 90 94 34 0.900 4.58 S.002 Intr - 184862 184711 152 0 2 46 87 73 0.820 2.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:5399001_5599438|GENSCAN_predicted_peptide_1|83_aa MRECLSIHIGQAGIQIGDACWELYCLEHGIQPNGVVLDTQQDQLENAKMEHTNASFDTFF CETRAGKHVPRALFVDLEPTVIX >gi568815588r:5399001_5599438|GENSCAN_predicted_CDS_1|249_bp atgagggagtgcctttccatccacatcggtcaagctggcatccagattggggacgcctgc tgggaactctattgcctggaacatggaatccagccaaatggcgttgttcttgacactcaa caggatcagctggaaaatgcaaaaatggagcacacaaatgcatctttcgataccttcttc tgtgagacaagagctgggaagcatgtgcctagagcactcttcgtggacttggagccaact gttatagnn >gi568815588r:5399001_5599438|GENSCAN_predicted_peptide_2|676_aa MGKVEQKWFLTKVNLCSELVPSGGFMVSLTSRMKPWTFMRLAGIVPVPAGLPGTRPPPHP LLRAMEPELAAQKQPRPRRRSRRASGLSTEGATGPSADTSGSELDGRCSLRRGSSFTFLT PGPNWDFTLKRKRREKDDDVVSLSSLDLKRRYSADSVWIDWKGLREYLGSMVAHDETGGL LPIKRTIRVLDVNNQSFREQESFTLRGDHRSPASAQKFSSRSTVPTPAKRRSSALWSEML DITMKESLTTREIRRQEAIYEMSRGEQDLIEDLKLARKAYHDPMLKLSIMSEEELTHIFG DLDSYIPLHEDLLTRIGEATKPDGTVEQIGHILVSWLPRLNAYRGYCSNQLAAKALLDQK KQDPRVQDFLQRCLESPFSRKLDLWSFLDIPRSRLVKYPLLLKEILKHTPKEHPDVQLLE DAILIIQGVLSDINLKKGESECQYYIDKLEYLDEKQRDPRIEASKVLLCHGELRSKSGHK LYIFLFQDILVLTRPVTRNERHSYQVYRQPIPVQELVLEDLQDGDVRMGGSFRGAFSNSE KAKNIFRIRFHDPSPAQSHTLQANDVFHKQQWFNCIRAAIAPFQSAGSPPELQGLPELHE ECEGNHPSARKLTAQRRASTVSSVTQVEVDENAYRCGSGMQMAEDSKSLKTHQTQPGIRR ARDKALSGGKRKETLV >gi568815588r:5399001_5599438|GENSCAN_predicted_CDS_2|2031_bp atggggaaggtagaacagaagtggttcttgaccaaggtcaacttatgttcagagctggtt ccttccggtgggttcatggtctcgctgacttcaagaatgaagccatggaccttcatgagg ctggcgggcatcgtgcccgtccctgccggtctcccgggcacccggccaccgccccacccc ctcctccgtgccatggagcccgagctggcggctcagaagcagcctcgaccgcggaggcga agccgccgggcctctgggctcagcacggagggagcgacggggccttcggccgacacctcc gggtcggagctggacgggagatgttcccttcggagaggcagctccttcacattcttaaca cctggccccaactgggacttcactttgaaaagaaaacgcagagagaaagatgatgatgtt gtaagccttagcagccttgatctgaagcgtcgctacagcgctgactcggtgtggattgat tggaaaggtttgagggagtacttgggaagcatggtggcacatgatgagactggaggtctc ctacctattaaaaggaccatacgagtcctagatgtcaataaccagtccttcagagaacaa gagtcatttacccttcgtggtgaccacagatccccagcctctgcccagaagttttctagc aggtcaacagtcccaacacccgccaagagaaggagcagtgcactgtggtcagagatgctg gacatcaccatgaaggagtctctcaccaccagggagatcagacggcaggaggcaatatat gaaatgtcccgaggtgaacaggatttaattgaggatctcaaacttgcaagaaaggcctat catgaccccatgttaaagttgtccatcatgtcagaagaggaactcacacatatatttggt gatctggactcttacatacctctgcatgaagatttgttgacaagaataggagaagcaacc aagcctgatggaacagtggagcagattggtcacattctcgtgagctggttaccgcgcttg aatgcctacagaggttactgtagtaaccagctggcagccaaagctcttcttgatcaaaag aaacaggatccaagagtccaagacttcctccagcgatgtctcgagtctcccttcagtcga aaactagatctttggagtttcctagatatccctcgaagtcgcctagtcaaatacccttta ctgttaaaagaaattcttaaacacactccaaaagagcaccctgatgttcagcttctggag gatgctatattgataatacagggagtcctctctgatatcaacttgaagaaaggtgaatcc gagtgccagtattacatcgacaagctggagtacctggatgaaaagcagagggaccccaga atcgaagcgagcaaagtgctgctgtgccatggggagctgcggagcaagagtggacataaa ctttacattttcctgtttcaagacatcttggttctgactcggcccgtcacacggaacgaa cggcactcttaccaggtttaccggcagccaatcccagtccaagagctagtcctagaagac ctgcaggatggagatgtgagaatgggaggctcctttcgaggagctttcagtaactcagag aaagctaaaaatatctttagaattcgcttccatgacccctctccagcccagtctcacact ctgcaagccaatgacgtgttccacaagcagcagtggttcaactgtattcgagcggccatt gcccccttccagtcggcaggcagtccacctgagctgcagggcctgccggagctgcacgaa gagtgtgaggggaaccacccctctgcgaggaaactcacagcccagaggagggcatccaca gtttccagtgttactcaggtagaagttgatgaaaacgcttacagatgtggctctggcatg cagatggcagaggacagcaagagcttaaagacacaccagacacagcccggcatccgaaga gcgagggacaaagccctttctggtggcaaacggaaagagactttggtgtag >gi568815588r:5399001_5599438|GENSCAN_predicted_peptide_3|147_aa MVELCRYTFVQTHTMYNTKNEPSERSKTGKTEASRRQPAPTACCRREPQSSLQMMEPQSS LQMMQPHLLLSGCSLMGDVETAIQLSSPQRNGAGKEPFFKDDAKFKVTTTPRDGASASFP TWVHVRLCPTNARCLLELSSSAVHCHS >gi568815588r:5399001_5599438|GENSCAN_predicted_CDS_3|444_bp atggtggagctatgtcgttatacatttgtccaaacccacacgatgtacaacaccaagaac gagccttcagagaggtccaagacaggaaaaactgaggcctcccgcagacagccagcaccc actgcctgctgcaggagggagccccagtcgagccttcagatgatggagccccagtcgagc cttcagatgatgcagccccacctgctgctctctggctgcagcctcatgggagatgttgag acagccatccagctgagcagcccccagagaaatggtgcaggaaaagaacccttcttcaaa gatgatgccaagttcaaggtcacgacaactccccgtgacggagcctctgcttctttcccc acgtgggtccacgtgaggctctgccccaccaatgcccggtgccttctagaactcagtagc tcagctgtccactgccactcatga >gi568815588r:5399001_5599438|GENSCAN_predicted_peptide_4|108_aa MVPSRSPHLFSTQRVVFSAAAEHTPSVHHSRHMGLPGPSSRNRFQVRNLQISTELPATAL FVGGSPLPPSLGPTAKATALAWAPRLPDAKSATSLEQALALGFQTVFP >gi568815588r:5399001_5599438|GENSCAN_predicted_CDS_4|327_bp atggttcctagcaggtccccccatcttttcagcacccagcgtgtggtcttctcggcggca gctgagcacactccttctgtccaccattcccgacacatgggtcttcctggtccttcctct cggaaccggttccaggttagaaacctccaaatcagcacagagcttcctgccacagccctc tttgtagggggctcacccctgcccccctccctgggtccaactgccaaggccacagccctg gcatgggcacccagactccctgatgccaagtcggccacaagcctggagcaggccttggct ctgggattccagactgtattcccatga >gi568815588r:5399001_5599438|GENSCAN_predicted_peptide_5|146_aa MAGELTPEEEAQYKKAFSAVDTDGNGTINAQELGAALKATGKNLSEAQLRKLISEVDSDG DGEISFQEFLTAAKKARAGLEDLQVAFRAFDQDGDGHITVDELRRAMAGLGQPLPQEELD AMIREADVDQDGRVNYEEFARMLAQE >gi568815588r:5399001_5599438|GENSCAN_predicted_CDS_5|441_bp atggccggtgagctgactcctgaggaggaggcccagtacaaaaaggctttctccgcggtt gacacggatggaaacggcaccatcaatgcccaggagctgggcgcggcgctgaaggccacg ggcaagaacctctcggaggcccagctaaggaaactcatctccgaggttgacagcgacggc gacggcgaaatcagcttccaggagttcctgacggcggcgaagaaggccagggccggcctg gaggacctgcaggtcgccttccgcgccttcgaccaggatggcgacggccacatcaccgtg gacgagctcaggcgggccatggcggggctggggcagccgctgccgcaggaggagctggac gccatgatccgcgaggccgacgtggaccaggacgggcgggtgaactacgaggagttcgcg aggatgctcgcccaggagtga >gi568815588r:5399001_5599438|GENSCAN_predicted_peptide_6|312_aa MDKDRNTVIPPAHLTNDQDEKPTEVLREVRGGKGSCHGSGPHVYNGRHGSVQGVLKERWL GRRILQRKGEKGLEVTHGFLEGSGEQGLRGCGSQAPGQEALHGNRSMNSTEVRPGTDPKA ATFVEIRFDVHFGSAEKVDSVLQNGPARALRPLCVSAAPGLRTSGWCRQSSAAKEFTGRV SCQAADTLCVEPEEVLCEISPCGKKEAEARGWSNHTPDGNRGSGGFDVVLPSLGTEARAE KEGPKELSTPGCDQELNVPEEGPGPRAAGEGRTPHHTLHTGPSHFPGAALSSEKKTHIVR IVTIILSRDETA >gi568815588r:5399001_5599438|GENSCAN_predicted_CDS_6|939_bp atggacaaggacaggaatacggtcatacctcctgcccacttgaccaacgaccaggatgag aagccaacagaagtccttcgagaggttcgtgggggcaaaggaagctgtcatggctcaggt cctcatgtgtacaatgggcgccatggcagcgtccagggagtcctgaaggaacgatggtta gggcggagaatcctgcagaggaaaggagagaaaggactagaggtcacacacggcttcttg gaggggagcggggagcaggggctccgaggctgcgggtcgcaggctccagggcaggaagct ctccatgggaaccgtagtatgaattccaccgaggtgcgtcccggcacagacccgaaggca gccacatttgtggaaataagatttgacgtgcatttcggatccgcagagaaggtggactct gttttgcagaatggaccagcgcgcgcactgaggccactttgtgtcagcgcagcgcctggc ctgaggacctcagggtggtgtcggcaaagttccgctgccaaggagttcacgggccgtgtt tcctgccaggccgcagacactctctgcgtggagcccgaggaggttctgtgtgagatctca ccctgcgggaagaaagaggcggaggctcggggttggtccaatcacacccctgatggtaac cggggcagcggtggcttcgatgtggttctgcccagtttgggcaccgaggcgcgggcagag aaagaggggcccaaggagctgtcgacccctggttgcgatcaggagctgaacgtgcccgag gaagggcctggacccagggcagccggtgagggacgaacaccacatcacaccctccacaca ggcccttctcatttccctggggctgccctgtcctcagagaagaaaacgcacatcgtgaga atagtgaccatcatcctgtctcgtgatgagacagcctga >gi568815588r:5399001_5599438|GENSCAN_predicted_peptide_7|713_aa MADQLTEEQVTEFKEAFSLFDKDGDGCITTRELGTVMRSLGQNPTEAELRDMMSEIDRDG NGTVDFPEFLGMMARKMKDTDNEEEIREAFRVFDKDGNGFVSAAELRHVMTRLGEKLSDE EVDEMIRAADTDGDGQVNYEEFVRVLVSNWVTALTTVSTVCCHMALPPPNLLCWGKEDQA RGGRSSDSALWHGSFCLTGRDAQGTPCPHQQTPNPQYECISHRASIADLHGMASATLTQF THGNYIKLASDSRVSNFKSNAAQPCRLLFILKDSKGLFKGSQNQPAKNDPRLPLEKCGIK TRVWGSVQRFLPDSGSCPPGECSSFLSAPHCGVQQGDSGKGQEMQLPPGSWKLLTSQVAS EDLQNREGRNLHRQGPGFRQKTTPTPATWIRKIPVHTCMGVKLLTWWLKSLAINQSIHSC LLRTAGKRKHDIGAQRAWLELQRPGKIFSGQKHNTTPYESHKSSTKQLLLPHRTIYGASY WTATYDLPSPPQNRSTKLPPHCSLTQTKIPPLSAREEPTSAAPRQVPLTDPLLQPPVQVA APGLDTGWRVIVVVASESAKRCCHLAWACYEVILQEVQDNRQLLHQSEWYRDFTLPCTWF MIQHILQDPKATGQGPSCTHRHLLLSGSHRKRAVTLNNTGYGTLKGTSTDAQPARADART LPAQGLVPELRTLYSGKQKAALGPRAPLRVCTCSDADTRACTRLHMYTLTGTC >gi568815588r:5399001_5599438|GENSCAN_predicted_CDS_7|2142_bp atggccgaccagctgactgaggagcaggtcacagaattcaaggaggccttctccctgttt gacaaggatggggacggctgcatcaccacccgcgagctgggcacggtcatgcggtccctg ggccagaaccccacggaggccgagctgcgggacatgatgagtgagatcgaccgggacggc aacggcaccgtggacttccccgagttcctgggcatgatggccaggaagatgaaggacacg gacaacgaggaggagatccgcgaggccttccgcgtgttcgacaaggacggcaacggcttc gtcagcgccgccgagctgcgacacgtcatgacccggctgggggagaagctgagtgacgag gaggtggacgagatgatccgggccgcggacacggacggagacggacaggtgaactacgag gagtttgtccgtgtgctggtgtccaactgggtcactgccctcaccacagtgagcaccgtc tgctgccacatggctctgccccctcccaaccttctgtgttggggtaaggaagaccaggcc agagggggacgcagctctgacagtgctctctggcacggctccttctgcctcacggggcgt gatgcccagggaaccccctgccctcaccagcagaccccaaacccacagtatgaatgcatc tctcaccgggctagcattgctgacctccacggcatggcatcagctactctgactcagttc acccacggcaactatataaagctcgcctctgactcccgtgtttccaattttaaatcgaat gcagcccagccctgccggctcctgtttatcctcaaagattccaaaggcctgttcaaaggc tcccagaaccagccagctaaaaatgacccacggcttccgctagagaaatgtggtatcaaa actcgtgtatggggctctgtccagaggttcttacctgacagtgggtcctgccctccaggg gaatgttccagcttcttgtctgcaccacactgtggagtccagcaaggtgactcagggaag ggacaggagatgcagctacctcctggttcctggaagcttctgacatcccaggtggcctct gaagatctgcagaatagggaggggaggaatctgcacagacaagggccagggttcagacag aagacaacgcccacaccagccacttggatccggaagattcctgtacacacttgcatgggt gtaaaattactaacctggtggctaaagagccttgcgatcaaccagtcaatccatagctgc ttgttgagaacagcgggaaaacggaaacatgatattggcgctcagagagcttggctggag ctccagagaccagggaaaatcttcagtggccaaaagcacaacacgacaccttacgagtct cacaagtcctccacaaagcagctgcttctgcctcacagaaccatttatggggccagttac tggactgctacctatgaccttccatctccaccccagaaccgaagcacaaaactcccacct cattgctccctgacccagacgaaaattcctcccctctcagccagagaggaacccacttct gcagccccccgtcaggtccccctcacagacccacttctgcagccccctgtccaagtcgct gccccaggtctggacacgggctggagggtcatcgtggtagttgcctctgagtcagcaaaa cgctgctgtcaccttgcctgggcctgctacgaggtcatcttacaggaggttcaggacaat aggcagctgctccaccagagtgagtggtacagggacttcaccctcccctgcacctggttc atgatccagcacattcttcaggatccaaaggccacagggcagggtcccagctgcactcac aggcacctgctgctttctggcagtcatcgtaaacgcgcagtgaccttaaataatacaggc tacgggacactcaaaggcacgtccacagatgcacagccagcccgtgcagacgctcggacg ctgcctgcccagggcctagtccctgagctcaggaccttgtactcagggaaacagaaggca gccctgggccccagagctcccctgcgagtgtgcacatgctcagatgctgacacccgagcc tgcacacgcctgcacatgtatacgctcacaggcacatgctga >gi568815588r:5399001_5599438|GENSCAN_predicted_peptide_8|125_aa MLGLHSPRQSEGFLGFLTLLTPPSPWPLEREGFWLCFLLECPPLHLFVIALLFGTLFPPL TCGASSGTGSETTAVGQRSTYCLRASRAEQTCLRFSLQLVIWETGQLRLLPGQPVVRKKI PLKHF >gi568815588r:5399001_5599438|GENSCAN_predicted_CDS_8|378_bp atgctggggctgcactcaccccgccagtctgaagggttcctgggctttctcaccctcctg accccgccctccccctggcccctggagcgggagggcttctggctctgcttcctgctcgag tgccctccactccatctctttgtcattgccttactttttgggaccctgttccctcccctt acttgcggggcatcctcggggacaggttctgaaaccacagctgtgggccagcgttccacc tattgtctgcgagccagcagagcagaacaaacctgcctgagattctcattacagcttgtg atttgggagacgggccagctgcggctgttgccagggcaaccagtggtgcggaaaaagatc ccattaaaacacttctga