GENSCAN 1.0    Date run: 3-Nov-116    Time: 17:54:41

Sequence gi568815587f:69541314_69751279 : 209966 bp : 50.28% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -    150    145    6                               -0.45
 1.05 Term -   1261   1187   75  1  0   80   48    64 0.481  -0.56
 1.04 Intr -   2968   2894   75  1  0  103  102    45 0.618   7.11
 1.03 Intr -   3166   3063  104  2  2   92   55    77 0.314   4.69
 1.02 Intr -   4632   4487  146  2  2   67   74    23 0.219  -1.27
 1.01 Init -   5672   5425  248  2  2   59   89    80 0.223   2.27
 1.00 Prom -   7435   7396   40                               -9.26

 2.03 PlyA -   7486   7481    6                                1.05
 2.02 Term -  11691  11122  570  0  0   44   43   298 0.971  15.34
 2.01 Init -  12684  12643   42  0  0   68   94    25 0.843   1.62
 2.00 Prom -  19888  19849   40                               -3.66

 3.00 Prom +  23810  23849   40                               -4.66
 3.01 Init +  31880  31953   74  1  2   90   71    62 0.565   5.24
 3.02 Intr +  33462  33627  166  0  1   91   80     1 0.351  -0.44
 3.03 Intr +  39571  39708  138  0  0   80   80    31 0.596   2.16
 3.04 Term +  40337  40405   69  1  0  102   37    83 0.963   2.54
 3.05 PlyA +  40958  40963    6                                1.05

 4.00 Prom +  42146  42185   40                                0.54
 4.01 Init +  42761  42768    8  1  2   93   91     0 0.850   1.30
 4.02 Intr +  44902  45047  146  1  2   23  101    88 0.805   3.53
 4.03 Intr +  52479  52642  164  1  2   98   23    87 0.197   2.89
 4.04 Intr +  55034  55184  151  1  1   83   72    49 0.811   2.54
 4.05 Intr +  60279  60406  128  1  2   40   30   126 0.507   2.40
 4.06 Intr +  64565  64709  145  1  1   69   72    51 0.529   1.46
 4.07 Intr +  70510  70639  130  2  1   80   37    80 0.447   1.85
 4.08 Intr +  72208  72413  206  1  2   51   26   147 0.091   3.64
 4.09 Intr +  87355  87507  153  2  0   26  -21   174 0.082   0.34
 4.10 Term +  90478  90740  263  2  2   76   37   177 0.130   7.09
 4.11 PlyA +  90903  90908    6                                1.05

 5.00 Prom +  91165  91204   40                               -5.66
 5.01 Init +  96619  96930  312  0  0   94   85   219 0.678  17.53
 5.02 Intr +  99806 100198  393  1  0    7   64   447 0.411  29.05
 5.03 Intr + 101718 102251  534  2  0   77   32   726 0.338  59.02
 5.04 Intr + 102519 102678  160  2  1   82   91   273 0.995  26.56
 5.05 Intr + 106681 106829  149  2  2  121  131   184 0.996  25.95
 5.06 Term + 109805 109969  165  1  0  132   55   370 0.999  36.02
 5.07 PlyA + 113141 113146    6                                1.05

 6.07 PlyA - 117681 117676    6                                1.05
 6.06 Term - 125096 125013   84  2  0  100   49    77 0.801   2.65
 6.05 Intr - 126699 126582  118  2  1   79   96   111 0.906  11.47
 6.04 Intr - 130506 130436   71  0  2   82  103   -10 0.868  -2.12
 6.03 Intr - 132008 131903  106  1  1  117  107    87 0.983  13.72
 6.02 Intr - 134014 133877  138  0  0   28   90   155 0.335   9.28
 6.01 Init - 134107 134037   71  1  2   72   31   125 0.743   3.72
 6.00 Prom - 135610 135571   40                               -5.76

 7.06 PlyA - 136524 136519    6                                1.05
 7.05 Term - 145973 145890   84  2  0  107   45    75 0.435   2.75
 7.04 Intr - 156256 156050  207  1  0   29   56   118 0.187   1.97
 7.03 Intr - 158263 157976  288  1  0   84   50   427 0.692  35.94
 7.02 Intr - 162051 161948  104  1  2   84  119   250 0.999  27.59
 7.01 Init - 162563 162332  232  2  1   81   79   436 0.996  38.42
 7.00 Prom - 162742 162703   40                               -2.26

 8.08 PlyA - 163692 163687    6                               -0.45
 8.07 Term - 165187 165060  128  2  2  113   53    68 0.116   4.24
 8.06 Intr - 180240 180033  208  0  1   69   55    87 0.013   2.15
 8.05 Intr - 186365 186232  134  0  2   92   67    23 0.009   0.96
 8.04 Intr - 196155 196013  143  2  2   44   91   111 0.010   7.00
 8.03 Intr - 203644 203437  208  2  1   17  -13   192 0.081   0.14
 8.02 Intr - 205886 205778  109  2  1   33   25    98 0.227  -2.14
 8.01 Intr - 207141 207058   84  0  0   73   71    93 0.531   6.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  90432  90740  309  2  0   25   37   225 0.815   7.00

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:69541314_69751279|GENSCAN_predicted_peptide_1|215_aa
MLPGAPEPCQMRYETESTFGEDGDNNEKVVTTHDSAIVFLHSTSPSSMVPRCHDSACKTW
CSEKHPGVQERLASHRYTFKKRRCGSQKSKIKVSARLVPPGGSLRGYGRLWAVFSVHPPN
LLLCLHWPSSLGLGKSGTLLIQAHGTDCFLYPEQSSPGESHYLPAVSCAPPGVLLPADYG
FQIHLPYRKCKMLAENFLEATVIFRSALGAFMELS

>gi568815587f:69541314_69751279|GENSCAN_predicted_CDS_1|648_bp
atgctcccgggtgcgcctgagccttgccagatgcgctatgaaactgagtcaacttttgga
gaggatggtgacaataatgagaaagtagtcaccacacacgactctgccatcgtcttcttg
cactccacctctccctcctcgatggtacccagatgccatgactcagcgtgcaagacctgg
tgttctgagaagcacccgggagtgcaggagaggttagcatctcacagatacacttttaaa
aaacgtcgttgtggaagccagaagtccaaaatcaaggtgtcggccaggttggttcctcct
ggaggctctctgaggggctacgggaggctgtgggcagtcttcagtgttcatccccccaat
ctcctcctctgtcttcactggccttcttccctcggccttgggaaatcgggcacacttctc
attcaagcccatggcactgactgctttctgtacccggagcagtcctcccctggggagtcc
cactacctccctgcagttagctgtgctccaccgggcgttctgttacctgcggactatggc
tttcaaatccatctcccttaccggaaatgcaagatgctggcagagaatttcctggaagcc
actgtgatatttcggagtgccctcggggccttcatggaactctcatga

>gi568815587f:69541314_69751279|GENSCAN_predicted_peptide_2|203_aa
MTLTFLLPDYGEAQEANPRPETPDTPNPRPETPDTPNPRPEIPDAPNPRSETPDTPNPRP
EIPDAPNPRPETPDTPNPRSEIPDTPNPRPEIPNAPNPRPEIPDAPNPRPETPDTPNPRP
ETPDTPNPRPEIPDAPNPRPEIPDAPNPRPEIPDAPNPRPEIPDALNPRPETHDAPNPRP
DTPHTPNPRPETPDAAVLTPGSK

>gi568815587f:69541314_69751279|GENSCAN_predicted_CDS_2|612_bp
atgaccctcactttcctgcttcccgactatggagaggcacaggaagcaaatcctaggcct
gaaacacccgataccccaaatcctaggcctgaaacacccgataccccaaatcctaggcct
gaaatccctgatgccccaaatcctaggtctgaaacccctgataccccgaatcctaggcct
gaaatccctgatgccccaaatcctaggcctgaaacacccgataccccaaatcctaggtct
gaaatccccgataccccaaatcctaggcctgaaatccccaatgccccaaatcctaggcct
gaaatccctgatgccccaaatcctaggcctgaaacacccgataccccaaatcctaggcct
gaaacccccgataccccaaatcctaggcctgaaatccccgatgccccaaatcctaggcct
gaaatccccgatgccccaaatcctaggcctgaaatccccgatgccccaaatcctaggcct
gaaatccccgatgccctaaatcctaggcctgaaacccacgatgccccaaatcctaggcct
gacacaccccataccccaaatcctaggcctgaaacccctgatgccgcagtcctaactccg
ggaagcaaataa

>gi568815587f:69541314_69751279|GENSCAN_predicted_peptide_3|148_aa
MPKVPAHAVLKSSPVCKHRSEGNRRCPRLPATLPSPGLSLRLPAPTIPSPAFAPVCTYLV
PQANSLMSHHQDHRGPGKSESHKISGYWRSYETISFIEGPPKECCSFRHSLCSVHAIQPN
LPGVEKALNGLDGAHSHCVTVNLLYTIH

>gi568815587f:69541314_69751279|GENSCAN_predicted_CDS_3|447_bp
atgcccaaggtgcctgctcatgctgtactgaaatccagtcctgtgtgcaaacacagaagt
gaaggcaatcgcaggtgtccccgactccctgcaactctcccttcacccggcttgagccta
cggcttcctgcccccactatcccatcacctgcttttgctcccgtctgcacctaccttgtc
ccccaagccaacagcctcatgtctcatcatcaagaccaccgaggcccagggaagtcagag
agtcataaaatctcaggctactggagatcttatgagaccatttctttcattgaaggtcct
ccaaaagaatgctgcagcttcagacacagcctatgctctgtccatgcaatacaaccaaac
ttaccaggtgtggaaaaagccctcaatggattggatggtgcccactcacactgcgtgacg
gtcaatcttctttacacaatccactga

>gi568815587f:69541314_69751279|GENSCAN_predicted_peptide_4|497_aa
MPSLHVDIRCNNSSSKYHTLSTHSTGIRWDVNHDNEPMTVLRLGFSKWGPDARCKPLKAE
AASPSCCPPQGLIGGCTKRLHREDEVLALKASSASSRDLGEQILSWSPQVDTAHLRSEAQ
HPLLPHWEHLQDHEGPSSPWTQHPAPCERLGFQDLSVKVQIPGLAFKSLQTDYFSSIHFR
ALWSTQQSSRFLIPKLTIQPEYPTSYIVRARSILPGKWPQRPVVQSTQRLKQRLECEPFM
EKYDLRWDLPDDASVTSSFPGYMGTHLAGEPSRQEGQGKRPKRLSRMPSSSTCNKGSSRS
SHFHGFEGHTSTNNIPRCLQQPYCDLKKSTISTNDKATRPAAMAVAHGLPSSATVQLEVA
CKALLAVGWGLQFLAMWTTQRTTHNIAAGFSKGKGSKTASKADSTAFSEPVFGSKQQKPM
LADFSRKGKSDAEDRQEQGRQDSTNHNGSMLQIFSDEKPGLPLPIPTPDPPVKLQRPTPP
CHQSPQLYDGQHRYCCS

>gi568815587f:69541314_69751279|GENSCAN_predicted_CDS_4|1494_bp
atgcccagcctgcacgttgacatccgttgcaacaactcttcctccaagtatcacacactc
agcacccactccactgggattcgctgggatgtgaaccatgacaatgaaccaatgactgta
cttcgcctaggcttttccaaatggggcccagatgccaggtgtaaaccgctgaaggcagag
gccgcatctccaagctgttgccctccccagggcctgattggaggctgcaccaagaggctg
cacagggaggatgaggttttggccttgaaggccagctctgccagctccagggaccttgga
gaacagatcctaagttggagtccccaggtggacaccgcccacctgcgctcagaagcccag
catccccttctacctcactgggagcacctgcaggaccacgagggtccctctagtccttgg
actcagcatccagcaccatgcgagcgccttggctttcaggacttgtcagtcaaagtccag
attcctggcttggccttcaagtctctccagaccgactacttttccagcatccacttcaga
gccctgtggtcaacacagcagagcagccgctttctcatccccaagctcaccatccagccc
gagtatcctacctcatacatcgtgagagctcgatcaatattaccgggtaaatggcctcag
cgcccagtggtgcagagtacgcagaggctgaaacaaaggcttgagtgtgagcccttcatg
gagaagtatgatctgaggtgggaccttcctgatgatgccagtgtgaccagctctttccct
ggatacatgggcacccacctggccggggagccctcgcggcaagagggacagggaaaaaga
ccaaagcgcctatcacgaatgccatcctcaagcacatgcaataaaggcagcagcaggtct
tctcacttccatggcttcgaagggcacaccagcacaaacaacattccgagatgtctgcag
caaccgtattgtgacttgaaaaaatccacaatttctacaaacgacaaagccacacgccct
gctgctatggctgtggctcatggcctaccctccagtgccaccgtgcagttagaggttgcc
tgcaaggccttgctggctgtgggctggggactgcagttcctggccatgtggacaacccag
aggacaactcacaacatagcagctggcttctccaagggcaagggctccaagaccgccagc
aaggcagacagcacagccttttcagaacccgtgtttggttccaagcaacagaaaccaatg
ctggcagatttcagcagaaaaggaaaatcagatgcagaagataggcaggaacaagggagg
caagacagcacgaatcacaatggaagcatgctgcagatcttctctgatgagaagccaggg
ctacccctacctatcccaacaccagaccctccagtaaaactgcagcggcccacaccacct
tgtcaccagagcccacagctctatgatgggcagcacagatactgctgctcataa

>gi568815587f:69541314_69751279|GENSCAN_predicted_peptide_5|570_aa
MGGAVRLSMCRACGCLGWALGPEFAPAHLAVLGRLGSALRKRSRAAIARGPPRAPQKQAV
WGEGELERLRSPGAGFRKRFPGRLKGLTMENSRSLSQVLSSRRQGSFVEVAKSWSLQRAV
GAVAASSRVRTLRRGAEEREGARGSRSESRARTQPGPTALPSCPGRAPAMEHQLLCCEVE
TIRRAYPDANLLNDRVLRAMLKAEETCAPSVSYFKCVQKEVLPSMRKIVATWMLEVCEEQ
KCEEEVFPLAMNYLDRFLSLEPVKKSRLQLLGATCMFVASKMKETIPLTAEKLCIYTDNS
IRPEELLVTTGPRRPPPPASRTQDHGAGEGAGGGGRPASDISAPPREGGPAAGRPCPGSG
RDPSRPRPAALCALACDSHRVRAPRCGRKVGGARPPAAARGAPRSALSLQFQVQMELLLV
NKLKWNLAAMTPHDFIEHFLSKMPEAEENKQIIRKHAQTFVALCATDVKFISNPPSMVAA
GSVVAAVQGLNLRSPNNFLSYYRLTRFLSRVIKCDPDCLRACQEQIEALLESSLRQAQQN
MDPKAAEEEEEEEEEVDLACTPTDVRDVDI

>gi568815587f:69541314_69751279|GENSCAN_predicted_CDS_5|1713_bp
atggggggggcggtgcggctgtccatgtgccgagcgtgtggctgtctcgggtgggcactg
gggccggagttcgccccggcccacctcgcagttttggggcgcctgggatcggcgctacgt
aagcgaagcagagctgccatagcacgtgggccgccacgcgcaccccaaaagcaagcagtg
tggggggaaggggagctcgagcgccttcggagcccaggggccggctttcggaagcgtttt
cccgggcgacttaagggcttaacaatggaaaactcgcggagcctgagccaagtcctttca
agtcgccgccaggggagttttgttgaagttgcaaagtcctggagcctccagagggctgtc
ggcgcagtagcagcgagcagcagagtccgcacgctccggcgaggggcagaagagcgcgag
ggagcgcggggcagcagaagcgagagccgagcgcggacccagccaggacccacagccctc
cccagctgcccaggaagagccccagccatggaacaccagctcctgtgctgcgaagtggaa
accatccgccgcgcgtaccccgatgccaacctcctcaacgaccgggtgctgcgggccatg
ctgaaggcggaggagacctgcgcgccctcggtgtcctacttcaaatgtgtgcagaaggag
gtcctgccgtccatgcggaagatcgtcgccacctggatgctggaggtctgcgaggaacag
aagtgcgaggaggaggtcttcccgctggccatgaactacctggaccgcttcctgtcgctg
gagcccgtgaaaaagagccgcctgcagctgctgggggccacttgcatgttcgtggcctct
aagatgaaggagaccatccccctgacggccgagaagctgtgcatctacaccgacaactcc
atccggcccgaggagctgctggtaaccactggaccccgccgccccccgccccccgcgagc
cgcacgcaggaccacggggccggggaaggtgcaggcggtggcggccggcccgcctctgac
atatctgctcctccgagggagggcggccccgccgccgggcgtccctgtccggggagcggg
cgggatcctagccgccctcgtcccgccgccctgtgtgcgcttgcctgcgactcccaccgc
gttcgcgccccgcggtgtggccgaaaagtgggcggcgcgcgccctccagcggctgcacga
ggagcgccgcgctcggcgctgagcctccagttccaggtgcaaatggagctgctcctggtg
aacaagctcaagtggaacctggccgcaatgaccccgcacgatttcattgaacacttcctc
tccaaaatgccagaggcggaggagaacaaacagatcatccgcaaacacgcgcagaccttc
gttgccctctgtgccacagatgtgaagttcatttccaatccgccctccatggtggcagcg
gggagcgtggtggccgcagtgcaaggcctgaacctgaggagccccaacaacttcctgtcc
tactaccgcctcacacgcttcctctccagagtgatcaagtgtgacccggactgcctccgg
gcctgccaggagcagatcgaagccctgctggagtcaagcctgcgccaggcccagcagaac
atggaccccaaggccgccgaggaggaggaagaggaggaggaggaggtggacctggcttgc
acacccaccgacgtgcgggacgtggacatctga

>gi568815587f:69541314_69751279|GENSCAN_predicted_peptide_6|195_aa
MYRLLWALPRAVLTRPAPRRRFRNGAFVPEAAGSATAPRGCRNPGAKGGLLAAMAGSQDI
FDAIVMADERFHGEGYREGYEEGSSLGVMEGRQHGTLHGAKIGSEIGCYQGFAFAWKCLL
HSCTTEKDSRKMKVLESLIGMIQKFPYDDPTYDKLHEDLDKIRGKFKQGHRSPHTLALHT
PPVPESMSESSPGDY

>gi568815587f:69541314_69751279|GENSCAN_predicted_CDS_6|588_bp
atgtacaggctgctttgggccctcccgcgtgccgttcttacccggcctgccccgcgccgc
cgcttccggaacggagcatttgtgcctgaagctgccgggtctgctacggcaccgcggggc
tgcagaaacccgggggccaagggcgggctgcttgccgctatggctggcagtcaggacata
ttcgatgccatcgtgatggcggatgagaggtttcatggggaagggtatcgggaaggctat
gaagaaggcagtagtttgggtgtgatggagggaaggcagcatggcacgctgcatggagcc
aaaatcgggtctgagatcgggtgctaccaaggttttgcttttgcatggaaatgtctactg
cacagttgcaccactgagaaggacagcagaaagatgaaggtcttagaatcattgattgga
atgatccagaaattcccttatgatgaccctacttacgataaactccatgaagacttagac
aagatcagaggaaaatttaaacagggtcaccgaagtccccacacgctggctctccacacc
cctcctgttccagaaagcatgtccgaaagcagtccaggagattattaa

>gi568815587f:69541314_69751279|GENSCAN_predicted_peptide_7|304_aa
MRSGCVVVHVWILAGLWLAVAGRPLAFSDAGPHVHYGWGDPIRLRHLYTSGPHGLSSCFL
RIRADGVVDCARGQSAHSLLEIKAVALRTVAIKGVHSVRYLCMGADGKMQGLLQYSEEDC
AFEEEIRPDGYNVYRSEKHRLPVSLSSAKQRQLYKNRGFLPLSHFLPMLPMVPEEPEDLR
GHLESDMFSSPLETDSMDPFGLVTGLEAGEGIISGWASLIKKAKTSLEPFSKFRLHPKNQ
VCVTGPGQPQLHGRLGRQELAVRVTRCGPHPECKEEQGNFNAYLGMDLTHRHHVYSPYFG
EDRE

>gi568815587f:69541314_69751279|GENSCAN_predicted_CDS_7|915_bp
atgcggagcgggtgtgtggtggtccacgtatggatcctggccggcctctggctggccgtg
gccgggcgccccctcgccttctcggacgcggggccccacgtgcactacggctggggcgac
cccatccgcctgcggcacctgtacacctccggcccccacgggctctccagctgcttcctg
cgcatccgtgccgacggcgtcgtggactgcgcgcggggccagagcgcgcacagtttgctg
gagatcaaggcagtcgctctgcggaccgtggccatcaagggcgtgcacagcgtgcggtac
ctctgcatgggcgccgacggcaagatgcaggggctgcttcagtactcggaggaagactgt
gctttcgaggaggagatccgcccagatggctacaatgtgtaccgatccgagaagcaccgc
ctcccggtctccctgagcagtgccaaacagcggcagctgtacaagaacagaggctttctt
ccactctctcatttcctgcccatgctgcccatggtcccagaggagcctgaggacctcagg
ggccacttggaatctgacatgttctcttcgcccctggagaccgacagcatggacccattt
gggcttgtcaccggactggaggccggagagggaatcatttcgggctgggcctctctcatc
aagaaagccaaaacttccttagaacctttcagcaaattccggctacatcctaagaaccag
gtttgcgtcacaggaccaggccagccccagctgcacggcaggctgggaaggcaagaatta
gcggtcagggtgaccagatgtgggccacatcctgagtgcaaggaggagcagggaaatttt
aatgcatacctgggcatggacctgactcatcggcaccacgtatactcaccttactttggg
gaagaccgagaatga

>gi568815587f:69541314_69751279|GENSCAN_predicted_peptide_8|337_aa
LIAVVVNEQTSYSNGTYAEVGERRQVKQHPEAEASSSPVRPHRPCGSRFQAQRSTLAYGH
PFLKVLLLNNTPKNSKHTSTQKPIHKCSQQQIHKNQMSTRIPVSGRKDQQNAIFPYNGMS
FSTNGNEAGTQATVGMGNFESISTSAKMVNGRKMTTKRTVENGQERVEVEEDSRLKSLTI
HAALSSGPSALGCATPIMALTAQYLPCLSAVPSMGRQGQCSAAVGQCRLANICPVNQSTC
GLARFDRSPRISWSPLEKTPSWAANAFEKVKSLLRKHKVSQRAWLQEVTALGSQASLAPS
NHTHQGAQQTPKIVCLAHLGQTLEARQSISVTVSYLT

>gi568815587f:69541314_69751279|GENSCAN_predicted_CDS_8|1014_bp
cttattgctgtggtagtgaatgaacaaacatcctactccaacggcacttatgctgaggtg
ggggagagaagacaagtaaaacaacacccagaggctgaagccagcagcagccccgtgagg
ccccacaggccctgtgggagcagattccaggcccagcgatcaaccctcgcctacggccac
cccttcctgaaagtcctactgctgaataacactccaaagaactcaaaacacacgtctaca
caaaaacctatacacaagtgttcacagcagcaaattcacaagaaccaaatgtcaactcga
atacccgtcagtggacgcaaagatcaacaaaatgccatcttcccatacaatgggatgtcg
ttcagcacgaacgggaatgaagccgggacacaggccacagttgggatgggcaacttcgaa
tcaatatcaacttctgctaaaatggttaatggcagaaaaatgactacaaagagaactgtt
gagaatggtcaagaaagagtagaagttgaagaagacagccggttaaagtccttaacaatc
catgctgccctcagttcagggccctcagccctagggtgtgccacccccatcatggctctg
accgcccagtacctgccctgcttatctgcggtgccctccatgggcaggcagggccagtgc
agtgcagcagttggacagtgtcggctggcaaacatctgtcctgtaaatcaaagcacttgc
ggactcgccaggtttgacaggtctccgagaatttcatggagccctttggagaagacacca
tcatgggctgcgaacgcatttgaaaaagttaaaagcctccttcgcaagcacaaagtgtcc
cagagagcctggctccaagaggtcacagccctgggcagccaagccagcttggctccctct
aaccacacacaccagggagcccagcaaacacccaagatcgtgtgcctggcccacctgggg
caaaccctcgaggcccggcagtccatctctgtaaccgtcagctacctcacatga