GENSCAN 1.0 Date run: 8-Nov-116 Time: 03:29:08 Sequence gi568815578f:56259006_56466284 : 207279 bp : 43.62% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2140 2193 54 1 0 69 97 35 0.355 0.59 1.02 Intr + 5151 5333 183 0 0 116 84 69 0.758 8.20 1.03 Intr + 16427 16635 209 2 2 30 35 117 0.454 -0.68 1.04 Term + 17136 17287 152 1 2 84 43 137 0.860 6.87 1.05 PlyA + 18822 18827 6 1.05 2.00 Prom + 34539 34578 40 -3.46 2.01 Init + 36640 36650 11 0 2 74 100 6 0.200 0.36 2.02 Intr + 51689 51747 59 2 2 102 100 10 0.065 2.13 2.03 Intr + 58101 58251 151 1 1 76 88 84 0.372 6.42 2.04 Term + 63461 63575 115 2 1 93 54 94 0.338 4.54 2.05 PlyA + 64075 64080 6 1.05 3.00 Prom + 65372 65411 40 -2.76 3.01 Init + 68722 68752 31 0 1 63 116 -5 0.072 -0.30 3.02 Term + 78569 78789 221 0 2 64 54 146 0.924 6.00 3.03 PlyA + 79374 79379 6 1.05 4.00 Prom + 80216 80255 40 -4.96 4.01 Sngl + 81018 82067 1050 2 0 33 42 455 0.763 32.56 4.02 PlyA + 82154 82159 6 1.05 5.02 PlyA - 83378 83373 6 1.05 5.01 Sngl - 85386 84934 453 0 0 83 52 166 0.934 8.92 5.00 Prom - 88393 88354 40 -5.26 6.00 Prom + 92448 92487 40 -5.96 6.01 Init + 100001 100186 186 1 0 110 115 174 0.999 19.36 6.02 Intr + 106082 106257 176 1 2 93 116 70 0.994 9.04 6.03 Term + 107066 107282 217 2 1 128 39 212 0.861 16.92 6.04 PlyA + 108905 108910 6 1.05 7.06 PlyA - 110404 110399 6 1.05 7.05 Term - 110850 110788 63 0 0 69 49 27 0.439 -5.21 7.04 Intr - 111654 111480 175 2 1 83 94 130 0.578 13.04 7.03 Intr - 114551 114403 149 2 2 25 91 48 0.404 -2.17 7.02 Intr - 122566 122428 139 0 1 57 93 72 0.767 5.07 7.01 Init - 123536 123472 65 2 2 56 68 58 0.558 1.23 7.00 Prom - 125464 125425 40 -7.76 8.00 Prom + 125542 125581 40 -4.56 8.01 Init + 127313 127317 5 1 2 74 71 0 0.078 -3.63 8.02 Intr + 132519 132613 95 0 2 79 34 118 0.243 5.11 8.03 Intr + 136516 136716 201 2 0 117 1 304 0.744 23.86 8.04 Intr + 138202 138479 278 2 2 85 83 170 0.980 13.44 8.05 Intr + 138639 138836 198 2 0 94 91 80 0.997 8.45 8.06 Intr + 139962 140352 391 2 1 79 79 114 0.504 3.90 8.07 Term + 144463 144722 260 2 2 60 52 227 0.494 12.01 8.08 PlyA + 145231 145236 6 1.05 9.00 Prom + 151951 151990 40 -5.86 9.01 Init + 153454 153489 36 0 0 96 113 25 0.201 5.95 9.02 Intr + 178159 178581 423 0 0 112 101 153 0.627 13.06 9.03 Intr + 186952 186996 45 0 0 104 105 58 0.636 7.71 9.04 Intr + 191594 191674 81 1 0 77 84 78 0.980 6.13 9.05 Intr + 192814 194124 1311 0 0 82 109 953 0.358 84.49 9.06 Term + 199335 199742 408 2 0 101 55 521 0.953 45.32 9.07 PlyA + 200312 200317 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:56259006_56466284|GENSCAN_predicted_peptide_1|199_aa XFPICNRNGSCTYQLTLGRSSLGSFHMEQQAFWELKAARRTGAQRACLGTESDMAGESDM PRHGERRTLYGIPRALDGFLIGGYPAEEWQMDFTHMPKTKGIQYLLVWVDTFTNLVEAFP CQTEKASEVIKVRINKMMPCFGLPKYVHSAVKVTEIDSCIPYTRVKAWETNEIASVGPGE HLKYWSEEIGDFELKNHQD >gi568815578f:56259006_56466284|GENSCAN_predicted_CDS_1|600_bp nntttccccatctgtaacaggaatggttcttgtacttaccagctgacactgggcaggagc agcctgggcagtttccacatggagcaacaggcattctgggaactgaaagcagctcgtagg actggagcacagagggcttgtttaggaacagagagtgacatggctggagagtcagacatg cctcgtcatggggaaagaagaactttatatggcatcccgagggctctggatggatttcta ataggaggctacccagcggaagagtggcagatggatttcacccatatgccaaagacaaag ggcatccagtacctcctagtatgggtagataccttcactaacttggtagaagcatttcca tgtcaaacagagaaagcctctgaggtgataaaagtacgcattaacaaaatgatgccttgc tttggacttcctaagtatgtccacagtgcagtaaaggtcactgaaatagattcttgtatt ccttatactcgagttaaggcctgggaaaccaatgaaattgcctctgttggtccaggagag cacctaaagtactggagtgaggaaatcggggacttcgagctaaaaaatcaccaagattag >gi568815578f:56259006_56466284|GENSCAN_predicted_peptide_2|111_aa MNKRSFCRHPEMKGGLSLAFRTEEPIQMRRNQKTNPGNMTKQGSLTPPESHMSSPAMDPN QKEIPDLPEKEFRRCQYNDLCKLYQLHGSPDFSKMSTVGQNHLLFLSTLTQ >gi568815578f:56259006_56466284|GENSCAN_predicted_CDS_2|336_bp atgaacaaaaggtccttctgcagacacccagagatgaagggagggctgtcccttgccttt cgcacagaagagcctatccaaatgagaaggaaccagaaaaccaaccctggtaatatgaca aaacaaggctctttgacacccccagaaagtcacatgagttcaccagcaatggatccaaac caaaaagaaatccctgatttacctgaaaaagaattcaggagatgccagtacaacgacctc tgcaagttgtaccaactacatgggtctccagacttttccaaaatgtctactgtgggacaa aatcaccttctcttcctctccacgctcacacagtga >gi568815578f:56259006_56466284|GENSCAN_predicted_peptide_3|83_aa MLKSKGQSKKGLQLLASPENDFGKLTEVGFRRSVITNFSKLKEHVQTHRTQAKNLEKRLD EWLTTINSVEKTLSDLTELKTMA >gi568815578f:56259006_56466284|GENSCAN_predicted_CDS_3|252_bp atgctgaaaagcaagggtcaaagtaaaaaaggattgcagctccttgccagcccagagaat gactttggcaagctgacagaagtaggcttcagaagatcggtaataacaaacttctccaag ctaaaggagcatgttcaaacccatcgcacacaggctaaaaaccttgaaaaaaggttagat gaatggctaactacaataaacagtgtagaaaagaccttaagtgacctgacagagctgaaa accatggcatga >gi568815578f:56259006_56466284|GENSCAN_predicted_peptide_4|349_aa MKKREKNQIDAIKNDKGDITTNPTEIQTTIREYYKHLHANKLENLKEMHKFPDTYIHPRL NQEEVESLNRPTTGSEIESIINRLPTKKSPGPDGFTAEFYKRYKEEVIPLLLKLFQSIEK EGILSNSFYEASIILIPKPGRDTTKKENFRPISLVNIDVKILNKILANRIQQHIKKLIHH DQVGFIPGMQGWFNVRKSIHVIHHINRTNNKNHMIISIDAEKAFDKIQQPFMLKILNKLG IDGTYLKIIKAIYDKTTANIILNGQKLDAFHLKTSTRQGYPFSQLLFNMVLEVLAGAIRQ EKEIKSIQLGREEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKLSG >gi568815578f:56259006_56466284|GENSCAN_predicted_CDS_4|1050_bp atgaagaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggatatcacc accaatcccacagaaatacaaactactatcagagaatactataaacatctccatgcaaat aaactagaaaatctaaaagaaatgcataaattcccggacacatacatccacccaagacta aaccaggaagaagttgaatctctgaatagaccaacaacaggctctgaaattgagtcaata attaataggctaccaaccaaaaaaagtccaggaccagatggattcacagccgaattctac aagaggtacaaagaggaggtgataccactccttctgaaactattccaatcaatagaaaaa gagggaatcctctctaactcattttatgaggccagcatcatcctgataccaaagcctggc agagacacaacaaaaaaagagaattttagaccaatatccctggtgaacatcgatgtgaaa atcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccat gatcaagtgggcttcatccctgggatgcaaggctggttcaacgtacgcaaatcaatacac gtaatccatcacatcaacagaaccaataacaaaaaccacatgattatctcaatagatgca gaaaaggccttcgacaaaattcaacagcccttcatgctaaaaattctaaataaactaggt attgatggaacatacctcaaaataataaaagctatttatgacaaaaccacagccaatatc atactgaatgggcaaaaactggacgcattccatttgaaaaccagcacacgacaaggatac cctttctcacaactcctattcaacatggtattggaagttctggctggggcaatcaggcaa gagaaagaaataaagagtattcaattaggaagagaggaagtcaaactgtccctgtttgca gatgacatgattgtatatttagaaaaccccattgtgtcagcccaaaatctccttaagctg ataagcaacttcagcaaactctcaggataa >gi568815578f:56259006_56466284|GENSCAN_predicted_peptide_5|150_aa MGARVDSWDSTRRGRLPEPRLAWELSFLQTRARSFKRFPSAFPARSEAESCALVQEGNLD LMVAGGCRARPRKRALGDQGSAGVQLGGERGSWRGEERVPAPDDLAPGGLAHEVAPQALP RLRPRKGRRSPRLRGFAESQRQLLKTASGN >gi568815578f:56259006_56466284|GENSCAN_predicted_CDS_5|453_bp atgggggcaagggtggattcctgggattctacccgacgtgggaggcttcctgagccacgg ctggcctgggagctgagtttcttgcagacccgtgcccggtcattcaaacggttcccatca gcatttccagcacgctccgaggctgagagttgcgcgctcgtgcaggaaggcaatctggat ctgatggttgcagggggctgcagagctcggcctcggaagcgggcgttaggggatcagggc tccgcgggcgtccagcttggaggtgagcgggggtcctggcggggcgaggagcgggtcccg gcaccggatgaccttgcaccaggcggcctcgcccacgaggtggcgccgcaggctctgcca aggctccgaccgcgcaaggggaggcgcagcccaagacttcgaggtttcgcggagtcccaa aggcagctcctcaagacagcctcgggcaactga >gi568815578f:56259006_56466284|GENSCAN_predicted_peptide_6|192_aa MAGLLALLGPAGRVGARVRPRATWLLGATAPCAPPPLALALLPPRLDARLLRTARGDCRG HQDPSQATGTTGSSVSCTEEKKQSKSQQLKKIFQEYGTVGVSLHIGISLISLGIFYMVVS SGVDMPAILLKLGFKESLVQSKMAAGTSTFVVAYAIHKLFAPVRISITLVSVPLIVRYFR KVGFFKPPAAKP >gi568815578f:56259006_56466284|GENSCAN_predicted_CDS_6|579_bp atggccgggttgctggcgttgctgggtccggcaggcagggtgggcgcccgggtccggcct cgcgccacctggctcctgggcgccaccgccccctgcgccccgccgcccctggccctggcc ctgctcccgcccaggctagacgcccggctgctccgcacggcgcgcggggactgccgcggc caccaggaccccagccaggccacggggacaacaggcagcagcgtcagctgcacagaggag aaaaagcaaagcaagtcacagcaactgaaaaagatttttcaagagtatggcactgttggc gtgtcattgcacattggaatctcattaatttccttgggcatattttacatggttgtgtca agtggtgtggacatgcctgcaatcctgctgaaactcggatttaaagagtccctggtacag tcaaaaatggcagcaggcacaagtaccttcgtggtggcctatgcaatccacaagctgttt gcgccagtgagaatcagcattacgctagtctctgtgcccttgattgtcagatattttcga aaagtgggattttttaaacctccagctgcaaaaccttaa >gi568815578f:56259006_56466284|GENSCAN_predicted_peptide_7|196_aa MAPLILRYAGIASSYLQVDYGWHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSK FDEQRTATYITELANALSYCHSKRVIHRDIKPENLLLGSAGELKIADFGWSVHAPSSRRT TLCGTLDYLPPEMIEGRMHDEKVDLWSLGVLCYEFLVGKPPFEANTYQETYKRISREANP EPGCGESDHSALTPIS >gi568815578f:56259006_56466284|GENSCAN_predicted_CDS_7|591_bp atggcaccccttattctgaggtatgcaggaattgcgagcagctacctgcaggttgactac ggctggcatcctaatattcttagactgtatggttatttccatgatgctaccagagtctac ctaattctggaatatgcaccacttggaacagtttatagagaacttcagaaactttcaaag tttgatgagcagagaactgctacttatataacagaattggcaaatgccctgtcttactgt cattcgaagagagttattcatagagacattaagccagagaacttacttcttggatcagct ggagagcttaaaattgcagattttgggtggtcagtacatgctccatcttccaggaggacc actctctgtggcaccctggactacctgccccctgaaatgattgaaggtcggatgcatgat gagaaggtggatctctggagccttggagttctttgctatgaatttttagttgggaagcct ccttttgaggcaaacacataccaagagacctacaaaagaatatcacgggaggcaaatcca gagcctggctgtggggaaagtgaccactctgccctgaccccgatcagttaa >gi568815578f:56259006_56466284|GENSCAN_predicted_peptide_8|475_aa MRDSDLPGSRWSYMFNKHCKLSRYNCSKVYGLRTGRETVFLFSKMYRTKVGLKDRQQLYK LIISQLLYDGYISIANGLINEIKPQSVCAPSEQLLHLIKLGMENDDTAVQYAIGRSDTVA PGTGIDLEFDADVQTMSPEASEYETCYVTSHKGPCRVATYSRDGQLIATGSADASIKILD TERMLAKSAMPIEVMMNETAQQNMENHPVIRTLYDHVDEVTCLAFHPTEQILASGSRDYT LKLFDYSKPSAKRAFKYIQEAEMLRSISFHPSGDFILVGTQHPTLRLYDINTFQCFVSCN PQDQHTDAICSVNYNSSANMYVTGSKDGCIKLWDGVSNRCITTFEKAHDGAEVCSAIFSK NSKYILSSGKDSVAKLWEISTGRTLVRYTGAGLSGRQVHRTQAVFNHTEDYVLLPDERTI SLCCWDSRTAERRNLLSLGHNNIVRCIVHSPTNPGFMTCSDDFRARFWYRRSTTD >gi568815578f:56259006_56466284|GENSCAN_predicted_CDS_8|1428_bp atgagagattctgatttaccgggctctcggtggagctatatgtttaacaagcactgcaag ttatcccggtacaactgctccaaggtctacggtctgagaactggcagggaaactgtcttc cttttctccaagatgtacagaaccaaagtgggcttgaaggaccgccagcagctctacaag ctgatcattagccagctgctatatgacggctacatcagcatcgccaatggcctcatcaat gaaatcaagcctcagtctgtgtgtgcaccctcggagcagctcctgcatctcatcaaactc ggaatggaaaacgatgacaccgcagttcagtatgcaattggtcgttcagatactgttgcc cctggcacagggattgacctggaatttgatgcagatgttcagactatgtccccagaggct tctgagtacgaaacatgctatgtcacatcacataaaggaccatgccgtgtagctacctat agtagagatggacagttaatagctactgggtctgctgatgcttcgataaagatacttgac acagagaggatgttggccaaaagtgccatgccaatagaggtcatgatgaatgagaccgca caacaaaatatggaaaaccacccagtgattcgaactctttatgaccatgtggatgaagtc acgtgccttgctttccacccaacagaacagatcctggcttctggttcaagggattatact cttaaattatttgattattccaaaccatcagcaaaaagagccttcaaatacattcaggaa gctgaaatgttacgttccatctcttttcatccttctggagactttatacttgtcggaact cagcatcctactcttcgcctttatgatatcaacacctttcaatgttttgtctcttgcaat cctcaagatcaacacaccgatgctatatgttccgttaattacaattctagtgccaatatg tacgtaactggaagcaaggacggctgcatcaaattatgggatggtgtttcaaatcgatgc atcacaacttttgagaaagcacatgacggtgctgaagtttgttctgccattttttccaaa aattctaaatacattctctcaagtggaaaagactctgtagctaaactttgggaaatatca acgggacgaacactggtcagatacacgggcgcgggtttaagtggacgccaggtgcaccgg acacaggctgtgtttaaccacaccgaggactatgtgttgctgcccgacgagaggacgatc agtctttgctgctgggactcgaggacagccgagcggagaaacctgctgtcgttggggcac aacaatattgtacgctgcatagtgcactcccccaccaaccccgggttcatgacgtgcagc gatgacttcagagcgcggttttggtaccggagatcgaccactgactga >gi568815578f:56259006_56466284|GENSCAN_predicted_peptide_9|767_aa MKGTGIMDCAPKALLARALYDNCPDCSDELAFSRGDILTILEQHVPESEGWWKCLLHGRQ GLAPANRLQILTEVAADRPCPPFLRGLEEAPASSEETYQVPTLPRPPTPGPVYEQMRSWA EGPQPPTAQVYEFPDPPTSARIICEKTLSFPKQVYDVPTQHRGPVVLKEPEKQQLYDIPA SPKKAGLHPPDSQASGQGVPLISVTTLRRGGYSTLPNPQKSEWIYDTPVSPGKASVRNTP LTSFAEESRPHALPSSSSTFYNPPSGRSRSLTPQLNNNVPMQKKLSLPEIPSYGFLVPRG TFPLDEDVSYKVPSSFLIPRVEQQNTKPNIYDIPKATSSVSQAGKELEKAKEVSENSAGH NSSWFSRRTTSPSPEPDRLSGSSSDSRASIVSSCSTTSTDDSSSSSSEESAKELSLDLDV AKETVMALQHKVVSSVAGLMLFVSRKWRFRDYLEANIDAIHRSTDHIEESVREFLDFARG VHGTACNLTDSNLQNRIRDQMQTISNSYRILLETKESLDNRNWPLEVLVTDSVQNSPDDL ERFVMVARMLPEDIKRFASIVIANGRLLFKRNCEKEETVQLTPNAEFKCEKYIQPPQRET ESHQKSTPSTKQREDEHSSELLKKNRANICGQNPGPLIPQPSSQQTPERKPRLSEHCRLY FGALFKAISAFHGSLSSSQPAEIITQSKLVIMVGQKLVDTLCMETQERDVRNEILRGSSH LCSLLKDVALATKNAVLTYPSPAALGHLQAEAEKLEQHTRQFRGTLG >gi568815578f:56259006_56466284|GENSCAN_predicted_CDS_9|2304_bp atgaagggaacaggcatcatggactgtgcgcccaaggcactcctggccagggcactttat gacaactgccctgactgctctgacgagctggctttcagcagaggggacatcctgaccatt ctggagcaacacgtgccagaaagcgagggttggtggaagtgtttgctccatgggaggcaa ggcctggcccctgccaaccgcctccaaatcctcacggaggtcgctgcagacaggccgtgc cccccattcctgagaggcctggaagaagctcctgccagctcagaggagacctatcaggtg cccactctaccccgccctcccactccaggccccgtttatgagcagatgaggagttgggcg gaggggccccagccccctactgcccaagtctatgaattccccgaccctcccaccagtgcc agaatcatctgtgaaaagactctcagctttccaaaacaggtgtatgacgtgcctacccag caccggggccccgtggtcctgaaggagccagagaagcagcagttatatgacataccagcc agccccaagaaggcaggactccatcccccagacagccaagcaagtgggcagggtgttccc ctgatatcagtgactaccttaagaagaggcggttacagcacattaccaaatcctcagaaa tcggaatggatttatgacactccagtgtctccaggaaaggccagcgtcagaaacacgcct ctcaccagctttgcggaagaatcaaggccccacgctctccccagttccagctccactttc tacaatcctccaagtggcagatccaggtccctcactccacaactgaataacaatgtgccc atgcagaaaaaactcagccttccagaaattccttcttatggctttcttgtacccagaggc acatttcctttggatgaagatgtcagctacaaggttccttcaagctttctgattccccga gtggaacagcagaacaccaagcccaatatttatgacatccctaaagcaacgtcgagtgtt tctcaggctgggaaggagctggagaaagccaaggaggtgtcagagaattccgcgggccat aattcctcatggttctccagacggacaacttccccatctcctgaaccggacagattatca ggttccagttctgacagcagagctagcatcgtttcctcgtgctccaccacatccaccgac gactcctccagctcttcctcggaggagtcagcaaaggagctctccttggacctggatgtg gccaaggagacagtgatggctctgcagcacaaggtggtcagctctgtcgctggcctgatg ctctttgtcagcaggaagtggagattccgagactatctggaggccaacattgatgcaatc cacaggtccactgatcacatagaagaatctgtaagagaatttctggattttgcccgagga gtccatgggactgcctgtaacctcactgacagtaaccttcagaacagaattcgggaccag atgcagaccatctccaactcctaccgcatcctgcttgaaacaaaggaaagcttggataat cgcaattggcctctggaagttcttgtgactgacagtgtccagaacagcccagatgacctt gagaggtttgtcatggtggcacggatgcttccagaagacatcaagaggtttgcctccatt gtcattgccaatggaaggctcctttttaagcggaactgtgaaaaggaagagactgtgcag ttgaccccaaatgcagaatttaagtgtgaaaaatacatccagcctccccaaagagaaact gaatcacaccaaaagagtaccccttccactaagcaaagggaagatgaacactcttctgaa ctattaaagaaaaatagggcaaatatctgtggacagaatcctggccctcttatacctcag ccttcgagtcaacagactcctgagaggaaaccccgcttatctgaacactgccggctctac tttggggcgctcttcaaagccatcagcgcatttcacggcagcctcagcagcagccagccc gcggagatcatcactcagagcaagctggtcatcatggtgggacagaagctggtggacacg ctgtgcatggagacccaggagagggacgtgcgcaacgagatcctccgtggcagcagtcac ctctgcagcctgctcaaggacgtagcgctggccactaagaatgccgtgctcacgtacccc agccctgccgcgctggggcacctccaggcggaggctgagaagctggagcaacacacgcgg cagttcagagggacactgggatga