GENSCAN 1.0    Date run: 4-Nov-116    Time: 10:23:15

Sequence gi568815583f:66966155_67290533 : 324379 bp : 44.65% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -     88     83    6                               1.05
 1.07 Term -   1473   1177  297  0  0   32   49   168 0.392   2.57
 1.06 Intr -   4631   4284  348  2  0   94  115   133 0.450  12.15
 1.05 Intr -   9883   9811   73  0  1   90   64    39 0.390   1.11
 1.04 Intr -  11945  11867   79  0  1   68   88    32 0.647  -0.19
 1.03 Intr -  21108  21004  105  1  0   95   68    40 0.162   2.89
 1.02 Intr -  34930  34664  267  2  0   62   11   170 0.001   4.20
 1.01 Init -  54712  54580  133  1  1   78   47    83 0.011   3.50
 1.00 Prom -  54836  54797   40                              -2.46

 2.00 Prom +  62324  62363   40                              -3.96
 2.01 Init +  66860  66927   68  1  2   61  119    51 0.856   6.04
 2.02 Term +  67325  67445  121  2  1   55   39    99 0.671  -0.35
 2.03 PlyA +  70746  70751    6                               1.05

 3.00 Prom +  71639  71678   40                              -3.26
 3.01 Init +  78116  78250  135  1  0   51   53    94 0.031   2.58
 3.02 Intr +  98700  98874  175  2  1   47   62    59 0.121  -1.29
 3.03 Intr +  99999 100206  208  1  1   67   75   455 0.440  40.24
 3.04 Intr + 114582 114666   85  0  1   50   75    48 0.070  -0.48
 3.05 Term + 114756 114866  111  2  0  109   48    88 0.648   5.46
 3.06 PlyA + 117700 117705    6                               1.05

 4.12 PlyA - 124825 124820    6                               1.05
 4.11 Term - 125798 125717   82  1  1  105   41    28 0.361  -3.03
 4.10 Intr - 126494 126312  183  1  0   80   94    32 0.420   1.90
 4.09 Intr - 127996 127819  178  2  1   80   64    72 0.128   2.98
 4.08 Intr - 139299 139207   93  1  0   94   69    84 0.822   7.04
 4.07 Intr - 150532 150380  153  2  0   77   80    19 0.030   0.04
 4.06 Intr - 160094 160031   64  2  1   80   82    60 0.360   2.89
 4.05 Intr - 161149 161070   80  2  2  114   61     5 0.764  -0.33
 4.04 Intr - 164291 164154  138  0  0   46  109    87 0.175   7.04
 4.03 Intr - 171756 171560  197  2  2   88   93    38 0.114   3.36
 4.02 Intr - 175736 175661   76  0  1   86   99    13 0.041   0.77
 4.01 Init - 191653 191593   61  1  1   87   67    80 0.395   7.21
 4.00 Prom - 192849 192810   40                              -7.56

 5.00 Prom + 193349 193388   40                              -6.06
 5.01 Init + 195899 195921   23  1  2   74   82    13 0.414  -1.55
 5.02 Intr + 198741 198934  194  0  2  124   85   281 0.970  30.54
 5.03 Intr + 199099 199230  132  2  0   80   96   149 0.935  15.52
 5.04 Intr + 200625 200699   75  1  0   78   82   127 0.994  10.59
 5.05 Intr + 204400 204534  135  2  0   85  116    13 0.379   4.34
 5.06 Intr + 215087 215299  213  0  0   95   64   355 0.165  32.49
 5.07 Intr + 218573 218710  138  0  0   34   97   155 0.998  11.44
 5.08 Intr + 219082 219169   88  2  1   61   80    69 0.681   2.43
 5.09 Intr + 220866 220976  111  0  0   63   91    35 0.327   0.79
 5.10 Intr + 221188 221355  168  1  0   70   82   267 0.346  23.36
 5.11 Intr + 224259 224371  113  0  2  119   28    84 0.973   5.52
 5.12 Intr + 224988 225187  200  1  2   87  127    45 0.952   7.37
 5.13 Term + 232179 232364  186  2  0   93   54    34 0.083  -2.01
 5.14 PlyA + 234833 234838    6                               1.05

 6.10 PlyA - 235543 235538    6                               1.05
 6.09 Term - 236744 236667   78  2  0  119   38    79 0.889   3.76
 6.08 Intr - 242504 242408   97  1  1   65  106    33 0.307   2.81
 6.07 Intr - 254386 254335   52  2  1  112   79    28 0.337   2.27
 6.06 Intr - 265743 265660   84  1  0   61  115    90 0.971   8.79
 6.05 Intr - 269914 269825   90  2  0   53  115    73 0.988   6.47
 6.04 Intr - 270350 270254   97  2  1  104   91    34 0.998   4.88
 6.03 Intr - 270666 270476  191  1  2   55   71   110 0.976   5.30
 6.02 Intr - 281109 280881  229  0  1   53  113   165 0.476  12.94
 6.01 Init - 298074 298042   33  0  0   72   92    45 0.344   3.17
 6.00 Prom - 305883 305844   40                              -4.06

 7.03 PlyA - 305899 305894    6                               1.05
 7.02 Term - 307956 307849  108  0  0   19   46   132 0.798   0.61
 7.01 Init - 311216 311037  180  2  0   57   75    86 0.496   3.38
 7.00 Prom - 319428 319389   40                              -1.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 100678 100732 55 1 1 104 48 58 0.849 0.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:66966155_67290533|GENSCAN_predicted_peptide_1|433_aa MEYYAAIKNDEFMSFVGTWMKLETIILSKLLQGQNTKHRMFSLIVPKRKAEGDAKADRAK MKDKPQRRSTRFSTKPAPPKPEPKPKKAPAKKGEKVPKGKREKLRLARRGITLQKMEMPK QTRHRKLKVLQMPKLDLYEKHSFVFSFLSPHPEWPEPKPAPSEKSLPPERAYKKICLSSP ISTLDITIYHYECKRIGGKYWAPVGRLKMISVAARIKTQCWDYRHKRLCVACTCATLNPG TFLPDNEEKIEHNCQQVIAQTYTTRGELLEVPLTDPDLNLYTDGSSFVEKGLGKVGYAVV SNNGILESNPLTPGTSAQLAELIALTRALELEGKKKERNLLRGHESRHQLSAAAKSMRRN VDKSQPLINRSNTKRGTSDIECITTIDSYQNNEPEFDHASDITTSFQEIQRIEEMLNKTI IQQPAKTRMFEVL >gi568815583f:66966155_67290533|GENSCAN_predicted_CDS_1|1302_bp atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg aagctggaaaccatcattctcagcaaactattgcaaggacaaaacaccaaacaccgcatg ttctcactcatagtgcccaagagaaaggctgaaggtgatgctaaagcagatagagccaag atgaaggacaaaccacagagaagatccacgaggttttctactaaacctgctcctccaaag ccggagcccaaacctaaaaaggcccctgcaaagaagggagagaaggtaccaaaagggaaa agggaaaagctgaggctggccaggaggggaataaccctgcagaaaatggagatgcccaaa cagaccagacacagaaagctgaaggtgctgcagatgccaaaattagacctttacgagaag cacagttttgtcttcagtttcctctccccccacccggaatggcctgagccgaagccagct ccctccgagaagtcattaccgcctgaaagagcatacaagaaaatctgtttatcctcacca atatcaacattagatattaccatttatcattacgaatgtaagagaattggaggaaaatac tgggcacctgttggccggttaaaaatgattagcgtggccgccagaattaagactcagtgc tgggattacaggcataagcgactgtgcgtggcctgcacttgtgcaactcttaacccagga acatttcttccagacaatgaagaaaagatagaacataactgtcaacaggtgattgctcaa acctacaccactcgaggggaacttctagaggttcccttgactgatcctgacctcaacttg tatactgatgggagttcctttgtagaaaaaggacttggaaaagtggggtatgcagtggtc agtaataatggaatacttgaaagtaatcctctcactccaggaactagcgctcagctggca gaactaatagccctcactcgggcactagaattagaaggaaaaaagaaggaaaggaattta ttaagaggccatgagtctaggcatcaattatcagcagctgctaaatccatgaggagaaat gttgataagtctcaacccctgatcaatcgtagcaacacgaaaagagggacatctgacatt gaatgcatcactacgatagattcttaccagaataatgaacctgaatttgatcatgcctct 
gacataactaccagtttccaggaaatacagaggatagaggagatgttaaataaaaccata atacagcaaccagccaagaccagaatgtttgaagttctctag >gi568815583f:66966155_67290533|GENSCAN_predicted_peptide_2|62_aa MEVPSPPGPFVHCFVCDAKKAVWAAESTDSPKEKGQVGRRRQTFAELPPRRKSTRGPWAA EP >gi568815583f:66966155_67290533|GENSCAN_predicted_CDS_2|189_bp atggaagttccgtcaccacctggaccttttgttcactgctttgtctgcgatgccaagaaa gctgtctgggcagcagaaagcacggactctccgaaggagaaaggccaggtgggaaggagg cgccagaccttcgccgaattacctccccggagaaaatccacgcggggaccttgggcagca gaaccctaa >gi568815583f:66966155_67290533|GENSCAN_predicted_peptide_3|237_aa MIHGSNLAKPCSVACHDLVQAHAVALLPTEHSGLERRLTLERAKENSTDVLEAHFLPGTR FTRLKRPLAPAIVQVVTGVPSSPLLRTHFSPPPPPHTFPRLSAAMSSILPFTPPIVKRLL GWKKGEQNGQEEKWCEKAVKSLVKKLKKTGQLDELEKAITTQNVNTKCITIPSAWNVPGP LRSKHVAPNNPSLPTEFCREHIIYWKSGEGLEFSQLSNQTVAVGEAMSGELQGEQLE >gi568815583f:66966155_67290533|GENSCAN_predicted_CDS_3|714_bp atgatccacggcagtaacctggccaagccctgctcagtggcctgccatgatctggtccag gcccacgctgttgccctcctgcccaccgaacattcaggactggagaggaggctcaccctg gagcgggctaaggaaaactcaacagatgtccttgaggctcattttctgccgggcactcgg ttcacaaggctcaagagacccctggctccagccattgtccaggttgtcactggtgtcccc agctctccacttctgaggacccacttctcaccccctcctcctccgcacacgttcccgagg ctctcggccgccatgtcgtccatcctgcctttcactcccccgatcgtgaagcgcctgctg ggctggaagaagggcgagcagaacgggcaggaggagaaatggtgcgagaaggcggtcaag agcctggtcaagaaactcaagaagacggggcagctggacgagctggagaaggccatcacc acgcagaacgtcaacaccaagtgcatcaccatccccagtgcatggaatgtgccagggccg ctgcggagcaagcacgtggccccgaataacccctccctgccaacggaattctgcagagaa catataatttattggaagtctggagagggacttgagttctctcaactctccaaccagact gttgctgtgggcgaggccatgtcgggtgagctccaaggagagcagttggaatga >gi568815583f:66966155_67290533|GENSCAN_predicted_peptide_4|434_aa MVVRATAKDVAMAEPKADGEGKLSWPQLRVCIGFSGALNPKFSCRRRALATPRQKKSKGR TVLCKLVSNSSRKHQDILTCHSDPMGLLARLPEKIRLERNDAPQTKLVINKGGGALAVML TCPPLTYYYETRFLTGRTRVSVHGPGVGDPWCKRRRSGGGTAETEGGDRKTKHLVDSSHS TSDRSILTPGLTPRKMVPRRKEGYGGPTGFEEHFHTCSLSLSEVLQHLLSRLIAEPLGMI PIIITAAPLYHTLLQPGALLTQQPGIFSASWTHRIQLKVQGFHCGRGGHPTQASGGPTAC 
PEDPSTGLVVKGLEHWKCGPGKHRGHTKSPKTENAIRVQIPCLCPLGTLLSILPPASTRC IIAYNHDCQAQMHISSPRRVYKIQTAGTALRVSDPRGLGAGPETLHSYLKAAASSKFWVT SPSHCGFSAALSPV >gi568815583f:66966155_67290533|GENSCAN_predicted_CDS_4|1305_bp atggtggtgagagccacagccaaggatgtggccatggcagagcccaaagccgatggtgaa ggcaaactgagttggcctcaacttagagtttgtatcggcttttctggggcactcaacccc aagttcagctgcaggagaagagccctggcaacacccagacagaagaaatccaaagggagg acagtactttgcaagcttgtttcaaactcttcaagaaagcaccaagacattttgacctgc cattctgaccctatgggccttctggccagattaccagagaaaattcggcttgagagaaat gatgcaccccaaacaaagctggtgatcaacaaaggaggcggagctctggccgtaatgctc acttgcccaccactcacctactactacgagacccggttcctaacaggccgcacacgggta tcagtccacggccccggggttggggacccctggtgtaagagaaggaggtcagggggaggc actgcagaaacagagggaggagacagaaaaacaaaacacctggtagacagttcacattcg acctcagacaggagcatcctaacacctggcctgacacctcgcaagatggtgccaagacgc aaagaaggctatggaggtcccacaggctttgaggagcacttccacacctgtagcctctcc ttgtctgaagtcctgcagcacttactgtcaagactaattgcagaaccactaggaatgata cccataataatcacagctgctcccctttaccacacgcttctccagccaggggctctgctt acccaacagcccggcatcttctcagcatcctggactcaccgcatacagctcaaggttcaa ggtttccactgtggccgtggaggccaccccacgcaggcctcaggtggtcccactgcctgc cctgaggacccctccacaggattggtggtcaagggcctggaacactggaagtgtgggcct gggaagcaccgtggtcacacaaagagccctaaaacagaaaatgcaatccgggttcagatt ccttgcctgtgtcccctcggcaccctcctcagtatcctgcccccagcttccacacggtgc atcattgcctataatcatgattgtcaagctcaaatgcacatcagcagccctcggagagtg tacaaaattcagactgctgggaccgccctcagggtttctgatccaagaggcctgggggcg gggcctgagactctgcattcctacctaaaggcagcggcttcctccaagttctgggtcacc tcaccttcccattgtggcttctcagctgctttatcacctgtgtaa >gi568815583f:66966155_67290533|GENSCAN_predicted_peptide_5|591_aa MPLVSSSRSLDGRLQVSHRKGLPHVIYCRLWRWPDLHSHHELRAMELCEFAFNMKKDEVC VNPYHYQRVETPVLPPVLVPRHTEIPAEFPPLDDYSHSIPENTNFPAGIEPQSNIPETPP PGYLSEDGETSDHQMNHSMDAGSPNLSPNPMSPAHNNLGEYLLVHTTGTPSSCSPGESPV WGGGPEDLQPVTYCEPAFWCSISYYELNQRVGETFHASQPSMTVDGFTDPSNSERFCLGL LSNVNRNAAVELTRRHIGRGVRLYYIGGEVFAECLSDSAIFVQSPNCNQRYGWHPATVCK IPPDLTGLQSTEDKDTDEFNIRDGCQAPSQQPRKGPPLHTISLFLEALFADFSLLWESEL 
VGQPSISPCSPVSVFLAGCNLKIFNNQEFAALLAQSVNQGFEAVYQLTRMCTIRMSFVKG WGAEYRRQTVTSTPCWIELHLNGPLQWLDKVLTQMGSPSIRCSKHTDWEVRVQQNLHTGQ REKSMSATSLKTHLRLSFFTLKSWKDLLRPSAYAMYSVYYHINLKEIRMTLDPSSIISPK SAVLKVGPRTEPAASVITQMLKMQILRPHARPTEAATLEWGPPGDCDIAHF >gi568815583f:66966155_67290533|GENSCAN_predicted_CDS_5|1776_bp atgcccttagtcagcagctccaggtccctggatggccggttgcaggtgtcccatcggaag gggctccctcatgtcatctactgccgcctgtggcgatggccagacctgcacagccaccac gagctacgggccatggagctgtgtgagttcgccttcaatatgaagaaggacgaggtctgc gtgaatccctaccactaccagagagtagagacaccagttctacctcctgtgttggtgcca cgccacacagagatcccggccgagttccccccactggacgactacagccattccatcccc gaaaacactaacttccccgcaggcatcgagccccagagcaatattccagagaccccaccc cctggctacctgagtgaagatggagaaaccagtgaccaccagatgaaccacagcatggac gcaggttctccaaacctatccccgaatccgatgtccccagcacataataacttgggtgag tatctccttgtgcacacaactggaaccccctctagctgcagccctggcgagtcgccagtg tggggagggggccctgaagacctgcagccagttacctactgcgagccggccttctggtgc tccatctcctactacgagctgaaccagcgcgtcggggagacattccacgcctcgcagcca tccatgactgtggatggcttcaccgacccctccaattcggagcgcttctgcctagggctg ctctccaatgtcaacaggaatgcagcagtggagctgacacggagacacatcggaagaggc gtgcggctctactacatcggaggggaggtcttcgcagagtgcctcagtgacagcgctatt tttgtccagtctcccaactgtaaccagcgctatggctggcacccggccaccgtctgcaag atcccaccagaccttacaggcctgcagagcacagaagacaaagacacggatgagttcaat ataagggacgggtgccaagcaccaagccagcagcccaggaaaggcccccctctccacacc atctccctatttttggaagccctctttgcagacttctccctgctctgggagtcggagctt gtgggccagccttccatctccccttgcagccctgtttctgtgtttttggcaggatgcaac ctgaagatcttcaacaaccaggagttcgctgccctcctggcccagtcggtcaaccagggc tttgaggctgtctaccagttgacccgaatgtgcaccatccgcatgagcttcgtcaaaggc tggggagcggagtacaggagacagactgtgaccagtaccccctgctggattgagctgcac ctgaatgggcctttgcagtggcttgacaaggtcctcacccagatgggctccccaagcatc cgctgttccaaacatactgattgggaggtgcgtgttcagcagaacctgcacacaggacag cgggaaaaatcgatgagcgccacctctttaaaaactcacttacgtttgtcctttttcact ttgaaaagttggaaggatctgctgaggcccagtgcatatgcaatgtatagtgtctattat cacattaatctcaaagagattcgaatgacgctcgacccaagcagcattatttcccctaag 
tcagcggttctcaaagtgggtcccaggaccgaaccagcagcatcggttatcacacagatg ttaaagatgcaaattctcaggccccacgccaggcctactgaagccgcaactctggagtgg ggccctccaggtgactgtgacatagctcacttctga >gi568815583f:66966155_67290533|GENSCAN_predicted_peptide_6|316_aa MTDVYVCKEVQWVLGLIDFKNEAVNPRCECPEFLPSGGFVVSLTSGVKLQTFAVSVTALK GGVSGVVCSSQWVLGLADFRNEAADPHDILGTEDLIVEVTSNDAVRFYPWTIDNKYYSAD INLCVVPNKFLVTAEIAESVQAFVVYFDSTQKSGLDSVSSWLPLAKAWLPEVMILVCDRV SEDGINRQKAQEWCIKHGFELVELSPEELPEEDDDFPESTGVKRIVQALNANVWSNVVMK NGGEKQDPKRKSNMLIFTHNPICQQQIVLNPSLIIGVVHLTQQMPRLIALWVAKAFWMAI GGDRDEIEGLSSDEEH >gi568815583f:66966155_67290533|GENSCAN_predicted_CDS_6|951_bp atgacagatgtttatgtctgcaaggaggtccagtgggttcttggtctcattgacttcaag aatgaagccgtgaaccctcgttgtgagtgtccggagtttcttccttctggtgggttcgtg gtctcgctgacttcaggagtgaagctgcagaccttcgccgtgagtgttacagctcttaaa ggtggtgtgtccggagttgtttgttcctcccagtgggttcttggtcttgctgacttcagg aatgaagctgcagaccctcacgatatccttggaacagaagatcttattgtggaagtgact tccaatgatgctgtgagattttatccctggaccattgataataaatactattcagcagac atcaatctatgtgtggtgccaaacaaatttcttgttactgcagagattgcagaatctgtc caagcatttgtggtttactttgacagcacacaaaaatcgggccttgatagtgtctcctca tggcttccactggcaaaagcatggttacctgaggtgatgatcttggtctgcgatagagtg tctgaagatggtataaaccgacaaaaagctcaagaatggtgcatcaaacatggctttgaa ttggtagaacttagtccagaggagttgcctgaggaggatgatgacttcccagaatctaca ggagtaaagcgaattgtccaagccctgaatgccaatgtgtggtccaatgtagtgatgaag aatggtggggaaaaacaggacccaaagaggaagagtaacatgctcatattcacacacaac cccatttgccagcagcagatagtactgaatccctctctgatcatcggggtggtgcatcta acacaacagatgcccaggttgatagcattgtgggtggccaaagcattctggatggcaatc gggggagacagagatgaaattgaaggcctttcatctgatgaagagcactga >gi568815583f:66966155_67290533|GENSCAN_predicted_peptide_7|95_aa MRHAKRQEKKQSEETKRISEPDSAMPEILELSDQKFKVTLINRLRALMKKVDNMQDKMGN VKDKEKILKTAREKKQIAHKGALINLANDFSMETI >gi568815583f:66966155_67290533|GENSCAN_predicted_CDS_7|288_bp atgaggcatgctaaaaggcaagaaaaaaaacagtctgaagagacaaagcgaatatcagaa ccagattcagctatgccagagattttggaattatcagaccagaaatttaaagtaactctg attaataggctaagggcactaatgaaaaaagtggacaacatgcaagacaagatgggtaat 
gtcaaggacaaagagaagatcctgaaaacagcaagagagaagaagcaaatagcacataaa ggagctctaattaatctggcaaacgacttctcaatggaaacgatataa
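
The exon table and the CDS records above can be cross-checked against each other: for each gene, the lengths of its coding exons should add up to the length given in the matching CDS header (for gene 2, 68 + 121 = 189 bp). The following is a minimal Python sketch of that check, not part of the GENSCAN output. The function names (parse_exon_row, cds_lengths) and the GENE_2_ROWS / GENE_7_ROWS constants are hypothetical, introduced here for illustration only; the rows are copied from the table above, and the sketch assumes whitespace-separated columns in the order Gn.Ex, Type, S, Begin, End, Len.

```python
from collections import defaultdict

# Exon types that contribute to the coding sequence; Prom and PlyA rows do not.
CODING_TYPES = {"Init", "Intr", "Term", "Sngl"}


def parse_exon_row(row):
    """Parse the first six columns of one exon-table row into a dict."""
    fields = row.split()
    gene, _, exon = fields[0].partition(".")
    return {
        "gene": int(gene),
        "exon": int(exon),
        "type": fields[1],
        "strand": fields[2],
        "begin": int(fields[3]),
        "end": int(fields[4]),
        "length": int(fields[5]),
    }


def cds_lengths(rows):
    """Sum coding-exon lengths per gene, validating each row's Len column."""
    totals = defaultdict(int)
    for row in rows:
        rec = parse_exon_row(row)
        # On the minus strand Begin > End, so use the absolute difference.
        span = abs(rec["end"] - rec["begin"]) + 1
        if span != rec["length"]:
            raise ValueError(f"length mismatch in row: {row!r}")
        if rec["type"] in CODING_TYPES:
            totals[rec["gene"]] += rec["length"]
    return dict(totals)


# Rows copied from the table above (gene 2, plus strand; gene 7, minus strand).
GENE_2_ROWS = [
    "2.00 Prom + 62324 62363 40 -3.96",
    "2.01 Init + 66860 66927 68 1 2 61 119 51 0.856 6.04",
    "2.02 Term + 67325 67445 121 2 1 55 39 99 0.671 -0.35",
    "2.03 PlyA + 70746 70751 6 1.05",
]
GENE_7_ROWS = [
    "7.03 PlyA - 305899 305894 6 1.05",
    "7.02 Term - 307956 307849 108 0 0 19 46 132 0.798 0.61",
    "7.01 Init - 311216 311037 180 2 0 57 75 86 0.496 3.38",
    "7.00 Prom - 319428 319389 40 -1.86",
]

if __name__ == "__main__":
    print(cds_lengths(GENE_2_ROWS + GENE_7_ROWS))
```

The two totals agree with the 189_bp and 288_bp values in the GENSCAN_predicted_CDS_2 and GENSCAN_predicted_CDS_7 headers. Only the first six columns are parsed, so the same function handles Prom and PlyA rows, which leave the Fr through P.... columns empty.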