GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:19:15 Sequence gi568815594f:40686112_40886708 : 200597 bp : 42.01% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4964 5095 132 1 0 66 93 95 0.444 7.99 1.02 Term + 9895 10062 168 0 0 106 42 140 0.957 8.10 1.03 PlyA + 12468 12473 6 1.05 2.00 Prom + 14363 14402 40 -0.75 2.01 Init + 33063 33228 166 2 1 34 91 96 0.332 4.35 2.02 Intr + 45643 45789 147 2 0 17 53 145 0.031 3.19 2.03 Term + 48166 48305 140 2 2 92 43 74 0.051 0.44 2.04 PlyA + 48376 48381 6 1.05 3.00 Prom + 59756 59795 40 -4.15 3.01 Init + 64583 64880 298 1 1 91 82 394 0.795 36.53 3.02 Intr + 74323 74381 59 2 2 102 92 18 0.397 1.28 3.03 Intr + 75060 75190 131 2 2 55 89 51 0.372 0.47 3.04 Intr + 76648 76704 57 1 0 71 77 60 0.385 0.28 3.05 Term + 76873 77002 130 1 1 89 41 97 0.701 1.77 3.06 PlyA + 78420 78425 6 1.05 4.02 PlyA - 79217 79212 6 1.05 4.01 Sngl - 81218 79497 1722 2 0 66 32 636 0.804 50.45 4.00 Prom - 96092 96053 40 -4.35 5.00 Prom + 97829 97868 40 -7.05 5.01 Init + 100001 100630 630 1 0 69 33 273 0.187 15.30 5.02 Intr + 102489 102654 166 2 1 40 70 86 0.205 0.61 5.03 Intr + 104491 104634 144 2 0 105 75 38 0.910 3.53 5.04 Intr + 108264 108365 102 1 0 54 73 104 0.939 4.73 5.05 Intr + 112676 112793 118 0 1 51 110 75 0.940 4.60 5.06 Intr + 120950 121073 124 2 1 119 94 -5 0.996 2.87 5.07 Term + 122196 122828 633 2 0 50 38 384 0.999 22.90 5.08 PlyA + 126989 126994 6 1.05 6.14 PlyA - 129230 129225 6 1.05 6.13 Term - 130148 129981 168 2 0 89 37 188 0.995 10.70 6.12 Intr - 135939 135760 180 0 0 72 101 223 0.798 21.14 6.11 Intr - 137648 137533 116 0 2 121 100 198 0.998 23.55 6.10 Intr - 138261 138187 75 1 0 104 51 42 0.453 0.67 6.09 Intr - 139859 139776 84 0 0 43 88 67 0.603 1.07 6.08 Intr - 141108 141021 88 0 1 84 115 130 0.896 13.92 6.07 Intr - 143245 143153 93 1 0 87 61 63 0.087 2.74 6.06 Intr - 143614 143447 168 1 0 58 84 49 0.042 0.72 6.05 Intr - 144466 144352 115 0 1 81 116 67 0.117 8.33 6.04 Intr - 146124 146069 56 0 2 77 17 46 0.030 -6.74 6.03 Intr - 146567 146347 221 0 2 49 33 169 0.005 4.60 6.02 Intr - 157176 157002 175 0 1 93 46 128 0.087 7.69 6.01 Init - 162799 162608 192 1 0 37 40 138 0.121 2.93 6.00 Prom - 165619 165580 40 -4.15 7.06 PlyA - 166361 166356 6 1.05 7.05 Term - 167936 167764 173 0 2 56 36 142 0.926 2.81 7.04 Intr - 169111 169024 88 1 1 57 60 159 0.504 8.62 7.03 Intr - 171026 170915 112 1 1 1 100 116 0.596 3.56 7.02 Intr - 173920 173739 182 1 2 7 59 125 0.005 -0.76 7.01 Init - 186451 186371 81 1 0 98 95 89 0.538 11.62 7.00 Prom - 190326 190287 40 -4.15 8.03 PlyA - 192562 192557 6 1.05 8.02 Term - 196574 196114 461 0 2 44 42 230 0.540 8.27 8.01 Init - 199458 199356 103 0 1 71 91 26 0.465 1.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 64178 63621 558 2 0 42 41 291 0.863 15.69 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:40686112_40886708|GENSCAN_predicted_peptide_1|99_aa MGKEVEDGRDWPQFLELAPKEGARKLLHKVAEHEVLVLVLQLQLPQWSSFGASRYNAFPQ ILSSSCYILRYVCMVYPSILYPRAPETVELVNSSITCGT >gi568815594f:40686112_40886708|GENSCAN_predicted_CDS_1|300_bp atggggaaagaagtagaagatgggagagattggccacaatttttggaacttgccccgaag gaaggggccaggaagcttttgcacaaggtagcagaacatgaagtgctggttctggttctc caactacagctgccacagtggtcctcttttggtgcttcccggtacaatgccttcccccag atcctttcttcttcctgttatattcttcgctatgtctgcatggtttatcccagtatcctg tacccgagagctccagaaactgttgaattagttaattcatcaattacatgtggtacataa >gi568815594f:40686112_40886708|GENSCAN_predicted_peptide_2|150_aa MLTSVSAQTTVIVPTTPLAGQCEERHKKNEHFSVCYLFIFPHIQIPQEWKKQGGNAKCVE NIFRPQHTGSPSGDRQCLWVEDIQLQVQDSQVQDQKAVQRDSSGDDQGREKDSRDIEGEK RKAFTDRPWKAFLVSVVLSTYFMLDREKVE >gi568815594f:40686112_40886708|GENSCAN_predicted_CDS_2|453_bp atgctcacaagtgtatcagctcaaaccactgtaattgttcctacaactccccttgctggc caatgtgaagagcgacacaagaaaaatgaacacttctcagtctgttacctcttcatcttc cctcatatccaaataccccaagaatggaaaaaacaagggggaaatgctaaatgtgtagag aatattttcaggccccagcacactgggagcccaagtggagaccggcagtgtctctgggtc gaggacattcagttgcaagtccaggacagccaagtacaggatcagaaagctgtgcagaga gacagttctggagatgatcaagggagagaaaaggatagtagggacatagaaggagagaaa agaaaagcattcacggacaggccatggaaggcgtttcttgtttcagtagtattaagtacc tatttcatgttggacagagaaaaagttgagtga >gi568815594f:40686112_40886708|GENSCAN_predicted_peptide_3|224_aa MLNSTGELEFSNEEDPEIISQLTSLPLSGGKSSAGVPEKTGYPDSVYVMAANIFQGIRIE KSAQKVLIKYGNEPLRSLSESEDQSFQRLSYELAFSALKYQDILETILIDSCIFPSTTIP DHLSSLIIVMLYDFQDRKFQTRVLSDNEEPISEVQEVENLLNRLAAALDSHRSVNLIVNC TCSWKNCLPRNWSLVPKRFETAELKGLFNATELERGVARIHTQE >gi568815594f:40686112_40886708|GENSCAN_predicted_CDS_3|675_bp atgctgaattccacgggcgaactggagttttcgaacgaagaagatcccgagatcatctcc caactcacttccctgcctctgtccggtgggaaaagctcagctggtgtgcccgaaaaaacg ggctatccggactccgtttatgtcatggcagccaacatttttcagggtattcgaatcgaa aagtcggcacagaaagtcttaatcaagtatgggaatgaacccctgcggtccttgtccgag tctgaggatcagtcctttcagcgtttgtcttatgagctggctttcagtgccctgaaatat caagatattttggaaactatattgatagacagctgtatcttcccaagtaccacaatacca gatcatttgagcagtcttattattgtgatgctatatgatttccaagatagaaaatttcaa actcgtgtcctttctgataatgaagagcccatatcagaagttcaagaagtagagaacctt cttaacagattagcagcagcattagattctcataggagcgtgaacctgattgtgaactgc acatgttcatggaaaaattgtcttccacgaaactggtccctggtaccaaaaaggttcgag actgctgagttaaagggactgtttaatgccacagaactggaaaggggagtagccagaatt cacactcaggagtga >gi568815594f:40686112_40886708|GENSCAN_predicted_peptide_4|573_aa MKAEIKTFFETGENKDTTYQNLWDTFKAVCRGKFIALNAHKRKKERSKIDTLTSQLKELE KQEQAHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRLLARLIKKKR EKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEE VESLNRPITGSETVAIINSLPTKKTPGPDGFTAEFYQRYKEELVPFLLKLFQTIEKEGIL PNSFYEASIILIPKPGRDTAKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVG FIPGMQGWFNIHKSINVIQHINRTKDKNHMIISIDAEKAFHKIQQPFMLKTLNKLGIDGT YLKIIRAIYDKPTANIILNGQKLEAFPWKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEI KGIQLGKEEVKLSLFADDMIVYLGNPIVSAQNLLKLISNFSKVSGYKINVQISQAFLYND NRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTKKWKNIPCS WVERINIVKMAILPKVIYRFNAIPIKLPMTFFT >gi568815594f:40686112_40886708|GENSCAN_predicted_CDS_4|1722_bp atgaaggcagaaataaagacgttctttgaaactggcgagaacaaagacacaacataccag aatctctgggacacattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagaaggaaagatccaaaattgacaccctaacatcacaattaaaagaactagaa aagcaagagcaagcacattcaaaagctagcagaaggcaagaaataactaaaatcagagca gaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatccaggagctgg ttttttgaaaggatcaacaaaattgatagactgctagcaagactaataaagaaaaaaaga gagaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccaccgatcccaca gaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaactagaaaat ctagaagaaatggataaattcctcgacacatacactctcccaagactaaaccaggaagaa gttgaatctctgaatagaccaataacaggctctgaaactgtggcaataatcaatagctta ccaaccaaaaaaactccaggaccagatggattcacagccgaattctaccagaggtacaag gaggaactggtaccattccttctgaaactattccaaacaatagaaaaagagggaatcctc cctaactcattttatgaggccagcatcatcctcataccaaagccaggcagagacacagcc aaaaaagagaattttagaccaatatccttgatgaacattgatgcgaaaatcctcaataaa atactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggc ttcatccctgggatgcaaggctggttcaatatacacaaatcaataaatgtaatccagcat ataaacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggccttt cacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtattgatgggacg tatctcaaaataataagagctatctatgacaaacccacagccaatatcatactgaatggg caaaaactggaagcattcccttggaaaactggcacaagacagggatgccctctctcacca ctcctattcaacatagtattggaagttctggccagggcaattaggcaggagaaggaaata aagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgatt gtatatctaggaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttc agcaaagtctcaggatacaaaatcaatgtacaaatatcacaagcattcttatacaacgat aacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaagaga ataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaattac aaaccactgctcaatgaaataaaagaggataccaagaaatggaagaacattccatgctca tgggtagaaagaatcaatatcgtgaaaatggccatactacccaaggtaatttatagattc aatgccatccccatcaagctaccaatgactttcttcacataa >gi568815594f:40686112_40886708|GENSCAN_predicted_peptide_5|638_aa MGNGLSDQTSILSNLPSFQSFHIVILGLDCAGKTTVLYRLQFNEFVNTVPTKGFNTEKIK VTLGNSKTVTFHFWDVGGQEKLRPLWKSYTRCTDGIVFVVDSVDVERMEEAKTELHKINR ISENQGVPVLIVANKQDLKNSLSLSEIEKLLAMGDLSSSTPWHLQPTCAIIGDGLKEGLE KLHDMIIKRRKMLRQQKKKDEYQYLLYLCGRVPRDFVRLEEEEERTFLLPMLVYPYGGCV ESWQCSPQHIPLVTVPSSQRPESQPDIEILHEKFINIESKDHRLQKVKVILLLPRCSGLG VSNPVEFILNEHEDTEFLKDHSQGGISVDKLHVLAQQQYEQLTHAMKFTKAQAVVYCTCS VFPEENEAVVKKALEFQDLGNKGQPYRLSPPVLPLCSLKEIQLSTDKFFRMEPSEITNGC FLSILTRERDPSETVSVNDVLARAAAKGLLDGIELGKSSKREKKKKKSKTSLTKGATTDN GIQMKIAEFLNRETKASANLSETVTKPPLPQKNTAQVGASSQTRKPNKLAPHPAVPAFVK NTCPSRPRERQTHFLRPRPEDRMVALKPIKIVLPPVFMPFSSPQGIRSRMPTQHLYCRWV APKALVPTCLPTHSLSRKEEKPKDDTPSSLLRPPRRWL >gi568815594f:40686112_40886708|GENSCAN_predicted_CDS_5|1917_bp atggggaatgggctatcagaccagacttctatcctgtccaacctgccttcatttcagtcc ttccacattgttattctgggtttggactgtgctggaaagacaactgtcttatacaggctg cagttcaatgaatttgtaaataccgtacctaccaaaggatttaacactgagaaaattaag gtaaccttgggaaattctaaaacagtcacttttcatttctgggatgtaggtggtcaggag aaattaaggccactgtggaagtcatataccagatgcacagatggcattgtgtttgttgtg gactctgttgatgtcgaaaggatggaagaagccaaaactgaacttcacaaaataaatagg atatcagaaaatcaaggagtccctgtacttatagttgctaacaaacaagatttgaagaac tcattgtctctttcagaaattgagaaattgttagcaatgggtgacctgagttcatcaact ccttggcatttgcagcctacctgtgcaatcataggagatggcctaaaggaaggacttgag aaactacatgatatgatcattaaaagaagaaaaatgttgcggcaacagaaaaagaaagat gaatatcaatacctattatatctgtgtggaagggtgccgagagactttgtaagactggaa gaggaagaagaaaggacttttcttcttccaatgcttgtttatccctatggtggctgtgta gaatcctggcagtgttcaccacagcatattcctttggtaaccgtgccctcttctcagaga cctgaatctcagcctgatattgaaatacttcatgagaaatttattaacattgaatcaaag gatcacaggttacagaaagttaaagtgattttgctgctacctcgttgttcaggactgggt gttagtaatccagtagaatttattttaaatgaacatgaagatacagaattccttaaagat cactctcaaggaggcatctcagtggacaaacttcacgttcttgctcaacagcagtatgaa cagctaacacatgcaatgaaatttactaaagctcaagcagttgtttactgcacatgttca gtttttccagaagaaaatgaagctgttgttaagaaagcactggaatttcaagaccttggg aataaaggacaaccttacaggcttagtcctcctgttcttccactgtgctccttaaaggaa attcaattgtctactgataaatttttcagaatggaaccatctgaaattaccaatggttgt tttctttctattttaacaagggagcgggacccttctgagacagtgtctgtgaatgatgtt ttggcccgagctgcagccaagggtctgctggatgggattgagttgggtaaatcatcaaaa cgggagaagaagaagaaaaaatcaaaaacatcattgacaaaaggtgccactactgataat ggcatccaaatgaaaattgctgagttcctgaatcgagaaactaaagccagtgctaatcta tcagagactgtaacaaaaccacctcttccccagaaaaatactgctcaagtgggggcttcc tcacagaccagaaaacccaacaagctggccccccatcctgcagtgcctgcatttgtgaag aacacttgtccctccagaccgcgtgaacggcagacacacttcttaagacctcggccagaa gacagaatggttgctctgaaacccatcaagattgttctgcctccagtctttatgccattt tcaagtccccaagggatcagatctcggatgccaactcaacatttgtactgtcgttgggtt gcacccaaggcacttgtgcccacctgccttcccacacactcactatccagaaaagaggaa aagcctaaagatgacacaccttcctccctactcaggcctcctcggcgatggctttga >gi568815594f:40686112_40886708|GENSCAN_predicted_peptide_6|576_aa MAVNVDNDWGILRVLLEAVMIRMMRGGPGGKADQAADQQAQAGAGLYVSEQNGPSVAGGR EFAAGPAGRSMTLAIECPRERVSVNIGFLPFLLVIYPVLHALRSSCSVYGACKSAVTMPE RNAESWQGDGDGLSTGREAKSSLEWLRGEGAGNGGMDARNFQEVKQDECGDWVWHVRQGE SGSSMEDTELEREEAGAAKAFLGKVSIIVYEKAARDFAYVARDKDTRILKCHVFRCDTPA KAIATSLHEICSKHNSPLADIPPPSALHTHGLPQMESLAFQEEVLLPSSSGCYRFLLYVD PKSSSWNFQPLLGHMAIPVPNHCLLGTLWLDPPDEHSIPKIMAERKNAKALACSSLQERA NVNLDVPLQVDFPTPKTELVQKFHVQYLGMLPVDKPVEKLWKQLAKGPYSMGGSTVKVPS RDRMDILNSAIENLMTSSNKEDWLSVNMNVADATVTVISEKNEEEVLVECRVRFLSFMGV GKDVHTFAFIMDTGNQRFECHVFWCEPNAGNVSEAVQAACMLRYQKCLVARPPSQKVRPP PPPADSVTRRVTTNVKRGVLSLIDTLKQKRPVTEMP >gi568815594f:40686112_40886708|GENSCAN_predicted_CDS_6|1731_bp atggctgtgaatgttgacaatgactgggggatcctcagggttcttttggaggcagtgatg atcaggatgatgaggggaggacctgggggaaaagcagaccaagcagccgatcagcaagcc caggcaggggcgggcttgtatgtttcggagcaaaacggtcccagtgtggctggaggcagg gagtttgcggcgggccctgctggccgctccatgaccctggccatagagtgcccgagagaa cgggtttctgtaaacattgggtttcttcctttcctcctcgttatctatccagtgctccat gctcttcggtctagttgctctgtgtacggtgcatgcaaatcggctgtgacaatgccagaa aggaatgctgagtcctggcaaggggatggagatggcctgagcacaggcagggaggcgaag agcagcttagaatggctgcgtggtgagggggctggaaatggagggatggatgccagaaat ttccaggaggtgaaacaggacgaatgtggggattgggtgtggcacgtgaggcagggggag tccggttctagcatggaggacactgagctagaacgtgaggaagcaggagccgccaaggct ttcttggggaaagtatccatcatagtctatgaaaaggctgccagggattttgcttatgta gcaagagataaagatacaagaattttgaaatgtcatgtatttcgatgtgacacaccagca aaagccattgccacaagtctccacgagatctgctccaagcataacagcccccttgcagac atccccccaccctctgccctacacacccatggactcccacagatggagtctcttgctttt caggaggaggttctattaccaagcagctctggctgttacagatttcttctttatgttgac ccgaaatcttcctcttggaacttccagccacttttgggtcacatggctattccagtccct aaccattgtcttctgggcactctctggttggatcccccagatgaacacagtattccaaaa attatggctgaacggaagaatgccaaagcgctggcctgcagctccttacaggaaagggcc aatgtgaacctcgatgtccctttgcaagtagattttccaacaccaaagactgagctggtc cagaagttccacgtgcagtacttgggcatgttacctgtagacaaaccagtcgaaaaactg tggaaacagctggcaaaagggccctactcaatggggggatccactgtaaaggtgccatca agagacagaatggatattttgaacagtgccatagaaaatcttatgacctcatccaacaag gaggactggctgtcagtgaacatgaacgtggctgatgccactgtgactgtcatcagtgaa aagaatgaagaggaagtcttagtggaatgtcgtgtgcgattcctgtccttcatgggtgtt gggaaggacgtccacacatttgccttcatcatggacacggggaaccagcgctttgagtgc cacgttttctggtgcgagcctaatgctggtaacgtgtctgaggcggtgcaggccgcctgc atgttacgatatcagaagtgcttggtagccaggccgccttctcagaaagttcgaccacct ccaccgccagcagattcagtaaccagaagagtcacaaccaatgtaaaacgaggggtctta tccctcattgacactttgaaacagaaacgccctgtcaccgaaatgccatag >gi568815594f:40686112_40886708|GENSCAN_predicted_peptide_7|211_aa MLGTKDVAMNENAMVLALMGCAKRNRQLQQEYEGKAWDAVREDTACQEHLPKESTFGLRF LEDERDSSGEQCREEHSRRVTVSAKALSARGCLSDGTAGWCRHPGACRSTWGGIPARLRK DREAKVLCTGASLGSSVDDTKVRQEDTAADGRLGDKEHPCVRHLVRQHEGLRQMRSLFEE AGDKPEKQFRKQDDTGYAECLEDHQQRDATE >gi568815594f:40686112_40886708|GENSCAN_predicted_CDS_7|636_bp atgcttggcaccaaggatgtagccatgaatgagaacgccatggttcttgccctcatggga tgtgctaagaggaacagacagctgcagcaggagtacgaaggaaaagcctgggatgctgtc agggaggacactgcctgtcaggaacatcttccaaaggaatcaacttttgggctgagattc ctagaagatgagagggattcatcaggtgagcagtgcagggaagagcattctaggagggtc acagtcagtgcaaaggctctgagtgcgcgcggctgcttgagcgacggcaccgctggctgg tgccggcaccctggggcttgtcgtagcacctggggcggcatccccgcccgacttcggaag gaccgggaggcaaaggttctctgcactggagcatccctgggaagcagtgtggatgacacc aaggtgagacaagaagatactgctgctgatggccgccttggagataaggagcatccatgt gtcaggcaccttgtgaggcagcatgaaggactgagacagatgaggtccctgtttgaagag gcaggagataagccggaaaaacaattcaggaagcaagatgatacaggttatgccgagtgt cttgaagaccatcaacagcgtgatgcaacagagtag >gi568815594f:40686112_40886708|GENSCAN_predicted_peptide_8|187_aa MDLEARRTVKVRLFNNSLARSGIWWAGTPGTVTTDYAPASLVCYLLDSGDCAALEGELPG GGGRRNLLPPLLPVSLSTILETQLYPSCISSESSLQFFQLHRNAELASSHGFFRDSSTAS RQALFLSLGPGPGVPPSSLGTPSAANNVSSSAVRGFAPGGSSVKLLCFNYCNLSPLFPQP CRSEPPS >gi568815594f:40686112_40886708|GENSCAN_predicted_CDS_8|564_bp atggacttagaggcacgaagaacagtgaaagtgagactttttaataacagtcttgcgaga tcgggtatctggtgggcaggcacacctgggacagtcacaacagactacgctcctgcttct ctcgtctgctacctgttagactctggtgactgtgcagcactcgaaggagaactgccaggt ggaggaggaaggaggaacttgctccctcctttgcttcctgtttctctgagcaccatccta gaaacacagctttatcccagttgcatcagttctgaatccagtttgcagtttttccaactt cacagaaatgcagaactggcttcatcacacggcttcttcagagattccagcacagctagc aggcaagccctctttttaagtctaggtcctggccctggggtccctccttcaagcctaggg acaccatcagcagccaacaatgtctcttcctctgcagtccgaggttttgctccaggaggc tcctccgtgaagcttctatgttttaattattgcaacctctctcctctgttccctcaaccc tgcaggtcagagcccccctcttaa