GENSCAN 1.0 Date run: 3-Nov-116 Time: 13:26:03 Sequence gi568815581r:56791703_57013988 : 222286 bp : 43.50% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 233 228 6 1.05 1.02 Term - 1358 675 684 2 0 68 35 836 0.999 69.94 1.01 Init - 5947 5876 72 1 0 82 60 121 0.972 7.77 1.00 Prom - 9442 9403 40 -3.66 2.00 Prom + 12706 12745 40 -3.96 2.01 Init + 15785 15803 19 1 1 39 110 36 0.312 1.09 2.02 Intr + 17594 17701 108 0 0 39 62 85 0.084 1.26 2.03 Intr + 21969 22012 44 1 2 89 80 35 0.070 0.76 2.04 Intr + 31277 31376 100 1 1 56 76 73 0.005 2.58 2.05 Intr + 41425 41582 158 2 2 34 69 162 0.013 8.63 2.06 Intr + 43076 43557 482 1 2 88 80 797 0.121 70.73 2.07 Intr + 52317 52476 160 0 1 129 36 53 0.884 4.19 2.08 Intr + 56220 56363 144 2 0 133 62 63 0.994 8.68 2.09 Intr + 56994 57151 158 2 2 55 123 55 0.966 4.51 2.10 Intr + 57479 57530 52 2 1 90 95 8 0.939 0.61 2.11 Intr + 64810 64923 114 0 0 84 93 43 0.913 5.04 2.12 Intr + 70089 70216 128 2 2 60 91 47 0.972 1.68 2.13 Intr + 70438 70549 112 1 1 88 73 65 0.824 5.38 2.14 Intr + 85633 85737 105 0 0 106 28 90 0.377 5.21 2.15 Term + 86326 86613 288 0 0 -61 48 457 0.796 22.18 2.16 PlyA + 91084 91089 6 1.05 3.11 PlyA - 91888 91883 6 1.05 3.10 Term - 93139 92938 202 0 1 75 49 203 0.623 11.96 3.09 Intr - 100527 100002 526 1 1 62 32 548 0.066 38.60 3.08 Intr - 103739 103641 99 0 0 100 65 95 0.984 8.48 3.07 Intr - 103902 103819 84 1 0 102 110 19 0.968 5.29 3.06 Intr - 104250 104224 27 1 0 68 99 29 0.512 0.09 3.05 Intr - 107478 107413 66 1 0 70 100 33 0.732 1.58 3.04 Intr - 109876 109717 160 1 1 61 60 198 0.998 13.86 3.03 Intr - 112786 112553 234 1 0 89 92 349 0.999 33.29 3.02 Intr - 116861 116766 96 2 0 116 100 97 0.998 13.91 3.01 Init - 122286 121690 597 0 0 95 75 920 0.987 86.68 3.00 Prom - 129557 129518 40 -2.46 4.08 PlyA - 129699 129694 6 1.05 4.07 Term - 147452 147369 84 2 0 104 48 42 0.773 -0.55 4.06 Intr - 150421 150333 89 2 2 111 100 80 0.970 11.19 4.05 Intr - 154809 154740 70 0 1 64 105 7 0.484 -1.25 4.04 Intr - 157732 157685 48 1 0 88 116 40 0.924 5.68 4.03 Intr - 158065 157979 87 1 0 79 71 36 0.630 1.17 4.02 Intr - 159294 158511 784 2 1 89 22 327 0.389 17.68 4.01 Init - 169317 169073 245 0 2 84 85 263 0.999 22.81 4.00 Prom - 173359 173320 40 -3.26 5.00 Prom + 176255 176294 40 -5.76 5.01 Init + 186458 186533 76 1 1 67 96 238 0.999 21.75 5.02 Intr + 189380 189528 149 0 2 25 103 226 0.832 17.75 5.03 Intr + 193676 193765 90 1 0 131 27 61 0.923 4.49 5.04 Intr + 195993 196148 156 2 0 41 115 162 0.900 14.41 5.05 Intr + 196514 196588 75 1 0 69 90 60 0.941 4.01 5.06 Intr + 199397 199469 73 1 1 123 111 50 0.997 9.78 5.07 Intr + 203279 203316 38 0 2 63 115 25 0.965 0.68 5.08 Intr + 203805 203933 129 2 0 49 98 101 0.919 8.09 5.09 Intr + 206697 206796 100 2 1 30 65 75 0.651 -0.92 5.10 Intr + 209153 209290 138 0 0 99 49 306 0.989 28.24 5.11 Intr + 210316 210479 164 2 2 115 95 156 0.993 18.69 5.12 Term + 214471 214533 63 0 0 95 45 48 0.739 -0.91 5.13 PlyA + 214767 214772 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 41672 41406 267 2 0 70 49 341 0.972 22.23 S.002 Term + 70910 71089 180 1 0 62 37 122 0.855 2.11 S.003 Term - 100527 99998 530 1 2 62 48 560 0.920 43.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:56791703_57013988|GENSCAN_predicted_peptide_1|251_aa MALCYLCCLLPSLVVAHLPLLIILSPPFPLSSSSFLFFIIIITITTTNIITITITITITT IATIITITHTIITTTITPTTITTTTTTTTITTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTSPSPPSSSSPLPPSSPSTPSSPSPPSAPSPPPPHHHYHHHHHNHHH YYHHHNHHHHHHCHTIITTTTTTTTTIIIIITTTTIITIITITTTTITIITITATPSSPP PPLPSQSLSLS >gi568815581r:56791703_57013988|GENSCAN_predicted_CDS_1|756_bp atggccctctgttatttgtgctgtctgctcccttctctggtggtagcacacctgccactc ctcatcatcctgtctcctccttttcccctctcctcctcctccttcttgttcttcatcatc atcatcacaatcaccaccaccaacatcatcaccatcactatcaccatcaccatcaccacc atcgccaccatcatcaccattacccacaccatcatcaccaccaccatcacccccaccacc atcaccaccaccaccaccaccaccaccatcaccaccaccaccaccaccaccaccaccacc accaccaccaccaccaccaccaccaccaccaccaccaccaccaccaccaccaccaccacc accaccaccaccaccaccaccaccaccaccaccaccaccacctcaccatcaccaccatca tcatcgtcaccattgccaccatcatcaccatcaaccccatcatcaccatcaccaccatca gcaccatcaccaccaccaccacatcaccattatcaccaccaccaccataaccatcatcac tattaccatcaccacaatcaccatcatcaccatcactgccacaccatcatcaccaccacc accaccaccaccaccactatcatcattatcatcaccaccaccaccatcataaccatcatc actattaccaccaccacaatcaccatcatcaccatcactgccacaccatcatcaccacca ccaccactaccatcacaatcattatcattatcctaa >gi568815581r:56791703_57013988|GENSCAN_predicted_peptide_2|723_aa MSDGIPAPYCVQKPNSAVASKYLPTQPHNNLAMPSAIIPQNPDFEQKPTAPAPSRYQVRT AKVSIMFLMASVRLLSNLNRDKGALAQAESRRKPLSAAVRAGAVPVPLWIPCFFGGAEQR CFRVRVLRVADVLQAAARVVAAQVSSLEKMEAERRPAPGSPSEGLFADGHLILWTLCSVL LPVFITFWCSLQRSRRQLHRRDIFRKSKHGWRDTDLFSQPTYCCVCAQHILQGAFCDCCG LRVDEGCLRKADKRFQCKEIMLKNDTKVLDAMPHHWIRGNVPLCSYCMVCKQQCGCQPKL CDYRCIWCQKTVHDECMKNSLKNEKCDFGEFKNLIIPPSYLTSINQMRKDKKTDYEVVFD VTKTPPIKALQLCTLLPYYSARVLVCGGDGTVGWVLDAVDDMKIKGQEKYIPQVAVLPLG TGNDLSNTLGWGTGYAGEIPVAQVLRNVMEADGIKLDRWKVQVTNKGYYNLRKPKEFTMN NYFSVGPDALMALNFHAHREKAPSLFSSRILNKLELDGERVALPSLEGIIVLNIGYWGGG CRLWEGMGDETYPLARHDDGLLEVVGVYGSFHCAQIQVKLANPFRIGQAHTVRQTATSVQ MTAKYSEKVTHPNSGVTNMRIPFSLQLELLHDDDEEERRGRGRGRGRRRRKKRKKKKKKK NKKKKKKKKKKKKKKKKKKKEKEKKKKKKSGWILEGQNAKVKIAEILDLGEHSPLGETGQ RMK >gi568815581r:56791703_57013988|GENSCAN_predicted_CDS_2|2172_bp atgtctgacgggatcccagctccctactgtgtacagaaacccaactctgctgtggcatcc aagtaccttccaacccagcctcacaacaacctggcgatgccatctgccatcattccacaa aacccagactttgagcagaaacctacagccccggcacctagcaggtaccaggtgaggaca gccaaggtcagcatcatgttcctgatggccagtgtcaggcttctctccaacttaaacagg gacaaaggagcattggcacaggcagaatcacgaaggaagccgctgagtgcagctgtgcgc gccggggcggtgcctgtgcctctctggattccgtgtttcttcgggggtgctgagcagcgg tgcttccgcgtccgcgttctccgggtagctgatgtgctgcaggctgcagcccgcgtggtc gcggctcaggtatcgtccttggagaagatggaagcggagaggcggccggcgccgggctcg ccctccgagggcctgtttgcggacgggcacctgatcttgtggacgctgtgctcggtcctg ctgccggtgttcatcaccttctggtgtagcctccagcggtcgcgccggcagctgcaccgc agggacatcttccgcaagagcaagcacgggtggcgcgacacggacctgttcagccagccc acctactgctgcgtgtgcgcgcagcacattctgcagggcgccttctgcgactgctgcggg ctccgcgtggacgagggctgcctcaggaaggccgacaagcgcttccagtgcaaggagatt atgctcaagaatgacaccaaggtcctggacgccatgccccaccactggatccggggcaac gtgcccctgtgcagttactgtatggtttgcaagcagcagtgtggctgtcaacccaagctt tgcgattacaggtgcatttggtgccagaaaacagtacatgatgagtgcatgaaaaatagt ttaaagaatgaaaaatgtgattttggagaattcaaaaacctaatcattccaccaagttat ttaacatccattaatcagatgcgtaaagacaaaaaaacagattatgaagtggtttttgat gtaactaaaactcctcctatcaaagccctacaactctgtactcttctcccatattattca gctcgagtacttgtttgtggaggggatgggactgtagggtgggtcctggatgcagttgat gacatgaagattaagggacaagaaaagtacattccacaagttgcagttttgcctctggga acaggcaacgatctatccaatacattgggttggggtacaggttatgctggagaaattcca gttgcgcaggttttgcgaaatgtaatggaagcagatggaattaaactagatcgatggaaa gttcaagtaacaaataaaggatactacaacttaagaaaacccaaggaattcacaatgaac aactatttttctgttggacctgatgctctcatggctctcaattttcatgctcatcgtgag aaggcaccatctctgttttctagcagaattcttaataagctagaactggatggtgagcga gtagcactgcccagcttggaaggtattatagttctgaacatcggatactggggcggtggc tgcagactatgggaagggatgggggacgagacttaccctctagccaggcatgacgatggt ctgctggaagtcgttggagtatatgggtctttccactgtgctcagattcaagtaaaactg gctaatccttttcgaataggacaggcacatacagtgaggcagactgccacgtcggtccaa atgacagccaagtactcagagaaggtgacccaccccaactcaggagtaaccaacatgcga atcccattctctttgcagttggagctattacatgatgatgatgaagaagaaagaagagga agaggaagaggaagaggaagaagaagaagaaagaagaggaagaagaagaagaagaagaag aacaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag gagaaggagaagaagaagaaaaaaaaaagtgggtggattcttgaagggcaaaatgctaag gtgaaaatagcagagatcctggaccttggagaacactcacctttaggggagactggacag aggatgaaatga >gi568815581r:56791703_57013988|GENSCAN_predicted_peptide_3|696_aa MAELCPLAEELSCSICLEPFKEPVTTPCGHNFCGSCLNETWAVQGSPYLCPQCRAVYQAR PQLHKNTVLCNVVEQFLQADLAREPPADVWTPPARASAPSPNAQVACDHCLKEAAVKTCL VCMASFCQEHLQPHFDSPAFQDHPLQPPVRDLLRRKCSQHNRLREFFCPEHSECICHICL VEHKTCSPASLSQASADLEATLRHKLTVMYSQINGASRALDDVRNRQQDVRMTANRKVEQ LQQEYTEMKALLDASETTSTRKIKEEEKRVNSKFDTIYQILLKKKSEIQTLKEEIEQSLT KRDEFEFLEKASKLRGISTKPVYIPEVELNHKLIKGIHQSTIDLKNELKQCIGRLQEPTP SSGDPGEHDPASTHKSTRPVKKVSKEEKKSKKPPPVPALPSKLPTFGAPEQLVDLKQAGL EAAAKATSSHPNSTSLKAKVLETFLAKSRPELLEYYIKVILDYNTAHNKVALSECYTVAS VAEMPQNYRPHPQRFTYCSQVLGLHCYKKGIHYWEVELQKNNFCGVGICYGSMNRQGPES RLGRNSASWCVEWFNTKISAWHNNVEKTLPSTKATRVGVLLNCDHGFVIFFAVADKVHLM YKFRVDFTEALYPAFWVFSAGATLSICSPKNVKGHELIQDLLSSLHLDSSYPPDAGLSDD DEPPNASLPPDPPLLTVPQMHSVCDQWLQDAFHISL >gi568815581r:56791703_57013988|GENSCAN_predicted_CDS_3|2091_bp atggcagagctgtgccccctggccgaggagctgtcgtgctccatctgcctggagcccttc aaggagccggtcaccactccgtgcggccacaacttctgcgggtcgtgcctgaatgagacg tgggcagtccagggctcgccatacctgtgcccgcagtgccgcgccgtctaccaggcgcga ccgcagctgcacaagaacacggtgctgtgcaacgtggtggagcagttcctgcaggccgac ctggcccgggagccacccgccgacgtctggacgccgcccgcccgcgcctctgcacccagc ccgaatgcccaggtggcctgcgaccactgcctgaaggaggccgccgtgaagacgtgcttg gtgtgcatggcctccttctgtcaggagcacctgcagccgcacttcgacagccccgccttc caggaccacccgctgcagccgcccgttcgcgacctgttgcgccgcaaatgttcccagcac aatcggctgcgggaatttttctgccccgagcacagcgagtgcatctgccacatctgcctg gtggagcataagacctgctctcccgcgtccctgagccaggccagcgccgacctggaggcc accctgaggcacaaactaactgtcatgtacagtcagatcaacggggcgtcgagagcactg gatgatgtgagaaacaggcagcaggatgtgcggatgactgcaaacagaaaggtggagcag ctacaacaagaatacacggaaatgaaggctctcttggacgcctcagagaccacctcgaca aggaagataaaggaagaggagaagagggtcaacagcaagtttgacaccatttatcagatt ctcctcaagaagaagagtgagatccagaccttgaaggaggagattgaacagagcctgacc aagagggatgagttcgagtttctggagaaagcatcaaaactgcgaggaatctcaacaaag ccagtctacatccccgaggtggaactgaaccacaagctgataaaaggcatccaccagagc accatagacctcaaaaacgagctgaagcagtgcatcgggcggctccaggagcccaccccc agttcaggtgaccctggagagcatgacccagcgtccacacacaaatccacacgccctgtg aagaaggtctccaaagaggaaaagaaatccaagaaacctccccctgtccctgccttaccc agcaagcttcccacgtttggagccccggaacagttagtggatttaaaacaagctggcttg gaggctgcagccaaagccaccagctcacatccgaactcaacatctctcaaggccaaggtg ctggagaccttcctggccaagtccagacctgagctcctggagtattacattaaagtcatc ctggactacaacaccgcccacaacaaagtggctctgtcagagtgctatacagtagcttct gtggctgagatgcctcagaactaccggccgcatccccagaggttcacatactgctctcag gtgctgggcctgcactgctacaagaaggggatccactactgggaggtggagctgcagaag aacaacttctgtggggtaggcatctgctacggaagcatgaaccggcagggcccagaaagc aggctcggccgcaacagcgcctcctggtgcgtggagtggttcaacaccaagatctctgcc tggcacaataacgtggagaaaaccctgccctccaccaaggccacgcgggtgggcgtgctt ctcaactgtgaccacggctttgtcatcttcttcgctgttgccgacaaggtccacctgatg tataagttcagggtggactttactgaggctttgtacccggctttctgggtattttctgct ggtgccacactctccatctgctcccccaaaaatgtgaaaggtcatgagctcatccaggac ttgctatcctccctgcatttagacagttcctacccacctgatgctggcctgtctgatgat gatgagcctcccaatgccagcctgccccccgacccgccactcctcactgtgccccagatg cacagtgtttgtgaccagtggctgcaggatgccttccacatcagcctctga >gi568815581r:56791703_57013988|GENSCAN_predicted_peptide_4|468_aa MAASETVRLRLQFDYPPPATPHCTAFWLLVDLNRCRVVTDLISLIRQRFGFSSGAFLGLY LEGGLLPPAESARLVRDNDCLRVKLEERGVAENSVVISNGDINLSLRKAKKRAFQLEEGE ETEPDCKYSKKHWKSRENNNNNEKVLDLEPKAVTDQTVSKKNKRKNKATCGTVGDDNEEA KRKSPKKKEKCEYKKKAKNPKSPKVQAVKDWANQRCSSPKGSARNSLVKAKRKGSVSVCS KESPSSSSESESCDESISDGPSKVTLEARNSSEKLPTELSKEEPSTKNTTADKLAIKLGF SLTPSKGKTSGTTSSSSDSSAESDDQCLMSSSTPECAAGFLKTNPVETPKKDYSLLPLLA AAPQVGEKIAFKLLELTSSYSPDVSDYKEGRILSHNPETQQVDIEILSSLPALREPGKFD LVYHNENGAEVVEYAVTQESKITVFWKELIDPRLIIESPSNTSSTEPA >gi568815581r:56791703_57013988|GENSCAN_predicted_CDS_4|1407_bp atggcagcttccgagacggttaggctacggcttcaatttgattacccgccgccagctacc ccgcactgtacggccttctggcttctggtcgacttgaacagatgccgagtcgtcacagat ctcattagtctcatccgccagcgcttcggcttcagttctggggccttcctaggcctctac ctggagggggggctcttgccccccgccgagagcgcgcgccttgtgagagacaacgactgc ctcagagttaaattagaagagagaggagttgctgagaattctgtagtcatcagtaatggt gacattaatttatctcttagaaaagcaaagaagcgggcatttcagttagaggagggtgaa gaaactgaaccagattgcaaatattcaaagaagcattggaagagtcgagagaacaataac aataatgagaaggtcttggatctggaaccaaaagctgtcacagatcagactgtcagcaaa aaaaacaagagaaaaaataaagcaacctgtggcacagtgggtgatgataacgaagaggcc aaaagaaaatcaccaaagaaaaaggagaaatgtgaatataaaaaaaaggctaagaatccc aagtctccgaaagtacaggcagtgaaagactgggccaatcagagatgtagttctccaaaa ggttctgctagaaacagccttgttaaagccaaaaggaaaggtagtgtaagcgtttgctca aaagagagtcccagttcctcctcggagtctgaatcttgtgatgaatctatcagtgatggt cccagcaaagtcactttggaggccagaaattcctcagagaaattaccaactgagttatca aaggaagaaccctctaccaaaaatacaactgcagacaaactggctataaaacttggcttt agccttacccccagcaagggcaagacctctggaacaacatcttccagttcagactctagt gcagagtcagacgaccaatgcttgatgtcatcgagcaccccggagtgtgctgcgggtttc ttaaagacaaatccagtagagacacccaagaaggactatagtctgttaccactgttagca gctgcccctcaagttggagaaaagattgcatttaagcttttggagctaacatccagttac tctcctgatgtctctgactacaaggaaggaagaatattaagccacaatccagagacccag caagtagatatagaaattctttcatccttacctgccttgagagaacctgggaaatttgat ttagtttatcacaatgaaaatggagccgaggtagtggagtacgctgtgacacaggagagc aagatcactgtattttggaaagagttgattgacccaagactgattattgaatctccaagt aacacatcaagtacagaacctgcctga >gi568815581r:56791703_57013988|GENSCAN_predicted_peptide_5|416_aa MELALRRSPVPRWLLLLPLLLGLNAGAVIDWPTEEGKEVWDYVTVRKDAYMFWWLYYATN SCKNFSELPLVMWLQGGPGGSSTGFGNFEEIGPLDSDLKPRKTTWLQAASLLFVDNPVGT GFSYVNGSGAYAKDLAMVASDMMVLLKTFFSCHKEFQTVPFYIFSESYGGKMAAGIGLEL YKAIQRGTIKCNFAGVALGDSWISPVDSVLSWGPYLYSMSLLEDKGLAEVSKVAEQVLNA VNKGLYREATELWGKAEMIIEQRHVRHLQRDALSQLMNGPIRKKLKIIPEDQSWGGQATN VFVNMEEDFMKPVISIVDELLEAGINVTVYNGQLDLIVDTMGQEAWVRKLKWPELPKFSQ LKWKALYSDPKSLETSAFVKSYKNLAFYWILKAGHMVPSDQGDMALKMMRLVTQQE >gi568815581r:56791703_57013988|GENSCAN_predicted_CDS_5|1251_bp atggagctggcactgcggcgctctcccgtcccgcggtggttgctgctgctgccgctgctg ctgggcctgaacgcaggagctgtcattgactggcccacagaggagggcaaggaagtatgg gattatgtgacggtccgcaaggatgcctacatgttctggtggctctattatgccaccaac tcctgcaagaacttctcagaactgcccctggtcatgtggcttcagggcggtccaggcggt tctagcactggatttggaaactttgaggaaattgggccccttgacagtgatctcaaacca cggaaaaccacctggctccaggctgccagtctcctatttgtggataatcccgtgggcact gggttcagttatgtgaatggtagtggtgcctatgccaaggacctggctatggtggcttca gacatgatggttctcctgaagaccttcttcagttgccacaaagaattccagacagttcca ttctacattttctcagagtcctatggaggaaaaatggcagctggcattggtctagagctt tataaggccattcagcgagggaccatcaagtgcaactttgcgggggttgccttgggtgat tcctggatctcccctgttgattcggtgctctcctggggaccttacctgtacagcatgtct cttctcgaagacaaaggtctggcagaggtgtctaaggttgcagagcaagtactgaatgcc gtaaataaggggctctacagagaggccacagagctgtgggggaaagcagaaatgatcatt gaacagcgccacgtgagacacctacaacgagatgccttaagccagctcatgaatggcccc atcagaaagaagctcaaaattattcctgaggatcaatcctggggaggccaggctaccaac gtctttgtgaacatggaggaggacttcatgaagccagtcattagcattgtggacgagttg ctggaggcagggatcaacgtgacggtgtataatggacagctggatctcatcgtagatacc atgggtcaggaggcctgggtgcggaaactgaagtggccagaactgcctaaattcagtcag ctgaagtggaaggccctgtacagtgaccctaaatctttggaaacatctgcttttgtcaag tcctacaagaaccttgctttctactggattctgaaagctggtcatatggttccttctgac caaggggacatggctctgaagatgatgagactggtgactcagcaagaatag