GENSCAN 1.0 Date run: 6-Nov-116 Time: 00:14:08 Sequence gi568815581r:56839074_57061019 : 221946 bp : 43.43% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4946 5105 160 2 1 129 36 53 0.886 4.19 1.02 Intr + 8849 8992 144 1 0 133 62 63 0.995 8.68 1.03 Intr + 9623 9780 158 1 2 55 123 55 0.966 4.51 1.04 Intr + 10108 10159 52 1 1 90 95 8 0.939 0.61 1.05 Intr + 17439 17552 114 2 0 84 93 43 0.913 5.04 1.06 Intr + 22718 22845 128 1 2 60 91 47 0.972 1.68 1.07 Intr + 23067 23178 112 0 1 88 73 65 0.824 5.38 1.08 Intr + 38262 38366 105 2 0 106 28 90 0.377 5.21 1.09 Term + 38955 39242 288 2 0 -61 48 457 0.796 22.18 1.10 PlyA + 43713 43718 6 1.05 2.11 PlyA - 44517 44512 6 1.05 2.10 Term - 45768 45567 202 2 1 75 49 203 0.623 11.96 2.09 Intr - 53156 52631 526 0 1 62 32 548 0.066 38.60 2.08 Intr - 56368 56270 99 2 0 100 65 95 0.984 8.48 2.07 Intr - 56531 56448 84 0 0 102 110 19 0.968 5.29 2.06 Intr - 56879 56853 27 0 0 68 99 29 0.512 0.09 2.05 Intr - 60107 60042 66 0 0 70 100 33 0.732 1.58 2.04 Intr - 62505 62346 160 0 1 61 60 198 0.998 13.86 2.03 Intr - 65415 65182 234 0 0 89 92 349 0.999 33.29 2.02 Intr - 69490 69395 96 1 0 116 100 97 0.998 13.91 2.01 Init - 74915 74319 597 2 0 95 75 920 0.987 86.68 2.00 Prom - 82186 82147 40 -2.46 3.08 PlyA - 82328 82323 6 1.05 3.07 Term - 100081 99998 84 1 0 104 48 42 0.773 -0.55 3.06 Intr - 103050 102962 89 1 2 111 100 80 0.970 11.19 3.05 Intr - 107438 107369 70 2 1 64 105 7 0.484 -1.25 3.04 Intr - 110361 110314 48 0 0 88 116 40 0.924 5.68 3.03 Intr - 110694 110608 87 0 0 79 71 36 0.630 1.17 3.02 Intr - 111923 111140 784 1 1 89 22 327 0.389 17.68 3.01 Init - 121946 121702 245 2 2 84 85 263 0.999 22.81 3.00 Prom - 125988 125949 40 -3.26 4.00 Prom + 128884 128923 40 -5.76 4.01 Init + 139087 139162 76 0 1 67 96 238 0.999 21.75 4.02 Intr + 142009 142157 149 2 2 25 103 226 0.832 17.75 4.03 Intr + 146305 146394 90 0 0 131 27 61 0.923 4.49 4.04 Intr + 148622 148777 156 1 0 41 115 162 0.900 14.41 4.05 Intr + 149143 149217 75 0 0 69 90 60 0.941 4.01 4.06 Intr + 152026 152098 73 0 1 123 111 50 0.997 9.78 4.07 Intr + 155908 155945 38 2 2 63 115 25 0.965 0.68 4.08 Intr + 156434 156562 129 1 0 49 98 101 0.919 8.09 4.09 Intr + 159326 159425 100 1 1 30 65 75 0.651 -0.92 4.10 Intr + 161782 161919 138 2 0 99 49 306 0.989 28.24 4.11 Intr + 162945 163108 164 1 2 115 95 156 0.993 18.69 4.12 Term + 167100 167162 63 2 0 95 45 48 0.737 -0.91 4.13 PlyA + 167396 167401 6 1.05 5.00 Prom + 167565 167604 40 -3.56 5.01 Init + 169642 169706 65 0 2 37 46 128 0.324 4.12 5.02 Intr + 172610 172676 67 2 1 66 109 25 0.145 1.31 5.03 Intr + 201638 201736 99 1 0 85 60 34 0.020 0.61 5.04 Intr + 206624 206804 181 1 1 19 -7 295 0.098 12.54 5.05 Term + 206987 207372 386 0 2 -49 52 658 0.922 43.55 5.06 PlyA + 207701 207706 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 23539 23718 180 0 0 62 37 122 0.855 2.11 S.002 Term - 53156 52627 530 0 2 62 48 560 0.920 43.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:56839074_57061019|GENSCAN_predicted_peptide_1|420_aa XCIWCQKTVHDECMKNSLKNEKCDFGEFKNLIIPPSYLTSINQMRKDKKTDYEVVFDVTK TPPIKALQLCTLLPYYSARVLVCGGDGTVGWVLDAVDDMKIKGQEKYIPQVAVLPLGTGN DLSNTLGWGTGYAGEIPVAQVLRNVMEADGIKLDRWKVQVTNKGYYNLRKPKEFTMNNYF SVGPDALMALNFHAHREKAPSLFSSRILNKLELDGERVALPSLEGIIVLNIGYWGGGCRL WEGMGDETYPLARHDDGLLEVVGVYGSFHCAQIQVKLANPFRIGQAHTVRQTATSVQMTA KYSEKVTHPNSGVTNMRIPFSLQLELLHDDDEEERRGRGRGRGRRRRKKRKKKKKKKNKK KKKKKKKKKKKKKKKKKEKEKKKKKKSGWILEGQNAKVKIAEILDLGEHSPLGETGQRMK >gi568815581r:56839074_57061019|GENSCAN_predicted_CDS_1|1263_bp nngtgcatttggtgccagaaaacagtacatgatgagtgcatgaaaaatagtttaaagaat gaaaaatgtgattttggagaattcaaaaacctaatcattccaccaagttatttaacatcc attaatcagatgcgtaaagacaaaaaaacagattatgaagtggtttttgatgtaactaaa actcctcctatcaaagccctacaactctgtactcttctcccatattattcagctcgagta cttgtttgtggaggggatgggactgtagggtgggtcctggatgcagttgatgacatgaag attaagggacaagaaaagtacattccacaagttgcagttttgcctctgggaacaggcaac gatctatccaatacattgggttggggtacaggttatgctggagaaattccagttgcgcag gttttgcgaaatgtaatggaagcagatggaattaaactagatcgatggaaagttcaagta acaaataaaggatactacaacttaagaaaacccaaggaattcacaatgaacaactatttt tctgttggacctgatgctctcatggctctcaattttcatgctcatcgtgagaaggcacca tctctgttttctagcagaattcttaataagctagaactggatggtgagcgagtagcactg cccagcttggaaggtattatagttctgaacatcggatactggggcggtggctgcagacta tgggaagggatgggggacgagacttaccctctagccaggcatgacgatggtctgctggaa gtcgttggagtatatgggtctttccactgtgctcagattcaagtaaaactggctaatcct tttcgaataggacaggcacatacagtgaggcagactgccacgtcggtccaaatgacagcc aagtactcagagaaggtgacccaccccaactcaggagtaaccaacatgcgaatcccattc tctttgcagttggagctattacatgatgatgatgaagaagaaagaagaggaagaggaaga ggaagaggaagaagaagaagaaagaagaggaagaagaagaagaagaagaagaacaagaag aagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaggagaaggag aagaagaagaaaaaaaaaagtgggtggattcttgaagggcaaaatgctaaggtgaaaata gcagagatcctggaccttggagaacactcacctttaggggagactggacagaggatgaaa tga >gi568815581r:56839074_57061019|GENSCAN_predicted_peptide_2|696_aa MAELCPLAEELSCSICLEPFKEPVTTPCGHNFCGSCLNETWAVQGSPYLCPQCRAVYQAR PQLHKNTVLCNVVEQFLQADLAREPPADVWTPPARASAPSPNAQVACDHCLKEAAVKTCL VCMASFCQEHLQPHFDSPAFQDHPLQPPVRDLLRRKCSQHNRLREFFCPEHSECICHICL VEHKTCSPASLSQASADLEATLRHKLTVMYSQINGASRALDDVRNRQQDVRMTANRKVEQ LQQEYTEMKALLDASETTSTRKIKEEEKRVNSKFDTIYQILLKKKSEIQTLKEEIEQSLT KRDEFEFLEKASKLRGISTKPVYIPEVELNHKLIKGIHQSTIDLKNELKQCIGRLQEPTP SSGDPGEHDPASTHKSTRPVKKVSKEEKKSKKPPPVPALPSKLPTFGAPEQLVDLKQAGL EAAAKATSSHPNSTSLKAKVLETFLAKSRPELLEYYIKVILDYNTAHNKVALSECYTVAS VAEMPQNYRPHPQRFTYCSQVLGLHCYKKGIHYWEVELQKNNFCGVGICYGSMNRQGPES RLGRNSASWCVEWFNTKISAWHNNVEKTLPSTKATRVGVLLNCDHGFVIFFAVADKVHLM YKFRVDFTEALYPAFWVFSAGATLSICSPKNVKGHELIQDLLSSLHLDSSYPPDAGLSDD DEPPNASLPPDPPLLTVPQMHSVCDQWLQDAFHISL >gi568815581r:56839074_57061019|GENSCAN_predicted_CDS_2|2091_bp atggcagagctgtgccccctggccgaggagctgtcgtgctccatctgcctggagcccttc aaggagccggtcaccactccgtgcggccacaacttctgcgggtcgtgcctgaatgagacg tgggcagtccagggctcgccatacctgtgcccgcagtgccgcgccgtctaccaggcgcga ccgcagctgcacaagaacacggtgctgtgcaacgtggtggagcagttcctgcaggccgac ctggcccgggagccacccgccgacgtctggacgccgcccgcccgcgcctctgcacccagc ccgaatgcccaggtggcctgcgaccactgcctgaaggaggccgccgtgaagacgtgcttg gtgtgcatggcctccttctgtcaggagcacctgcagccgcacttcgacagccccgccttc caggaccacccgctgcagccgcccgttcgcgacctgttgcgccgcaaatgttcccagcac aatcggctgcgggaatttttctgccccgagcacagcgagtgcatctgccacatctgcctg gtggagcataagacctgctctcccgcgtccctgagccaggccagcgccgacctggaggcc accctgaggcacaaactaactgtcatgtacagtcagatcaacggggcgtcgagagcactg gatgatgtgagaaacaggcagcaggatgtgcggatgactgcaaacagaaaggtggagcag ctacaacaagaatacacggaaatgaaggctctcttggacgcctcagagaccacctcgaca aggaagataaaggaagaggagaagagggtcaacagcaagtttgacaccatttatcagatt ctcctcaagaagaagagtgagatccagaccttgaaggaggagattgaacagagcctgacc aagagggatgagttcgagtttctggagaaagcatcaaaactgcgaggaatctcaacaaag ccagtctacatccccgaggtggaactgaaccacaagctgataaaaggcatccaccagagc accatagacctcaaaaacgagctgaagcagtgcatcgggcggctccaggagcccaccccc agttcaggtgaccctggagagcatgacccagcgtccacacacaaatccacacgccctgtg aagaaggtctccaaagaggaaaagaaatccaagaaacctccccctgtccctgccttaccc agcaagcttcccacgtttggagccccggaacagttagtggatttaaaacaagctggcttg gaggctgcagccaaagccaccagctcacatccgaactcaacatctctcaaggccaaggtg ctggagaccttcctggccaagtccagacctgagctcctggagtattacattaaagtcatc ctggactacaacaccgcccacaacaaagtggctctgtcagagtgctatacagtagcttct gtggctgagatgcctcagaactaccggccgcatccccagaggttcacatactgctctcag gtgctgggcctgcactgctacaagaaggggatccactactgggaggtggagctgcagaag aacaacttctgtggggtaggcatctgctacggaagcatgaaccggcagggcccagaaagc aggctcggccgcaacagcgcctcctggtgcgtggagtggttcaacaccaagatctctgcc tggcacaataacgtggagaaaaccctgccctccaccaaggccacgcgggtgggcgtgctt ctcaactgtgaccacggctttgtcatcttcttcgctgttgccgacaaggtccacctgatg tataagttcagggtggactttactgaggctttgtacccggctttctgggtattttctgct ggtgccacactctccatctgctcccccaaaaatgtgaaaggtcatgagctcatccaggac ttgctatcctccctgcatttagacagttcctacccacctgatgctggcctgtctgatgat gatgagcctcccaatgccagcctgccccccgacccgccactcctcactgtgccccagatg cacagtgtttgtgaccagtggctgcaggatgccttccacatcagcctctga >gi568815581r:56839074_57061019|GENSCAN_predicted_peptide_3|468_aa MAASETVRLRLQFDYPPPATPHCTAFWLLVDLNRCRVVTDLISLIRQRFGFSSGAFLGLY LEGGLLPPAESARLVRDNDCLRVKLEERGVAENSVVISNGDINLSLRKAKKRAFQLEEGE ETEPDCKYSKKHWKSRENNNNNEKVLDLEPKAVTDQTVSKKNKRKNKATCGTVGDDNEEA KRKSPKKKEKCEYKKKAKNPKSPKVQAVKDWANQRCSSPKGSARNSLVKAKRKGSVSVCS KESPSSSSESESCDESISDGPSKVTLEARNSSEKLPTELSKEEPSTKNTTADKLAIKLGF SLTPSKGKTSGTTSSSSDSSAESDDQCLMSSSTPECAAGFLKTNPVETPKKDYSLLPLLA AAPQVGEKIAFKLLELTSSYSPDVSDYKEGRILSHNPETQQVDIEILSSLPALREPGKFD LVYHNENGAEVVEYAVTQESKITVFWKELIDPRLIIESPSNTSSTEPA >gi568815581r:56839074_57061019|GENSCAN_predicted_CDS_3|1407_bp atggcagcttccgagacggttaggctacggcttcaatttgattacccgccgccagctacc ccgcactgtacggccttctggcttctggtcgacttgaacagatgccgagtcgtcacagat ctcattagtctcatccgccagcgcttcggcttcagttctggggccttcctaggcctctac ctggagggggggctcttgccccccgccgagagcgcgcgccttgtgagagacaacgactgc ctcagagttaaattagaagagagaggagttgctgagaattctgtagtcatcagtaatggt gacattaatttatctcttagaaaagcaaagaagcgggcatttcagttagaggagggtgaa gaaactgaaccagattgcaaatattcaaagaagcattggaagagtcgagagaacaataac aataatgagaaggtcttggatctggaaccaaaagctgtcacagatcagactgtcagcaaa aaaaacaagagaaaaaataaagcaacctgtggcacagtgggtgatgataacgaagaggcc aaaagaaaatcaccaaagaaaaaggagaaatgtgaatataaaaaaaaggctaagaatccc aagtctccgaaagtacaggcagtgaaagactgggccaatcagagatgtagttctccaaaa ggttctgctagaaacagccttgttaaagccaaaaggaaaggtagtgtaagcgtttgctca aaagagagtcccagttcctcctcggagtctgaatcttgtgatgaatctatcagtgatggt cccagcaaagtcactttggaggccagaaattcctcagagaaattaccaactgagttatca aaggaagaaccctctaccaaaaatacaactgcagacaaactggctataaaacttggcttt agccttacccccagcaagggcaagacctctggaacaacatcttccagttcagactctagt gcagagtcagacgaccaatgcttgatgtcatcgagcaccccggagtgtgctgcgggtttc ttaaagacaaatccagtagagacacccaagaaggactatagtctgttaccactgttagca gctgcccctcaagttggagaaaagattgcatttaagcttttggagctaacatccagttac tctcctgatgtctctgactacaaggaaggaagaatattaagccacaatccagagacccag caagtagatatagaaattctttcatccttacctgccttgagagaacctgggaaatttgat ttagtttatcacaatgaaaatggagccgaggtagtggagtacgctgtgacacaggagagc aagatcactgtattttggaaagagttgattgacccaagactgattattgaatctccaagt aacacatcaagtacagaacctgcctga >gi568815581r:56839074_57061019|GENSCAN_predicted_peptide_4|416_aa MELALRRSPVPRWLLLLPLLLGLNAGAVIDWPTEEGKEVWDYVTVRKDAYMFWWLYYATN SCKNFSELPLVMWLQGGPGGSSTGFGNFEEIGPLDSDLKPRKTTWLQAASLLFVDNPVGT GFSYVNGSGAYAKDLAMVASDMMVLLKTFFSCHKEFQTVPFYIFSESYGGKMAAGIGLEL YKAIQRGTIKCNFAGVALGDSWISPVDSVLSWGPYLYSMSLLEDKGLAEVSKVAEQVLNA VNKGLYREATELWGKAEMIIEQRHVRHLQRDALSQLMNGPIRKKLKIIPEDQSWGGQATN VFVNMEEDFMKPVISIVDELLEAGINVTVYNGQLDLIVDTMGQEAWVRKLKWPELPKFSQ LKWKALYSDPKSLETSAFVKSYKNLAFYWILKAGHMVPSDQGDMALKMMRLVTQQE >gi568815581r:56839074_57061019|GENSCAN_predicted_CDS_4|1251_bp atggagctggcactgcggcgctctcccgtcccgcggtggttgctgctgctgccgctgctg ctgggcctgaacgcaggagctgtcattgactggcccacagaggagggcaaggaagtatgg gattatgtgacggtccgcaaggatgcctacatgttctggtggctctattatgccaccaac tcctgcaagaacttctcagaactgcccctggtcatgtggcttcagggcggtccaggcggt tctagcactggatttggaaactttgaggaaattgggccccttgacagtgatctcaaacca cggaaaaccacctggctccaggctgccagtctcctatttgtggataatcccgtgggcact gggttcagttatgtgaatggtagtggtgcctatgccaaggacctggctatggtggcttca gacatgatggttctcctgaagaccttcttcagttgccacaaagaattccagacagttcca ttctacattttctcagagtcctatggaggaaaaatggcagctggcattggtctagagctt tataaggccattcagcgagggaccatcaagtgcaactttgcgggggttgccttgggtgat tcctggatctcccctgttgattcggtgctctcctggggaccttacctgtacagcatgtct cttctcgaagacaaaggtctggcagaggtgtctaaggttgcagagcaagtactgaatgcc gtaaataaggggctctacagagaggccacagagctgtgggggaaagcagaaatgatcatt gaacagcgccacgtgagacacctacaacgagatgccttaagccagctcatgaatggcccc atcagaaagaagctcaaaattattcctgaggatcaatcctggggaggccaggctaccaac gtctttgtgaacatggaggaggacttcatgaagccagtcattagcattgtggacgagttg ctggaggcagggatcaacgtgacggtgtataatggacagctggatctcatcgtagatacc atgggtcaggaggcctgggtgcggaaactgaagtggccagaactgcctaaattcagtcag ctgaagtggaaggccctgtacagtgaccctaaatctttggaaacatctgcttttgtcaag tcctacaagaaccttgctttctactggattctgaaagctggtcatatggttccttctgac caaggggacatggctctgaagatgatgagactggtgactcagcaagaatag >gi568815581r:56839074_57061019|GENSCAN_predicted_peptide_5|265_aa MHHLNVDRYGTTATLQDSPERCSAELSCSPFPRILLEHVCFPGQHSQPSMVFVLFSRNSL IAGLSLVHRCGPGIWHVSRPPLENADQHLFTLPQGYGQFAFGIFDDSFEIPTFSPGAQAD GSKDPERPWETEHQSRPLANGLDAFAQLLNQFENTGPPPADEEKIQYLPTVPVTEEHVGS GLECPVCKDDYALGEQLPRNHLFHDGCIVHRLEQHDSCPVCRKSLPGHNTATNTPAPGPT GMNCSSSSSSPSSSSPSKENATSNS >gi568815581r:56839074_57061019|GENSCAN_predicted_CDS_5|798_bp atgcatcacctgaatgtggaccgctatggcaccacggccacactccaggacagccctgaa agatgctcagctgagctcagctgctcccccttcccccggatcctcctagagcatgtctgc ttcccgggccagcacagccagccaagtatggtgtttgtcctgttttctagaaactccttg atagcaggcttaagtctggttcatcgatgtggccctggcatctggcacgtaagccggccg ccgttggagaacgcggaccagcacctgttcacgctgccgcagggctacggacagtttgct ttcggcatctttgacgacagcttcgagatccccacgttctctcctggggcgcaggctgac ggcagcaaggaccctgagagaccgtgggagacagagcatcagtcccggcccctggccaac ggcctggacgccttcgcacagctcctcaatcagtttgaaaacacgggccccccaccggca gatgaagagaaaatccagtacctccccaccgtccccgtcaccgaggagcacgtaggctcc gggctcgagtgccccgtgtgcaaggacgactacgcgctgggcgagcagctgccccgcaac cacctgttccacgatggctgcatagtgcaccggctggagcagcacgacagctgccccgtc tgccgaaaaagcctcccgggacacaacacggccacgaacacccccgccccgggcccgact gggatgaactgctcctcctcgtcgtcctccccctcctccagctcgcccagtaaagagaac gccacaagtaactcctga