GENSCAN 1.0    Date run: 5-Nov-116    Time: 21:33:50

Sequence gi568815590f:102940837_103168744 : 227908 bp : 41.82% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +    703    866  164  2  2   73   43   133 0.696   4.42
 1.02 PlyA +   2784   2789    6                              -0.45

 2.04 PlyA -   3313   3308    6                               1.05
 2.03 Term -   4765   4536  230  2  2   35   48   190 0.903   5.51
 2.02 Intr -  12890  12776  115  2  1  116   96    65 0.924   9.20
 2.01 Init -  13683  13627   57  0  0   56   76     5 0.257  -2.54
 2.00 Prom -  13737  13698   40                              -5.65

 3.00 Prom +  19518  19557   40                              -6.85
 3.01 Init +  21074  21130   57  1  0   96   94    48 0.114   7.56
 3.02 Term +  36751  36918  168  0  0  100   44   148 0.218   8.50
 3.03 PlyA +  37019  37024    6                              -0.45

 4.00 Prom +  37033  37072   40                              -7.05
 4.01 Init +  37913  38119  207  1  0   38  121   112 0.763   8.37
 4.02 Intr +  50364  50529  166  2  1   78   53    86 0.198   2.71
 4.03 Intr +  52374  52572  199  1  1   51   56   131 0.107   3.69
 4.04 Intr +  55596  55616   21  0  0  112   80    36 0.225   0.94
 4.05 Intr +  59728  59858  131  1  2  108   64   -16 0.156  -2.58
 4.06 Intr +  61932  62170  239  1  2   51   65   115 0.128   1.91
 4.07 Intr +  62739  62857  119  2  2   44   70    89 0.120   1.04
 4.08 Intr +  71675  71892  218  2  2   40   87    64 0.061  -1.28
 4.09 Intr +  75977  76299  323  0  2   40   39   184 0.361   3.35
 4.10 Term +  76362  76511  150  2  0  -28   42   194 0.588   0.13
 4.11 PlyA +  76894  76899    6                               1.05

 5.03 PlyA -  78096  78091    6                               1.05
 5.02 Term -  80576  80128  449  0  2  105   49   310 0.585  23.19
 5.01 Init -  96032  96026    7  2  1   84   90     5 0.247   1.17
 5.00 Prom -  98405  98366   40                              -5.55

 6.00 Prom +  98438  98477   40                              -8.65
 6.01 Init + 100001 100132  132  1  0   89   67   103 0.988   8.49
 6.02 Intr + 101504 101571   68  1  2   95   57    93 0.779   3.68
 6.03 Intr + 108034 108119   86  1  2   83   99    71 0.775   6.24
 6.04 Intr + 110214 110308   95  1  2   74  116    53 0.947   5.46
 6.05 Intr + 111895 111986   92  0  2   74   59   127 0.999   6.27
 6.06 Intr + 113048 113146   99  2  0   82  108   104 0.997  10.11
 6.07 Intr + 115032 115100   69  0  0   89  108    84 0.989   7.88
 6.08 Intr + 122119 122211   93  1  0   96   71    86 0.979   5.86
 6.09 Intr + 122299 122392   94  1  1   48   25   130 0.995   1.75
 6.10 Intr + 123878 123975   98  1  2   82   98    77 0.991   5.99
 6.11 Intr + 125485 125611  127  1  1   34   95    83 0.993   3.36
 6.12 Term + 127816 127911   96  0  0   92   48   109 0.971   4.29
 6.13 PlyA + 128424 128429    6                               1.05

 7.06 PlyA - 129221 129216    6                               1.05
 7.05 Term - 135466 135241  226  0  1   79   44   192 0.610   9.17
 7.04 Intr - 142748 142566  183  1  0   85  101    51 0.269   4.08
 7.03 Intr - 149936 149701  236  2  2   63   53   136 0.213   3.16
 7.02 Intr - 155854 155756   99  1  0   84   65    45 0.427   1.19
 7.01 Init - 161424 161272  153  0  0   89   31   103 0.089   4.73
 7.00 Prom - 168283 168244   40                              -3.95

 8.08 PlyA - 169581 169576    6                               1.05
 8.07 Term - 170189 170052  138  2  0   96   43   142 0.446   7.48
 8.06 Intr - 171235 171214   22  0  1  111   94     6 0.613   0.13
 8.05 Intr - 179429 179245  185  2  2   70   69   103 0.646   4.26
 8.04 Intr - 181834 181723  112  0  1   77   64    67 0.458   2.66
 8.03 Intr - 184192 184060  133  2  1   71   -6   120 0.406  -0.32
 8.02 Intr - 188830 188710  121  1  1   71  106   113 0.825  10.55
 8.01 Init - 189457 189395   63  1  0   92   39    82 0.884   4.90
 8.00 Prom - 197881 197842   40                              -4.45

 9.00 Prom + 198619 198658   40                              -4.55
 9.01 Init + 200062 200221  160  0  1   68  114   129 0.715  13.61
 9.02 Intr + 212309 212474  166  0  1   88   97    69 0.493   5.90
 9.03 Intr + 214526 214695  170  2  2   78   58    78 0.406   2.47
 9.04 Term + 214980 215203  224  1  2   82   42    96 0.197   0.50
 9.05 PlyA + 215325 215330    6                              -0.45

10.05 PlyA - 215529 215524    6                               1.05
10.04 Term - 215691 215561  131  1  2   48   41    94 0.569  -1.94
10.03 Intr - 215990 215846  145  2  1   23  115   115 0.509   6.63
10.02 Intr - 219872 219838   35  0  2   88  103    31 0.533   1.62
10.01 Init - 222555 222480   76  0  1   43  109    51 0.421   4.00

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 164414 164529  116  0  2   68   49   115 0.820   3.35

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:102940837_103168744|GENSCAN_predicted_peptide_1|54_aa
XAQEHVSGKLQVRPSQKTHTGAGGEAQRNSVGSSQQDQDSTGYTGLSITSHNRK

>gi568815590f:102940837_103168744|GENSCAN_predicted_CDS_1|165_bp
nctgcccaagaacatgtctctggaaagttgcaggtgcggccatcgcagaaaacacacaca
ggagctgggggagaggcacagagaaacagtgtaggatccagccaacaggaccaggacagc
actggctacactggcttatcgattacatcccacaatagaaagtaa

>gi568815590f:102940837_103168744|GENSCAN_predicted_peptide_2|133_aa
MNRGPEKRFLKRRHTNGQQDPILDLTFNHHVSSGSSVAVSQSFLIFDDLDNFEEFWSVHS
SRESVSAADTVMGSPRFPSSLKDLFSQLLSGLLADSPQLSALFSMASVKRVSLPKVTPPH
WSNLNPEAGQHRS

>gi568815590f:102940837_103168744|GENSCAN_predicted_CDS_2|402_bp
atgaacagaggacctgaaaagagatttctcaaaagaagacatacaaatggccaacaggat
cccatcctggatctcacatttaatcatcatgtgtcctcaggctcctctgtggcagtttct
cagtcttttcttatttttgatgaccttgacaattttgaggagttctggtcagtacattcc
agccgagagtcagtttcggcagcagacactgtgatgggctctccacggttcccttccagt
ctgaaggacttattctctcagctgttgagtgggctgctggcagacagccctcagctgtca
gccctcttcagcatggcctcagtgaagagggtctctttacctaaggttacaccccctcac
tggagcaatctgaatccagaggctggtcaacataggtcataa

>gi568815590f:102940837_103168744|GENSCAN_predicted_peptide_3|74_aa
MPPVSNTEAGDLRNCGTSKNQKSERRKKWVSPGERKRRIMVLCKELEAKKQAVFTYQPGL
SGIFASFQYGLKSI

>gi568815590f:102940837_103168744|GENSCAN_predicted_CDS_3|225_bp
atgccgccagtgtcaaatacagaagcaggagacctgagaaactgtggcacctccaagaat
cagaaatctgagcgtaggaagaagtgggttagccctggagaaaggaaaaggaggatcatg
gttctctgcaaagagctggaagctaaaaagcaggctgtgtttacttaccaaccaggactc
agtgggatctttgccagcttccaatatggccttaaaagtatatga

>gi568815590f:102940837_103168744|GENSCAN_predicted_peptide_4|590_aa
MRKEVWQKACCLRGSGPGRNFTLKQLWEIFHDAENTKDKMLGADPVLERRTIHHVREKML
SPCRKLHKKVPSQGLYLDLIWLIIKSNLILDPVCSNEVWRAQNIIHEAQSERKLITASSY
KQENNQESPSPWPVRNQAAQQEVSSRRTSKASSVLTAVPHRLHYHGSSVACQINSRIRFS
WDHEPYCELCMFLFSAEWWTRYGILTASKCHCDISMAQLVTIQPIFVGHPPHAPGSGLSP
LRLRVNIYTDFKYAFHILHHHAVIWAKRSFLTTQGSSIVNASSIKTLLKVALLPKKTRVI
HCKGHQKASDPIAQGSTYADKSQLYTITDQPLLVKSPKQFLRLLVFSGTFIPLTVLNLQE
RFHTISKDAMTKMMISISSRTDTMRTRGKELTGLTTQTTQYLFPSSSSVIKCLKSGLSFG
CEIYSATLTECLPEANKDYHFKVDNDENEHKLSLRMVSLGAGRKDGFYIVEAEVMNYEGS
PIKVILATLKRYVQSMVYLGGFEITLPVALHSVCFRAWAYEYTALNRYEGRYKAGHEEEG
DKKIKLLADENHDDDEEEEHFDEGKTEEKALVMKFIQDTPSKKCTKIKPK

>gi568815590f:102940837_103168744|GENSCAN_predicted_CDS_4|1773_bp
atgaggaaggaagtgtggcaaaaagcatgctgtctcagaggaagcggtcctggcagaaac
ttcacattgaagcaactctgggaaatttttcatgacgctgaaaacacaaaggataaaatg
ttgggagctgatccagtcttagaaaggagaacaattcatcacgtcagagaaaagatgctc
tctccatgtcgtaagttacacaagaaggtacctagccaaggactgtatctggatctgatc
tggttaatcatcaaatccaatttgatcctggaccctgtttgctcaaatgaagtctggaga
gctcaaaatataattcatgaagctcagtctgagaggaaattaatcacggcctccagttac
aaacaagagaacaatcaagaatccccaagtccatggcctgttaggaaccaggctgcacag
caggaggtgagcagcaggcgaacaagcaaagcttcatctgtacttacagctgttccccat
cgcttgcattaccatgggagttctgttgcctgtcagatcaacagccgcattagattctca
tgggaccatgaaccctattgtgaactgtgcatgtttctcttcagcgctgaatggtggaca
agatatggcattttgactgctagcaaatgtcactgtgacatttcaatggcacaattggtg
acaattcagccaatttttgttgggcaccctccacatgccccagggtctgggctcagccct
ttacgactacgcgtcaatatttatactgactttaaatatgccttccatatcctgcaccac
catgctgttatatgggcgaaaagaagtttcctcactacacaagggtcctccatcgttaat
gcctcttcaataaaaactcttctcaaggtcgctttacttccaaaaaaaactagagtcatt
cactgcaagggccatcaaaaggcatcagatcccattgctcagggcagcacttatgctgat
aagtcccagctgtatacaataacggaccagcctttattagtcaaatcacccaagcagttt
ctcaggctcttggtattcagtggaaccttcataccccttactgtcctcaatcttcaggaa
agatttcataccatttctaaggatgcaatgacaaaaatgatgatttctattagttcaagg
accgacactatgaggaccagaggaaaggaactgacaggtttaaccactcagacaacacag
tacctcttcccttcctcatcctctgtcataaaatgtttgaaatctggtctttcatttggt
tgtgaaatctattcagctacccttactgaatgcttaccagaggccaacaaagattatcac
tttaaggtggataatgatgaaaatgagcacaagttatctttaagaatggtcagtttaggg
gctggtagaaaggatggattttatattgttgaagcagaagtaatgaactatgaaggcagt
ccaattaaagtaatactagcaactttaaaaaggtatgtacagtcaatggtttatcttggg
ggctttgaaataacattgcctgtggccttacactcagtgtgtttcagggcctgggcatat
gagtacacagcacttaacaggtatgaaggaagatacaaagcaggacatgaagaggaggga
gataaaaaaataaaacttcttgctgatgaaaatcatgatgatgatgaagaggaagaacat
tttgatgaagggaaaactgaagaaaaagctctagtgatgaaatttatacaggatactcca
tccaaaaaatgcacaaagatcaaaccaaaataa

>gi568815590f:102940837_103168744|GENSCAN_predicted_peptide_5|151_aa
MAETEPRPKTVNWRRETRHSHPASSSSNATVSAPAPTRVPPRQRPELMLRPDQDSPSGES
HRSLTQEGVKRGSSSSNRYRGRQPSGRYRLSDPSLPDLSSGLTASSTQANTEQVTCQPSS
QLTAARAAPPTPAPLRSRSRCHCRALGEGRD

>gi568815590f:102940837_103168744|GENSCAN_predicted_CDS_5|456_bp
atggcagagacggaacctaggcccaagacggtgaattggcgccgagaaaccagacacagc
cacccagccagctccagctccaacgccacggtctctgcccccgcacccacccgcgtccct
cccaggcagcgaccggagctcatgcttcggcctgaccaggactccccatccggggagagc
catcgctcactgacccaggaaggtgtcaaacgaggaagtagcagcagtaacaggtaccga
ggcagacagcctagcggacgctatcggctcagcgatccatcccttcccgacctaagctcc
ggcctcacggcttcctctacccaagcgaacacagaacaagtcacgtgccagcccagcagt
cagctgaccgcggcgcgcgctgctccgcctacgcccgcgccactgcgcagccgcagccgc
tgccactgccgggccttgggagaggggcgggactag

>gi568815590f:102940837_103168744|GENSCAN_predicted_peptide_6|382_aa
MTEFWLISAPGEKTCQQTWEKLHAATSKNNNLAVTSKFNIPDLKVGTLDVLVGLSDELAK
LDAFVEGVVKKVAQYMADVLEDSKDKVQENLLANGVDLVTYITRFQWDMAKYPIKQSLKN
ISEIIAKGVTQIDNDLKSRASAYNNLKGNLQNLERKNAGSLLTRSLAEIVKKDDFVLDSE
YLVTLLVVVPKLNHNDWIKQYETLAEMVVPRSSNVLSEDQDSYLCNVTLFRKAVDDFRHK
ARENKFIVRDFQYNEEEMKADKEEMNRLSTDKKKQFGPLVRWLKVNFSEAFIAWIHVKAL
RVFVESVLRYGLPVNFQAMLLQPNKKTLKKLREVLHELYKHLDSSAAAIIDAPMDIPGLN
LSQQEYYPYVYYKIDCNLLEFK

>gi568815590f:102940837_103168744|GENSCAN_predicted_CDS_6|1149_bp
atgactgagttctggcttatatctgctcctggggagaaaacctgtcagcaaacatgggag
aaattgcatgcggcaacttcaaagaacaataatcttgctgtcacttccaagttcaatatt
cctgacttaaaggttggcacgttggatgtcttggttggcttgtcagatgaactggctaaa
ctggatgcatttgtagaaggagtggttaagaaagtagctcaatacatggctgatgtattg
gaagatagcaaagacaaagttcaagagaatctgttggctaatggagtggacttggttact
tatataacaaggttccagtgggacatggccaaatatccaatcaagcagtccctgaaaaat
atttctgaaataattgccaagggagtaactcagattgataatgacctgaaatctcgagca
tctgcatacaataacctgaaaggaaatcttcagaatttggaacgaaagaatgcaggaagt
ttgctaactagaagtctagcagaaattgtgaagaaggatgactttgttcttgattcagag
tatctcgtcacattactggtagtagttcccaagttaaaccacaacgactggattaagcag
tatgaaacactagccgaaatggtagttccaaggtctagcaatgttctttcagaggaccaa
gacagttacctgtgtaatgtcaccttgtttaggaaggcagttgatgacttcagacacaaa
gccagagaaaacaaattcattgttcgtgacttccagtataatgaagaggagatgaaagca
gataaagaagaaatgaacaggctttctactgataagaaaaaacaatttggaccacttgta
cggtggctgaaagtgaattttagtgaagcatttattgcatggattcacgtgaaagcatta
cgggttttcgttgagtctgttttaaggtatggcttgccagtgaacttccaagcaatgcta
cttcagcccaataagaaaactttgaagaaactgagagaagtattacatgaattgtataaa
catctagacagcagtgcagcagctattattgatgctcctatggatattccaggtttaaac
ctgagtcaacaagaatactacccctatgtgtactacaagattgattgcaacttgctggaa
ttcaagtga

>gi568815590f:102940837_103168744|GENSCAN_predicted_peptide_7|298_aa
MDQRREQQDFLLNLFLHKRRTKITHSHIRIDNRDAISTHSKEKGLQKPTEVCLIIQDNVH
YNYTKEQIQSCLATSQSQAESGLSLLHKQALQMSDMLMGSQEAIQVVVESSTTDNSRYFL
GSSWLPSTTMKVSQIVKIINITVIDEIYKPINEMELHVLYASGWVCVNYKLEFIEAFEEG
NGRSMCASQPYSIKLSILNLLLNPPLACSFIRLIMGWVYFNQTIGRNDSDPGMLVCPRFY
ILREHRCEVGSCGLLGGGRKGEETAGASLGTIYGLNSSSKAAPIKSTTVNDILLSENT

>gi568815590f:102940837_103168744|GENSCAN_predicted_CDS_7|897_bp
atggaccaacgtagagagcagcaggatttcctgctaaatctgttcctccacaagagaaga
acaaaaattacccacagccacatccgcatagataacagagatgccatatcaacacacagc
aaggagaaggggctacagaagccaaccgaagtgtgtttgattattcaggataatgtacac
tacaactacacgaaagaacagattcagtcttgtttagccacctcgcaaagtcaagcagaa
agtggtttgtctctcttgcataagcaggctcttcaaatgtctgatatgctaatgggcagc
caggaggccattcaagttgtagttgaaagttccactactgataattctcggtactttctt
ggcagttcttggcttccttccactaccatgaaagtctctcagattgtgaagattattaat
attactgtcattgatgaaatatataagcctatcaatgagatggaattgcatgttttatat
gcatcaggatgggtctgtgttaattataaacttgagtttattgaagcatttgaggaaggt
aatgggcggtctatgtgtgcttcacaaccttattcaattaagctctctattcttaattta
ctactaaatcctcctttggcctgtagtttcataaggcttatcatgggttgggtgtacttt
aatcagacaataggaaggaatgattcagaccctgggatgctagtctgcccacgattttac
atcctgagagagcacagatgtgaagttggctcatgtggcctgcttggaggaggtaggaaa
ggagaggagactgctggagcatccctaggaaccatctatggtctaaattcctcctccaaa
gcggcccccatcaaatcaacaacagtaaacgatattcttctttcagagaacacatag

>gi568815590f:102940837_103168744|GENSCAN_predicted_peptide_8|257_aa
MNITGEFGKSGSGGVASQPEQSTRRKDSLGKPTNPEKTPNDTSGLPKEIVKPDCPTVKLI
VEQSTTLDCPVHEPWRHHYAGMKCWSHFSDYNAQNTVIEDNGKWAPRGELLAGRLVSWSA
PLGTGPIPIPTRCSQGGEVKWDNKIQPRRMRANRKFIGIGGTLLQQLQPHPSPPCPRSKF
SRKAGLSHRFCALPEEIPQIVNPQSLFLCSRPVSTSVSAFPLPGPSQSSTPDQIARAGSA
ELHGQAQLETNHAGCQP

>gi568815590f:102940837_103168744|GENSCAN_predicted_CDS_8|774_bp
atgaacatcactggtgaatttggtaaaagtggcagtggaggggtagcttcacagccagag
caaagcacacgtaggaaagattctcttggaaaaccaaccaatccagagaaaacacctaac
gatacatcagggctgcccaaggaaatagtcaagccagattgccctactgtgaagctcata
gtggagcagagcacgaccctagactgtccagtccacgagccctggaggcatcactatgcg
ggaatgaaatgttggtcacacttcagtgactacaatgcacaaaatactgtgattgaagat
aatgggaaatgggcaccacgtggggagctgctggctggtcggctggtcagctggtcagct
cccctgggaacagggcccatcccaattcccaccaggtgtagccagggtggtgaagtcaag
tgggacaacaaaattcagcccagacggatgagagccaacaggaaatttataggcattgga
ggcacgctgctgcagcagctccagcctcacccatccccaccctgtcccaggtccaagttc
agcaggaaagcaggattgtctcataggttttgtgccctgcctgaggaaattccccagatt
gtgaatccccaaagcctcttcttgtgtagtcgcccggtctcaacctctgtctctgctttc
cctctccctgggccatctcaaagctcaacaccagaccaaatagctagagcaggatcagct
gaacttcacggccaagcacagctagaaacgaaccatgctggctgtcagccatag

>gi568815590f:102940837_103168744|GENSCAN_predicted_peptide_9|239_aa
MGCGGSRADAIEPRYYESWTRETESTWLTYTDSDAPPSAAAPDSGPEAGGLHSGNFQATL
KVSGSSWQRKNGENRYWAGCQQSGHFFHPIAVVREERRFPGTRRGKGIRVRGTVQGRGRS
GMAVVIGASADALGSSGAGMAIQQLRRLGLFTLYRPGAGCSLPLASQWRVKFRGSGVDLR
FCMFDRLPSDADAAGPEVLDQRLANFFCKGTEGKYFRLCGPRDFFCNYSTSLFLMPKQL

>gi568815590f:102940837_103168744|GENSCAN_predicted_CDS_9|720_bp
atgggctgcggcgggagccgggcggatgccatcgagccccgctactacgagagctggacc
cgggagacagaatccacctggctcacctacaccgactcggacgcgccgcccagcgccgcc
gccccggacagcggccccgaagcgggcggcctgcactcggggaactttcaagctacccta
aaagtatcagggtcctcttggcaaaggaaaaatggggagaacagatactgggcaggctgc
cagcagagtggtcacttctttcatcctatagctgtggtcagagaggagagaagatttcca
ggaacaagaagggggaaaggaattagggtgaggggcacagtccagggcagagggaggagt
ggaatggctgttgtaataggggcttcagctgatgccctggggagctctggagctgggatg
gccattcagcaattgagaagactgggcctttttaccctgtatcggccaggcgctggatgc
agtcttcccctagcaagtcagtggagagtgaaatttcgtgggtctggggtggacctgaga
ttctgcatgtttgacaggcttccaagtgatgctgatgctgctggtccagaggtcctagat
cagaggttagcaaactttttctgtaaagggacagagggtaaatatttcaggctttgtggg
ccacgtgatttcttttgcaactactcaacttctctgtttttaatgccaaagcagctatag

>gi568815590f:102940837_103168744|GENSCAN_predicted_peptide_10|128_aa
MVGNVVWNQEQIWRGAATGRITSAAACRQPIVGPCDHEHLIQKSKHHNSNPFISCPDSQP
SRHAATFQKPSFLALQPVAITELRKVTCAEGIKASQHSSEPKARRIPYEDEIDLQAPSTL
LCVMPYSS

>gi568815590f:102940837_103168744|GENSCAN_predicted_CDS_10|387_bp
atggttggaaatgtggtctggaatcaagagcagatatggcggggagcagccactgggagg
ataacgtcagcagcagcttgcagacagcctattgtgggaccttgtgatcatgaacacctg
attcagaaaagtaaacaccataacagcaacccattcatcagctgcccagactcacaacca
agcagacatgcagcgactttccaaaaaccctcatttctggcattacagcctgtggcaata
acagagttaaggaaagtgacctgtgctgaagggattaaggcatcccaacattcctcagag
cccaaggcaaggagaatcccttatgaagatgaaattgatcttcaagcaccctctacctta
ctctgcgtgatgccatattcttcctga