GENSCAN 1.0 Date run: 5-Nov-116 Time: 09:41:01 Sequence gi568815592f:138722747_138923796 : 201050 bp : 40.90% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 284 279 6 1.05 1.01 Sngl - 8076 7564 513 0 0 39 36 310 0.770 16.69 1.00 Prom - 10389 10350 40 -6.15 2.00 Prom + 13414 13453 40 -8.05 2.01 Init + 15170 15244 75 1 0 62 107 53 0.402 5.74 2.02 Intr + 19080 19284 205 2 1 87 94 68 0.189 5.15 2.03 Term + 28354 28460 107 2 2 85 48 43 0.045 -2.41 2.04 PlyA + 29410 29415 6 1.05 3.00 Prom + 33922 33961 40 -4.85 3.01 Init + 50929 51156 228 0 0 49 91 140 0.990 7.14 3.02 Intr + 53333 53464 132 1 0 67 81 92 0.672 6.32 3.03 Term + 54533 54667 135 1 0 77 46 83 0.027 0.04 3.04 PlyA + 54973 54978 6 1.05 4.00 Prom + 55427 55466 40 -5.85 4.01 Init + 57080 57239 160 1 1 50 99 72 0.016 4.54 4.02 Intr + 62481 62635 155 1 2 49 95 158 0.992 11.47 4.03 Term + 68843 68923 81 1 0 121 40 47 0.379 -0.09 4.04 PlyA + 70550 70555 6 1.05 5.07 PlyA - 71321 71316 6 1.05 5.06 Term - 73130 72955 176 0 2 83 48 153 0.407 7.74 5.05 Intr - 74770 74671 100 1 1 79 87 34 0.087 1.16 5.04 Intr - 83409 83288 122 1 2 16 56 126 0.034 1.49 5.03 Intr - 87198 87027 172 0 1 84 50 92 0.237 3.59 5.02 Intr - 98119 98040 80 2 2 56 47 99 0.293 0.95 5.01 Init - 98662 98476 187 1 1 85 50 134 0.685 8.57 5.00 Prom - 99043 99004 40 -12.03 6.00 Prom + 99832 99871 40 -8.05 6.01 Sngl + 100001 101053 1053 1 0 105 43 457 0.994 39.79 6.02 PlyA + 101109 101114 6 1.05 7.00 Prom + 101806 101845 40 -6.65 7.01 Init + 104383 104387 5 0 2 76 55 0 0.400 -4.98 7.02 Intr + 107532 107652 121 0 1 70 73 93 0.445 5.58 7.03 Intr + 110658 110843 186 2 0 53 55 84 0.120 0.56 7.04 Intr + 120233 120485 253 1 1 74 89 164 0.304 11.18 7.05 Intr + 121666 121834 169 2 1 105 103 71 0.965 8.38 7.06 Intr + 123793 123931 139 1 1 86 65 49 0.769 1.95 7.07 Intr + 126523 126688 166 0 1 65 102 164 0.779 14.11 7.08 Intr + 131280 131408 129 1 0 107 71 125 0.662 12.45 7.09 Term + 134934 134986 53 1 2 106 36 31 0.170 -3.69 7.10 PlyA + 135727 135732 6 1.05 8.00 Prom + 136398 136437 40 -5.35 8.01 Init + 142275 142432 158 2 2 55 87 192 0.968 15.27 8.02 Intr + 145334 145460 127 2 1 44 84 121 0.969 7.06 8.03 Intr + 147486 147572 87 2 0 9 98 91 0.643 1.45 8.04 Intr + 153726 153812 87 2 0 90 107 53 0.977 6.65 8.05 Intr + 158211 158425 215 2 2 88 97 99 0.337 7.39 8.06 Intr + 162928 163084 157 1 1 47 77 162 0.968 10.09 8.07 Intr + 166197 166285 89 2 2 87 92 18 0.461 -0.05 8.08 Intr + 175812 175925 114 0 0 58 99 90 0.826 5.74 8.09 Intr + 178202 178374 173 2 2 77 31 177 0.979 9.56 8.10 Term + 179754 179881 128 1 2 69 53 127 0.985 4.76 8.11 PlyA + 181363 181368 6 1.05 9.09 PlyA - 181569 181564 6 1.05 9.08 Term - 182386 182318 69 1 0 95 46 43 0.788 -2.24 9.07 Intr - 184854 184749 106 2 1 90 85 80 0.958 7.20 9.06 Intr - 186070 185922 149 1 2 48 89 159 0.996 10.11 9.05 Intr - 188625 188530 96 0 0 50 56 134 0.972 5.69 9.04 Intr - 190204 190019 186 1 0 37 77 289 0.899 21.66 9.03 Intr - 194142 194030 113 1 2 81 -8 69 0.073 -4.22 9.02 Intr - 197570 197469 102 0 0 58 97 60 0.859 3.13 9.01 Intr - 198378 198291 88 0 1 103 91 82 0.707 8.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 57076 57239 164 1 2 87 99 76 0.974 7.30 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:138722747_138923796|GENSCAN_predicted_peptide_1|170_aa MYGNTWMPRQKFAAGALPSWRTSARTVQKGNVGSEPPHRVPTGAPPSGALRRGPPSSRPQ NGRSTNSLHCAPGKATDTQHQPVKATGGREVVPCKATGVELPKTMGTHLLHQRDLDVRPG VKGDHFGTLKFDCHWISDLHGPCNPFVLSNFYHLEWLYLPNTSTPIVSRK >gi568815592f:138722747_138923796|GENSCAN_predicted_CDS_1|513_bp atgtatggaaacacctggatgcccaggcagaagtttgctgcaggggcgttgccctcatgg agaacctctgctaggacagtgcagaagggaaatgtggggtcagagcccccacatagagtc cctactggggcaccacctagtggagccttgagaagagggccaccgtcctctagaccccag aatggtagatccaccaacagcttgcactgtgcgcctggaaaagccacagacactcaacac cagcctgtgaaagcaactggaggaagggaggttgtaccctgcaaagccacaggggtggag ctgcccaagaccatgggaacccacctcttgcatcagcgtgatttggatgtgagacctgga gtcaaaggagatcattttggaactttaaaatttgattgccactggatttcggatttgcac gggccctgtaacccctttgttttgtccaatttctaccatttggaatggctgtatttaccc aataccagtacacccattgtatctagaaagtaa >gi568815592f:138722747_138923796|GENSCAN_predicted_peptide_2|128_aa MKRKRSSFPVAVEKYMLKNQAQDSTTSRQPLRNIPDTQICLTPSLLALGGLHSPTFSFPK SHPCMGGFVGQSTVLQGSALRPSPPRSLPFLLPPLYKLLVPLFSFTLKYSELCIWIQAQC KKPKKDFL >gi568815592f:138722747_138923796|GENSCAN_predicted_CDS_2|387_bp atgaaaagaaaacgatcctcatttcctgtagctgtagagaaatatatgctgaaaaatcag gcccaggattcaacgacctctcgccagcccctaaggaatattccagatacacagatctgt ctgacaccctccttactggcacttggaggtcttcacagtcccacattttcctttcccaaa tctcacccatgcatgggaggcttcgtgggtcagagcacagttttgcaaggttcagctcta agaccatctcctccaaggagccttccctttctcctcccacctctttataaactactggtg ccattgttcagcttcactttaaagtattcagagctctgcatttggattcaggcacaatgt aagaaacccaagaaggattttctttga >gi568815592f:138722747_138923796|GENSCAN_predicted_peptide_3|164_aa MPRAEPRATLGEQEKAGLPLGAWRLYLLRHFRKQTELRRSGSRDVTGALLVAAAVASEAV GSLRVAEGGPNTLLLQVLRSWPWCNKELKTMEERKVKRRSPKSFSAHCTQVVNAKKNAIP KGHNQRQEKIWKKYSNEYQMINSSFEDFLRINYQNGVFVMEEFN >gi568815592f:138722747_138923796|GENSCAN_predicted_CDS_3|495_bp atgccgagggcggagccaagagcgacactgggggagcaggaaaaggcggggcttccgctt ggggcatggaggctgtacctcttacgtcacttccgtaaacaaacggagctgcggaggagc gggtcccgggatgtgaccggggctctgcttgtggctgcggcggtggcttctgaggctgtc gggtctttgcgggttgcggaagggggccccaatacccttcttcttcaggtcttaagaagc tggccgtggtgcaataaggaacttaaaacaatggaagagcggaaagtgaagaggaggagt cctaagtcttttagtgcccactgtactcaggttgtcaatgccaaaaaaaatgccattcca aagggacataaccaaagacaagagaagatttggaagaagtattccaatgagtaccaaatg ataaacagcagttttgaggatttccttcgaataaattatcagaatggggtttttgtcatg gaagagtttaattag >gi568815592f:138722747_138923796|GENSCAN_predicted_peptide_4|131_aa MKEKTKPQGGEGKGAQSTPIQHSFLTDVSDVQEMERGLLSLLNDFHSGKLQAFGNECSIE QMEHVRGMQEKLARLNLELYGELEELPEDKRKTASDSNLDRLLSDAGFLDAQETHHVHSH HNAFVQVVPPT >gi568815592f:138722747_138923796|GENSCAN_predicted_CDS_4|396_bp atgaaagaaaagaccaaacctcagggtggagagggcaaaggcgctcagtcaactccgatc cagcactccttcctcactgatgtctcagatgttcaggagatggagagagggctgctcagt cttttgaatgatttccactctggaaaacttcaagcatttggaaatgaatgttccattgaa cagatggaacatgttcggggaatgcaggagaaattagctcgcttgaatttggagctctat ggggagttagaggaacttcctgaggataagagaaaaacagccagtgactccaatctggat aggcttctgtcagatgctggttttctggatgcccaagaaacacatcatgttcattctcat cataatgcgtttgttcaggttgtacctcctacctag >gi568815592f:138722747_138923796|GENSCAN_predicted_peptide_5|278_aa MATEAEQRRRKKEAIGFTTEALQEASAAGRPISGVFNDVRTNYFLHYSHQVRAAWSETHS KAKGSSIADPNANADIKFKALNVCIMVLEVFHNNFMLSDWMVMPHNIECVAVVMETSYSW NRFLYLLNVQGRNNTTLTQNLPENRKSLQIERSYEQGASFATGPDGDHKRLGFKPHTIIG RPFGGVLVICYTAGEMQVSLVLLHYSRQSGLEIKTLLLGRGSLRWEKASVGSRGSRHVSE ALRGAQQRCQTQGRGTPRRDLDTEANVCLISNKASDVN >gi568815592f:138722747_138923796|GENSCAN_predicted_CDS_5|837_bp atggctacagaggcagaacagaggcgcaggaagaaagaagccattggtttcactacggaa gcactgcaagaagcttcagctgcaggaagacccatatctggagttttcaatgatgtgaga acaaactatttccttcattattcacatcaagttcgggctgcctggagtgaaacacattct aaagccaaaggaagcagcatagcagatcctaatgctaatgccgacatcaaattcaaagca ctgaatgtgtgtatcatggtcttagaagtcttccacaacaatttcatgctgtcagactgg atggtaatgccacataatatagaatgtgttgctgttgttatggaaacatcctactcctgg aaccgatttctgtatctactaaacgttcaaggaagaaataacactacacttacacaaaat cttccagagaatagaaaaagtctccagattgagaggagttatgaacaaggagcctcattt gcaactggacctgacggagatcacaagagactgggcttcaagcctcataccataattgga agaccctttggtggtgtcttggttatctgttatacagctggagaaatgcaagttagtctg gtccttctccactacagtagacagagtggcttagaaattaaaacactgctcctcggcaga gggagcttgcggtgggagaaagcttccgtggggtcccgtgggtcgcggcacgtctcggaa gccctgcgaggtgcccagcagaggtgccagacccaaggccgtgggaccccaagaagagac ttggacactgaagccaacgtttgcttgatctcaaacaaagccagcgatgttaattaa >gi568815592f:138722747_138923796|GENSCAN_predicted_peptide_6|350_aa MALEQNQSTDYYYEENEMNGTYDYSQYELICIKEDVREFAKVFLPVFLTIAFVIGLAGNS MVVAIYAYYKKQRTKTDVYILNLAVADLLLLFTLPFWAVNAVHGWVLGKIMCKITSALYT LNFVSGMQFLACISIDRYVAVTNVPSQSGVGKPCWIICFCVWMAAILLSIPQLVFYTVND NARCIPIFPRYLGTSMKALIQMLEICIGFVVPFLIMGVCYFITARTLMKMPNIKISRPLK VLLTVVIVFIVTQLPYNIVKFCRAIDIIYSLITSCNMSKRMDIAIQVTESIALFHSCLNP ILYVFMGASFKNYVMKVAKKYGSWRRQRQSVEEFPFDSEGPTEPTSTFSI >gi568815592f:138722747_138923796|GENSCAN_predicted_CDS_6|1053_bp atggctttggaacagaaccagtcaacagattattattatgaggaaaatgaaatgaatggc acttatgactacagtcaatatgaattgatctgtatcaaagaagatgtcagagaatttgca aaagttttcctccctgtattcctcacaatagctttcgtcattggacttgcaggcaattcc atggtagtggcaatttatgcctattacaagaaacagagaaccaaaacagatgtgtacatc ctgaatttggctgtagcagatttactccttctattcactctgcctttttgggctgttaat gcagttcatgggtgggttttagggaaaataatgtgcaaaataacttcagccttgtacaca ctaaactttgtctctggaatgcagtttctggcttgcatcagcatagacagatatgtggca gtaactaatgtccccagccaatcaggagtgggaaaaccatgctggatcatctgtttctgt gtctggatggctgccatcttgctgagcataccccagctggttttttatacagtaaatgac aatgctaggtgcattcccattttcccccgctacctaggaacatcaatgaaagcattgatt caaatgctagagatctgcattggatttgtagtaccctttcttattatgggggtgtgctac tttatcacggcaaggacactcatgaagatgccaaacattaaaatatctcgacccctaaaa gttctgctcacagtcgttatagttttcattgtcactcaactgccttataacattgtcaag ttctgccgagccatagacatcatctactccctgatcaccagctgcaacatgagcaaacgc atggacatcgccatccaagtcacagaaagcattgcactctttcacagctgcctcaaccca atcctttatgtttttatgggagcatctttcaaaaactacgttatgaaagtggccaagaaa tatgggtcctggagaagacagagacaaagtgtggaggagtttccttttgattctgagggt cctacagagccaaccagtacttttagcatttaa >gi568815592f:138722747_138923796|GENSCAN_predicted_peptide_7|406_aa MRGGQAFSKRPDSNNLRPCGPCDPCFNDLALPMQYEGHHRQYEKPLEDELLDPKAHLYGH RYLSLTNLSSFSCNSGVAQMPMVPLALVSSPFGPIKTCTRGSPEDCLWMPKCVKFGWFLP YTPTDNEYGAWKRHYIACVSHLDWLTPREAAATYGTLNEPKTEDEELLERQREKCLRKRI WEKIALRKKELFKVRPPWVSGTCCSSVLKPRCQPRLSQTVRERVGLHEALEKQLVLTSLE TLPKRSNISGSHSYPLLSKKNWHGVHKNDDRSSYALRPHFMLISSRIPAYEMVMESVKAG VVSVVYEHSVTLESLLYLIEKALDGQKAQSIGIFSDGDSREINLLQGYKIGVKNLLRPEV RDFWEKLGSYVATEEEGGHVDFFVPLGASVPSSQSPLQALLSLQDP >gi568815592f:138722747_138923796|GENSCAN_predicted_CDS_7|1221_bp atgaggggtggacaagctttttctaaacggccagatagcaacaatcttaggccttgtggg ccctgtgatccctgcttcaacgacttagctttaccgatgcagtatgaaggccaccacagg caatatgaaaaacccctagaagacgaattgttggatccaaaagcacatttgtacggccat cggtatttgtcacttaccaacttgtcttccttttcatgcaactcaggagttgcacagatg cctatggttccccttgcccttgtgtcatctccttttggccccattaaaacatgcaccaga ggatcacctgaggattgcttatggatgcccaaatgcgttaagttcggatggtttctgccc tatactccaacagataatgagtatggtgcttggaagcgccattacattgcttgtgtgtcc cacttagactggctgacacctagggaggctgctgctacttatgggacgctgaatgaaccc aaaacagaagatgaggaactactggagagacaaagagaaaagtgcctgaggaaaagaatt tgggagaaaattgcactacgtaagaaggagttattcaaagttcgacccccttgggtgagt ggaacttgctgctctagcgtgctaaagcccagatgccaaccacgcctctcccagactgta agggagcgagtgggattacatgaagctttggagaaacagcttgttttgacatcgttagaa accttgcccaagcgaagcaatatttctggaagccattcctaccctttattatcaaagaaa aattggcatggagttcataaaaatgatgacagatcttcatatgctctccggccacacttc atgttaatatcatcccggattcctgcgtatgagatggtgatggagagtgtgaaggctggt gttgtttctgtggtatatgaacacagcgtaaccttggaaagccttctgtatcttatagaa aaagctctggatgggcagaaggcacagagcatcggaatatttagcgatggagacagcaga gaaatcaatttactccaaggctataaaattggtgttaaaaatttactgaggcctgaagtg agagatttctgggagaaattaggaagctatgtggccactgaagaagaagggggtcacgtg gacttcttcgtgccccttggagcatcagtgccctcttctcaatctcctttgcaggctctt ctttctctgcaagatccttaa >gi568815592f:138722747_138923796|GENSCAN_predicted_peptide_8|444_aa MGKGPLFHLLLRIEATDVVQLHRLPRRNLENSKEAAVSFLQGTAEEHQWQDDRACGLVFT GQFMFDTMGMTNILNNQDTAQALADGLMELSKEDSKCGKKIKDVEGNVIPTKCDPKTTFS LFMKERNVVEDNSWDTKSRLSKNDLNFEALINLERILQKDSAEKRARVVRELLQSERKYV QILEIVRDVYVAPLKAALSSNRAILSAANIQIIFCDILQILSLNSLPELLLYPSRRFEEY LNLLYAVRLHTPAEHVDRGDLTTAIDQIKKYKGYIDQTLSEVNRYLIRVQDVAQLHCCDE EISFSLSVACRSVDSASPESLPEMQNSQATLALLNQNLHFNKITRLYEHIHDLSLFLFND ALLVSSRGTSHTPFERTSKTTYQFIASVALHRLLIENIPDSKYVKNAFILQGPKYKWICA TEIEDDKFLWLSVLRNAIKSSMEK >gi568815592f:138722747_138923796|GENSCAN_predicted_CDS_8|1335_bp atggggaaaggccccctcttccatctacttctgcgaatcgaagctacagacgtggtccag cttcacagacttcctagaagaaaccttgaaaacagtaaggaagcagctgtatcctttctt caaggaactgcagaagagcatcagtggcaggatgatagggcatgtggccttgtatttaca gggcagtttatgtttgacaccatgggtatgaccaacattctaaacaaccaagatactgcg caagctctggcagatggattgatggagttgtcaaaagaagattctaaatgtggcaaaaag atcaaagatgttgaaggaaatgttattccaacaaaatgtgacccaaagacaacattcagt ttattcatgaaggaaagaaatgttgtagaagacaattcttgggacacaaagtccaggctc agcaaaaatgatttaaattttgaagcactgattaatctggagagaatactccagaaggac tcagcagaaaagcgagctagagttgtcagagaactcttacagagtgagagaaaatacgtg cagatactggaaattgtgagagatgtttatgtcgcaccactgaaagcagcattgtcatca aacagagcgattctgagtgctgccaatatccagatcattttctgtgacattctacagatt ttaagtctcaacagcctgccagagctgctgctgtacccatcccgaagatttgaagaatac cttaatcttctctacgctgtcaggcttcatacccctgcagagcatgttgaccgtggggac ttgaccactgcaattgaccaaatcaaaaaatataaaggttatatagatcagactctatca gaagtaaacagatatctgattagggtacaagatgtagcccaacttcattgctgtgatgaa gaaataagtttctctttaagtgtggcctgcagatctgtggactcagcatcaccagagagc ttgccagaaatgcagaattctcaggccaccctagctctactcaatcagaatttgcatttt aacaaaatcaccaggctctatgaacacatccatgatctcagccttttcctcttcaatgat gccctgctcgtttctagtcggggcacatctcacactccatttgagaggacttcaaaaaca acctaccagttcattgcatcagtggcccttcatcggttactcatagaaaatattccagat tccaagtatgtcaagaatgcatttattcttcagggtccaaaatataaatggatttgtgct acagaaatagaggatgataagttcctatggctgtcagtacttcgaaatgcaatcaaaagc agtatggagaagtga >gi568815592f:138722747_138923796|GENSCAN_predicted_peptide_9|302_aa DTAIVHPVPIRMTPSKIHMQEMELKRTGSDHTNPTSPLLVKPSDLLEENKINSSVKFASG NTVAFSVILDFTLLFIPIYADQITRGLGDSFQNMKDFGSKEAPGPAVHRPVDADGLITHT STSPQQIPEQPNFADFSQFEVFAASNVNDEQDDEAEKHPEVLPAEKASDPASSLRVAKTD SKTEEKTAASAPANVSKGTTPLAPPPKPVRRRLKSEDELRPEVDEHTQKTGVLAAVLASQ PSIPRSVGKDKKAIQASIRRNKETNTVLARLNSELQQQLKDVLEERISLEVQLEQLRPFS HL >gi568815592f:138722747_138923796|GENSCAN_predicted_CDS_9|909_bp gatactgctattgttcatccagttcccattcgtatgactccaagcaaaatccacatgcag gaaatggaacttaaaagaactggcagcgatcatacaaatcccactagcccattacttgtg aaaccatctgaccttttagaagaaaataagataaattcatcggtgaaattcgcttctggt aatactgtagcattctcagtcatcttggattttactctgctgtttattcccatatatgca gatcagattaccagaggccttggagactcttttcagaatatgaaagactttggtagcaaa gaagctcctggtcctgctgtgcatcgcccagtggatgccgatggcctcataactcacact agtacctcacctcagcagataccagagcaaccaaattttgcagatttcagtcagtttgaa gtatttgctgcatcaaatgtaaacgacgaacaagatgatgaagccgagaaacatccagaa gtcctgccggctgaaaaagcttctgatcctgcaagttctcttcgagttgccaaaacagat agtaaaactgaagaaaagacagctgctagtgctcctgccaatgtgagcaaaggcacaaca ccacttgctccaccacctaaacctgttcgaagaagattaaaatcagaagatgaattaagg ccagaagttgatgaacatacacaaaagacgggtgtcttagctgctgttcttgcatcacaa ccttctattcccagatctgttgggaaagataagaaagctattcaggcatcaattagacgt aataaggaaaccaacaccgttttggccagattgaatagcgaattgcagcaacaattaaag gatgttcttgaggagagaatttccctggaagttcaactggaacaacttcgaccattctct cacctataa