GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:39:54 Sequence gi568815597r:149713275_149840749 : 127475 bp : 42.25% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 238 399 162 1 0 40 81 132 0.286 5.87 1.02 Term + 839 965 127 2 1 58 50 84 0.472 -1.63 1.03 PlyA + 1195 1200 6 1.05 2.08 PlyA - 4280 4275 6 1.05 2.07 Term - 6838 6762 77 2 2 127 44 53 0.991 1.82 2.06 Intr - 7013 6918 96 0 0 113 101 15 0.948 4.36 2.05 Intr - 8272 8138 135 2 0 74 70 183 0.909 14.72 2.04 Intr - 13796 13609 188 1 2 78 70 126 0.941 8.21 2.03 Intr - 15007 14956 52 2 1 81 93 47 0.997 1.55 2.02 Intr - 16339 16104 236 0 2 108 111 212 0.871 21.91 2.01 Init - 23819 23773 47 2 2 35 121 23 0.443 0.51 2.00 Prom - 25738 25699 40 -3.75 3.00 Prom + 25743 25782 40 -10.94 3.01 Init + 26121 26169 49 2 1 86 58 64 0.669 2.36 3.02 Term + 33697 34052 356 2 2 16 41 292 0.541 10.97 3.03 PlyA + 35548 35553 6 1.05 4.00 Prom + 44126 44165 40 -3.75 4.01 Init + 52428 52529 102 2 0 74 43 98 0.583 4.19 4.02 Intr + 62816 62979 164 1 2 87 33 169 0.044 9.15 4.03 Intr + 70832 70983 152 2 2 -2 76 98 0.033 -1.51 4.04 Intr + 75092 75343 252 0 0 60 67 189 0.578 10.48 4.05 Intr + 76236 76568 333 1 0 3 53 178 0.392 0.42 4.06 Intr + 76780 77064 285 2 0 69 97 140 0.909 9.39 4.07 Term + 77963 78243 281 0 2 119 43 199 0.294 13.12 4.08 PlyA + 78376 78381 6 1.05 5.04 PlyA - 78528 78523 6 1.05 5.03 Term - 80189 80058 132 2 0 82 49 92 0.003 1.81 5.02 Intr - 80813 80696 118 1 1 97 30 80 0.004 2.65 5.01 Init - 99049 98673 377 1 2 90 63 712 0.687 65.35 5.00 Prom - 99178 99139 40 -12.03 6.02 PlyA - 99445 99440 6 1.05 6.01 Sngl - 100407 99997 411 0 0 97 48 653 0.999 58.24 6.00 Prom - 105579 105540 40 -5.95 7.00 Prom + 107102 107141 40 -6.75 7.01 Init + 118566 118994 429 2 0 56 -53 297 0.743 8.70 7.02 Intr + 119360 119703 344 1 2 45 29 670 0.973 50.10 7.03 Intr + 121000 121309 310 1 1 -13 8 296 0.015 6.89 7.04 Intr + 126640 126978 339 0 0 59 20 319 0.856 16.94 7.05 Intr + 127172 127411 240 1 0 90 26 208 0.736 11.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 121000 121333 334 1 1 -13 42 341 0.950 12.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:149713275_149840749|GENSCAN_predicted_peptide_1|96_aa XPPRDEENTGMAPIAEGGSIDTQEQEWKSRKSRDHLSPTPYMAADIAVVLFSRTQFFHWT PAYVTALTLKHHLLDCILNWTTEFKNPATKGLSASP >gi568815597r:149713275_149840749|GENSCAN_predicted_CDS_1|291_bp nnccctcctagggacgaagaaaacactggcatggcaccaatcgcagaagggggcagcatt gacacccaggaacaggagtggaaaagcaggaaaagcagggatcatctctctcctaccccg tatatggctgcagacatagcagtggttcttttctccaggacccagtttttccactggaca ccagcctacgtaacagccctgactcttaagcaccatctactggactgcatcctaaactgg accactgaattcaaaaaccctgctaccaaagggcttagtgctagtccatga >gi568815597r:149713275_149840749|GENSCAN_predicted_peptide_2|276_aa MSPGSHSHRDDGTLERKYHSLIQDQAQELTHLRQKMKLGRVASALLIQHVKNTLKTFEEL LQSNNIDHYMEQHYCEQLAKGSQLAESLARKFSTDDCTSKKNQVGQVSSTLSILRKMHNM SKVTEVLETKWDAQSQTQPQIWCSNHTRSTPHHSLSSTSPQLDKEEVHPSVTAVIGADLL EKNLAEIQNLRQRLEESISLNDRLRERLEHVLSNGDQGKDNQRVTSCQLESLVSLFLSFP LSPGYGIGEKKCTAQSTVAPHSYTQSHSSGCGEDIL >gi568815597r:149713275_149840749|GENSCAN_predicted_CDS_2|831_bp atgagtcctggctctcactctcatagagatgatgggaccttggaaaggaagtatcattcc ttgattcaggatcaggctcaagagttaacccacctacggcagaagatgaagcttgggaga gtggcctctgctcttctcatccagcatgtcaagaacacactaaagacctttgaggagcta ctccagagcaataacattgaccactatatggagcagcactactgcgagcagctggccaaa ggaagccagctggcagagagccttgccagaaaattcagcacagatgactgtacaagtaag aagaatcaagtaggacaggtgtcttcgactctcagtatcttgaggaagatgcataatatg agtaaagtgacagaagtcctagagaccaagtgggatgcccagtcccagactcagccccag atctggtgcagcaaccacacccggtctaccccacatcactccctgagcagcacgtctcca cagcttgacaaggaggaagtgcatccttcagtgactgcagtcataggggctgacctgctg gaaaagaatcttgctgagatacagaacctgcgccagcgcctggaggagtccatcagtctc aatgaccgcctgagggagaggctggagcatgtgcttagcaatggtgaccaaggaaaagat aatcagagagtcactagctgtcaactagaaagcttggtgtctctcttcctgtctttccca ctttcacctgggtacggaataggagagaagaaatgtactgcacagtccactgtagcccct cattcatatactcagagtcactcttctggctgtggcgaggacatcctgtga >gi568815597r:149713275_149840749|GENSCAN_predicted_peptide_3|134_aa MGLRYFGQAGLELLTSATGGLKQGARGRLPFTYRSLPALAPSEFGLTRDVAGARRNCKLT ARVPAHPHLRLENSWPVEYPVPRRASLQVTGKRCRVAYPAGEAARRGPGAATGLREDRPR VADFDLQGQSLTAA >gi568815597r:149713275_149840749|GENSCAN_predicted_CDS_3|405_bp atggggcttcgctattttggccaggctggtctcgaactcctgacctcagcgactggagga ctgaaacagggcgcaagagggcggctgcccttcacctaccggtccctccctgccttggcg ccttcggagttcggactgacccgggatgttgctggagcccgcaggaactgcaaactcacc gcgcgcgtccctgcacacccgcacctgcgcctcgagaactcctggccagttgaatacccg gtcccccgccgagcttcgctgcaggtcacaggaaagaggtgcagggttgcgtacccggcg ggagaagccgcgcggagaggaccaggtgctgctactggcctccgagaagacaggcctaga gtagctgattttgacctgcagggccaaagtctaacggcggcctaa >gi568815597r:149713275_149840749|GENSCAN_predicted_peptide_4|522_aa MTEGDANTSLFTWQQQGEMQSKSGEKSLIKPSDLDSNCNAKDFCLSQTKELVWRLPVAME TRTKRKKAVGFIEQSDSTKLFSVEGVSSGSTQWFLNGTATQTSTPSYRITSASVNDSGEY RCQRGLSGRSDPIQLEIHRGWLLLQVSSRVFTEGEPLALRCHAWKDKLVYNVLYYRNGKA FKFFHWNSNLTILKTNISHNGTYHCSGMGKHRYTSAGISVTVKDDTVIQGRMDLVEQGFP TLTGPWPVRNWAAQQEVSFFMSQHYGLSSSSCPISGNIRFSQKHEPYCELCVPDDLRWNS YLPKLSPLHPPAVEKLPSTKSIPSAKKVGDHWYRELFPAPVLNASVTSPLLEGNLVTLSC ETKLLLQRPGLQLYFSFYMGSKTLRGRNTSSEYQILTARREDSGLYWCEAATEDGNVLKR SPELELQVLGLQLPTPVWFHVLFYLAVGIMFLVNTVLWVTIRKELKRKKKWDLEISLDSG HEKKVISSLQEDRHLEEELKCQEQKEEQLQEGVHRKEPQGAT >gi568815597r:149713275_149840749|GENSCAN_predicted_CDS_4|1569_bp atgacagaaggagatgcaaatacatccctcttcacatggcagcagcaaggagaaatgcag agcaaaagcggggaaaagtcccttataaaaccatcagatctcgactccaattgtaacgct aaagatttttgccttagccagaccaaagaattggtgtggcggctgcccgtggcgatggaa acacggaccaagagaaaaaaggctgtaggctttattgagcagagtgacagtacaaagctt ttcagcgtggaaggggtttcgagcggctctacacagtggtttctcaatggcacagccact cagacctcgacccccagctacagaatcacctctgccagtgtcaatgacagtggtgaatac aggtgccagagaggtctctcagggcgaagtgaccccatacagctggaaatccacagaggc tggctactactgcaggtctccagcagagtcttcacggaaggagaacctctggccttgagg tgtcatgcgtggaaggataagctggtgtacaatgtgctttactatcgaaatggcaaagcc tttaagtttttccactggaattctaacctcaccattctgaaaaccaacataagtcacaat ggcacctaccattgctcaggcatgggaaagcatcgctacacatcagcaggaatatctgtc actgtgaaagatgacacggtcattcagggaaggatggaccttgtagagcaggggttccca acccttactggtccgtggcctgttaggaactgggctgcacagcaggaggtgagcttcttc atgagccagcattacggcctgagctcctcctcctgtccaatcagtggcaacattagattc tcacagaaacatgaaccctattgtgaattgtgcgtgcctgatgatctgaggtggaacagt taccttccaaaactgtccccacttcaccccccggctgtggaaaagttgccttccacaaaa tccatccctagtgccaaaaaggttggggaccactggtatagagagctatttccagctcca gtgctgaatgcatctgtgacatccccactcctggaggggaatctggtcaccctgagctgt gaaacaaagttgctcttgcagaggcctggtttgcagctttacttctccttctacatgggc agcaagaccctgcgaggcaggaacacatcctctgaataccaaatactaactgctagaaga gaagactctgggttatactggtgcgaggctgccacagaggatggaaatgtccttaagcgc agccctgagttggagcttcaagtgcttggcctccagttaccaactcctgtctggtttcat gtccttttctatctggcagtgggaataatgtttttagtgaacactgttctctgggtgaca atacgtaaagaactgaaaagaaagaaaaagtgggatttagaaatctctttggattctggt catgagaagaaggtaatttccagccttcaagaagacagacatttagaagaagagctgaaa tgtcaggaacaaaaagaagaacagctgcaggaaggggtgcaccggaaggagccccagggg gccacgtag >gi568815597r:149713275_149840749|GENSCAN_predicted_peptide_5|208_aa MPDPAKSAPAPKKGSKKAVTKVQKKDGKKRKRSRKESYSVYVYKVLKQVHPDTGISSKAM GIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT KYTSSNPRNLSPTKPGGSEDRQPPPSQLSAIPPFCLVLRAGIAGQDGPWALVGEARRFPS PCGVAPAEPGAFLPAGLLIPPSQLPSCF >gi568815597r:149713275_149840749|GENSCAN_predicted_CDS_5|627_bp atgccggatccagcgaaatccgctcctgctcccaagaagggctccaaaaaggctgttacg aaagtgcagaagaaggacggcaagaagcgcaagcgcagccgcaaggagagctactccgtt tacgtgtacaaggtgctgaagcaggtccaccccgacaccggcatctcgtccaaggccatg ggcatcatgaactccttcgtcaacgacatcttcgagcgcatcgcgggagaggcgtcccgc ctggcgcactacaacaagcgctccaccatcacatcccgcgagatccagacggccgtgcgc ctgctgctgcccggcgagctggccaagcacgccgtgtccgagggcaccaaggcggtcacc aagtacaccagctcgaacccgaggaatctgtctccaactaagcctggtggcagtgaggac cgtcagccccctcccagccagctgtcagctatacctccattctgtctggttctcagggct ggaatcgctgggcaggatgggccgtgggcccttgtgggtgaagctcggcggttcccgagt ccatgtggggtggcccctgcggagcctggagccttcttgcccgctggcttgctgatcccg ccgagccagctcccttcctgcttttga >gi568815597r:149713275_149840749|GENSCAN_predicted_peptide_6|136_aa MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRPGTVALREIRRYQKSTE LLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEASEAYLVGLFEDTNLCAIHAKRVTI MPKDIQLARRIRGERA >gi568815597r:149713275_149840749|GENSCAN_predicted_CDS_6|411_bp atggcccgtactaagcagactgcccgcaagtcgaccggcggcaaggccccgaggaagcag ctggctaccaaagcggcccgcaagagcgcgccggccacgggcggggtgaagaagccgcac cgctaccggcccggcaccgtggctctgcgggagatccggcgctaccagaagtctacggag ctgctgatccgcaagctgcccttccagcggctggtacgcgagatcgcgcaggactttaag acggacctgcgcttccagagctcggccgtgatggcgctgcaggaggccagcgaggcctac ctggtggggctgttcgaagacacgaacctgtgcgccatccatgccaagcgcgtgaccatc atgcccaaggacatccagttggcccgccgcatccgcggggagcgggcctaa >gi568815597r:149713275_149840749|GENSCAN_predicted_peptide_7|554_aa MCIVMCKYYATFYERLEQPGFWYRRGPWTNPLSVLPRENCFVSSSTALTDSVLGFADVHE KAWQHAETNFPRARILLCELKYSGSVQQNRAWSQELWRTCSLPSSLGAQIPGAAALEVAS RVSTSSSERRFLKLYISTALNPLGKTVLALTEAVYRAPAVMSGRGKGGKGLGKGGAKRHR KVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRK TVTAMDVVYALKRQGRTLNRVASKALDASLAFQKLTSRVASKAVDARLALQELTITTKKT NKCIIYRRHKSILYTFKNKQSSSKRQYIFNLLRNVDNNYYKNPNATEVTGCWETGENLAM DDLNSGTSVCPHVRRKQRKGTTESGQKIQTYHAASKRTECLHWNSESPSVDMDRERLDKA QIHKIPPTPRAPGNFGLGAGYGGYHCASGGGAGTERQPQRSSPHVCPQNPGEAEGPGHAR GEPDRQSPRAGAQESVRSPASADRPIGRSRQSSRRHRSRGTRPRFVGSLSGSAFPALMPE EHGGQAVQLLPSAD >gi568815597r:149713275_149840749|GENSCAN_predicted_CDS_7|1662_bp atgtgcatagttatgtgcaaatactacgccactttctatgagagacttgagcaacctgga ttttggtatcggcgggggccctggaccaatcccctctcagttctaccgagggagaactgt tttgtttcttccagcacggctttgaccgacagtgtgttgggattcgctgacgtccatgag aaagcttggcagcatgctgagaccaattttcccagggccagaattctcctgtgtgagcta aaatacagtggctcggtccaacaaaacagagcctggagccaggaattatggcgaacctgc tccctcccgtcctcccttggcgcacagatccctggcgccgccgctcttgaggtcgcctct cgcgtgtcgacctcatcgtcggaacggcgcttcctgaagctttatataagcacggctctg aatccgctcgggaagacggtgctcgccttgacagaagctgtctatcgggctccagcggtc atgtccggcagaggaaagggcggaaaaggcttaggcaaagggggcgctaagcgccaccgc aaggtcttgagagacaacattcagggcatcaccaagcctgccattcggcgtctagctcgg cgtggcggcgttaagcggatctctggcctcatttacgaggagacccgcggtgtgctgaag gtgttcctggagaatgtgattcgggacgcagtcacctacaccgagcacgccaagcgcaag accgtcacagccatggatgtggtgtacgcgctcaagcgccaggggcgcaccctcaatcgg gttgccagcaaagcactggatgcaagccttgccttccagaagcttaccagtcgggttgcc agcaaagcagtggatgcaagacttgccctccaggagcttaccatcacaacgaagaagaca aataaatgcataatatatagacgacataaatccatactgtacacatttaagaataaacag tccagtagtaagaggcagtacatattcaatctgctgagaaatgtagacaataactactat aagaatcctaatgctacagaagtcactggctgctgggaaaccggggaaaacttggctatg gacgatctgaactcgggcaccagcgtctgcccacatgtccgacgaaaacaacgtaaagga actactgagtctggacagaaaatccagacataccatgcagcttctaagaggaccgagtgc cttcactggaatagtgaatctccttctgtggacatggacagggaacggctcgataaagcc cagattcacaaaattccgcccacaccccgtgcccccgggaattttggcttaggggcagga tatgggggttaccactgtgctagtggaggtggcgcggggactgaacggcagccccagcgc agttctccccacgtttgtccgcagaaccccggcgaggccgagggccccggtcacgcgcgg ggggagccggaccgccaaagcccgcgagccggcgcccaggaaagcgtccgcagcccggcc agtgcggataggccaattggccgaagtcggcaaagctcaagacggcaccgcagcaggggg acccgaccccgttttgttgggagcctaagcggaagtgccttccccgctctaatgccggag gagcacggagggcaagcggtacagcttcttccaagtgctgat