GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:24:04 Sequence gi568815597f:51959848_52186755 : 226908 bp : 43.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 224 295 72 1 0 75 97 31 0.755 3.87 1.02 Intr + 777 937 161 2 2 76 89 36 0.692 1.29 1.03 Term + 7033 7072 40 1 1 119 49 33 0.223 -0.84 1.04 PlyA + 7471 7476 6 1.05 2.09 PlyA - 8608 8603 6 1.05 2.08 Term - 10354 10313 42 1 0 51 49 61 0.106 -4.34 2.07 Intr - 17270 16974 297 2 0 85 59 447 0.019 38.57 2.06 Intr - 38191 38102 90 1 0 81 85 30 0.084 2.19 2.05 Intr - 38717 38689 29 0 2 73 92 16 0.050 -1.67 2.04 Intr - 63727 63644 84 2 0 81 75 92 0.926 6.99 2.03 Intr - 64732 64663 70 1 1 58 82 59 0.625 1.05 2.02 Intr - 67501 67428 74 2 2 80 94 29 0.574 1.83 2.01 Init - 68770 68731 40 1 1 56 115 -7 0.572 -0.84 2.00 Prom - 69876 69837 40 -3.26 3.03 PlyA - 70094 70089 6 1.05 3.02 Term - 74039 72850 1190 0 2 50 47 1249 0.167 108.91 3.01 Init - 95249 95153 97 2 1 88 36 150 0.473 8.27 3.00 Prom - 97681 97642 40 -6.36 4.00 Prom + 98191 98230 40 -8.16 4.01 Init + 100001 100054 54 1 0 67 113 43 0.470 5.98 4.02 Intr + 104978 105091 114 1 0 104 64 67 0.600 6.54 4.03 Intr + 123493 123694 202 0 1 48 100 91 0.160 5.16 4.04 Intr + 126265 126324 60 2 0 38 111 120 0.989 8.01 4.05 Intr + 181930 182556 627 2 0 63 105 357 0.305 27.08 4.06 Term + 199476 199558 83 1 2 50 49 108 0.019 0.96 4.07 PlyA + 200904 200909 6 1.05 5.04 PlyA - 201192 201187 6 1.05 5.03 Term - 202560 202406 155 1 2 16 43 101 0.094 -3.52 5.02 Intr - 203499 203384 116 2 2 34 93 113 0.547 6.49 5.01 Init - 213592 211812 1781 1 2 37 53 469 0.337 26.99 5.00 Prom - 214865 214826 40 -4.96 6.02 PlyA - 215034 215029 6 1.05 6.01 Sngl - 216269 215487 783 2 0 79 50 470 0.836 38.27 6.00 Prom - 219893 219854 40 -7.06 7.00 Prom + 219934 219973 40 -8.06 7.01 Sngl + 220138 220575 438 0 0 56 48 354 0.999 24.46 7.02 PlyA + 222897 222902 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 73914 72850 1065 0 0 71 47 1179 0.825 109.25 S.002 Init + 123499 123694 196 0 1 77 100 87 0.820 7.89 S.003 Term + 126865 126911 47 2 2 124 42 39 0.933 0.27 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:51959848_52186755|GENSCAN_predicted_peptide_1|90_aa MEGKPEGRECHRAAAVQGGEQESKQHIRQVPSRLGESPALDGNWTSSGAEHCCSLHPLKM GVKGPQLTTNFSVDDSALSPAYTQDEIAPS >gi568815597f:51959848_52186755|GENSCAN_predicted_CDS_1|273_bp atggagggcaagcctgagggcagggagtgccaccgggcagctgcagtccaagggggagaa caggaatctaagcaacacatcaggcaggttccaagcaggctaggagaatctccagcctta gatggcaactggacgtcttcaggggcagagcactgctgctctctccatccattgaaaatg ggagtaaaagggccacaactcacaactaatttctctgttgatgactccgccttgtctcca gcttacactcaagacgagattgcaccttcctga >gi568815597f:51959848_52186755|GENSCAN_predicted_peptide_2|241_aa MVIIHKSWCGACKALKPKFAESTEISELSHNFVMVNLEDEEEPKDEDFSPDGGYIPRILF LDPSGKVHPEIINENGNPSYKYFYVSAEQAGVREIFASWIPPLLTVCALAMEQCPCCRFE PLSPRKQQLMASVTDGKTGVKDASDQNFDYMFKLLIIGNSSVGKTSFLFRYADDTFTPAF VSTVGIDFKVKTVYRHEKRVKLQIWVSPGNLGQDFPEKRLACQYKGLERLSGADQENNPE D >gi568815597f:51959848_52186755|GENSCAN_predicted_CDS_2|726_bp atggtgattattcataaatcctggtgtggagcttgcaaagctctaaagcccaaatttgca gaatctacggaaatttcagaactctcccataattttgttatggtaaatcttgaggatgaa gaggaacccaaagatgaagatttcagccctgacgggggttatattccacgaatccttttt ctggatcccagtggcaaggtgcatcctgaaatcatcaatgagaatggaaaccccagctac aagtatttttatgtcagtgccgagcaagcaggtgttcgggagatatttgctagctggatt ccacccctgctgacagtgtgtgccctggcgatggagcagtgtccttgttgcagatttgaa ccactttcacctcgtaaacagcagctgatggcttcagtgacagatggtaaaactggagtc aaagatgcctctgaccagaattttgactacatgtttaaactgcttatcattggcaacagc agtgttggcaagacctccttcctcttccgctatgctgatgacacgttcaccccagccttc gttagcaccgtgggcatcgacttcaaggtgaagacagtctaccgtcacgagaagcgggtg aaactgcagatctgggtgagtcccgggaatcttgggcaggattttcctgagaagcggctg gcctgccagtacaagggcttagagaggctgtctggtgctgatcaagagaacaatccagag gattga >gi568815597f:51959848_52186755|GENSCAN_predicted_peptide_3|428_aa METRPRLGATCLLGFSFLLLVISSDGHNGLGKETRGSVKVKGDRKSSLHFRLTSETRTTR KLAQRGCQWSLPERMPLVVFCGLPYSGKSRRAEELRVALAAEGRAVYVVDDAAVLGAEDP AVYGDSAREKALRGALRASVERRLSRHDVVILDSLNYIKGFRYELYCLARAARTPLCLVY CVRPGGPIAGPQVAGANENPGRNVSVSWRPRAEEDGRAQAAGSSVLRELHTADSVVNGSA QADVPKELEREESGAAESPALVTPDSEKSAKHGSGAFYSPELLEALTLRFEAPDSRNRWD RPLFTLVGLEEPLPLAGIRSALFENRAPPPHQSTQSQPLASGSFLHQLDQVTSQVLAGLM EAQKSAVPGDLLTLPGTTEHLRFTRPLTMAELSRLRRQFISYTKMHPNNENLPQLANMFL QYLSQSLH >gi568815597f:51959848_52186755|GENSCAN_predicted_CDS_3|1287_bp atggagacgcggcctcgtctcggggccacctgtttgctgggcttcagtttcctgctcctc gtcatctcttctgatggacataatgggcttggaaaggaaacaagagggtctgtaaaagtc aagggagacaggaagagcagtttgcacttccggcttacgtcggagacgcgtacaacccgg aagttggcgcagcgcggttgccaatggtcgctccctgagaggatgccgctcgtggtgttt tgcgggctgccgtacagcggcaagagccggcgtgctgaagagttgcgcgtggcgctggct gccgagggccgcgcggtgtacgtggtggacgacgcagctgtcctgggcgcagaggaccca gcggtgtacggcgattctgcccgtgagaaggcattgcgtggagctctgcgagcctccgtg gaacggcgcctgagtcgccacgacgtggtcatcctggactcgcttaactacatcaaaggt ttccgttacgagctctactgcctggcacgggcggcgcgcaccccgctctgcctggtctac tgcgtacggcccggcggcccgatcgcgggacctcaggtggcgggcgcgaacgagaaccct ggccggaacgtcagtgtgagttggcggccacgcgctgaggaggacgggagagcccaggcg gcgggcagcagcgtcctcagggaactgcatactgcggactctgtagtaaatggaagtgcc caggccgacgtacccaaggaactggagcgagaagaatccggggctgcggagtctccagct cttgtgactccggattcagagaaatctgcaaagcatgggtccggtgccttttactctccc gaactcctggaggccctaacgctgcgctttgaggctcccgattctcggaatcgctgggac cggcctttattcactttggtgggcctagaggagccgttgcccctggcggggatccgctct gccctgtttgagaaccgggccccaccaccccatcagtctacgcagtcccagcccctcgcc tccggcagctttctgcaccagttggaccaggtcacgagtcaagtactggccggattgatg gaagcgcagaagagcgctgtccccggggacttgctcacgcttcctggtaccacagagcac ttgcggtttacccggcccttgaccatggcagaactgagtcgccttcgtcgccagtttatt tcgtacactaaaatgcatcccaacaatgagaacttgccgcaactggccaacatgtttctt cagtatttgagccagagcctgcactga >gi568815597f:51959848_52186755|GENSCAN_predicted_peptide_4|379_aa MNQEKLAKLQAQVRIGGKGTARRKKKVVHRTATADDKKLQSSLKKLAVNNIAGIEEVNMI KDDGTVIHFNNPKVQASLSANTFAITGHAEAKPITEMLPGILSQLGADSLTSLRKLAEQF PRQVLDSKAPKPEDIDEEDDDVPGAGPSDKNVHVSSGENEKVLGSGGGGERQGLVGTEGL PVPGKRSPGPAAGRGLRREHGALAAPRPRSLAFPAARAQRGRRSPDAVAGRGVQGALWPS GAGGRSGHPTLEASLRIPACRTSVVPPQPCAPGPARRRSSGIRVAAAARAPMRLGGCRAL AAVAAAAAATAAPTGVGGFAAAPGAAPVPSGGGFRLRDDPRGHAEAEAAAARVNFWYQLC QFESFRKQLLRQDYKRYMK >gi568815597f:51959848_52186755|GENSCAN_predicted_CDS_4|1140_bp atgaatcaagaaaagttagccaaacttcaggctcaggtccggatagggggcaagggtaca gctcgcagaaagaagaaggtggtacatagaacagccacagctgatgacaaaaagcttcag agttctctaaaaaaactggctgtgaataatatagctggtattgaagaggtgaacatgatt aaagatgatgggacagttattcatttcaacaatcccaaagtccaagcttccctttctgct aatacctttgcaattactggtcatgcagaagccaaaccaatcacagaaatgcttcctgga atattaagtcagcttggtgctgacagtttaacaagccttaggaagttagctgaacagttc ccacggcaagtcttggacagtaaagcaccaaaaccagaagacattgatgaggaagatgat gatgttccaggtgcgggtccgagtgacaagaacgttcacgtatcttcgggagagaacgaa aaggtgctcgggtctggcgggggcggggaacgccaggggttggtgggaactgaggggctt ccggttcccgggaagcggagtcccgggccggccgcgggccgcgggctacgcagggagcac ggggccctcgccgcccccaggccgcgctccttagcgttccctgcggctcgtgcccagcgc ggccgccgcagtcccgacgccgtagccggacgtggggtccagggggcgctgtggcccagc ggcgccggcggcaggagcggccacccgacgctggaggcttcgctgaggatccccgcctgc cgcacctcggtcgtcccgccgcagccttgcgcccccggcccggctcggcggcgctcctcc gggatccgcgtggctgccgcggcccgggcgccgatgcggctgggcgggtgccgggcctta gcagcagtagcagccgcagctgcggctaccgcagctcctacaggagtgggaggtttcgcg gcggcgcctggggccgcgccggtgccctccggaggaggctttcggctcagggacgacccc cgtggccatgctgaagcggaggcggccgctgctcgagtcaatttctggtaccagctgtgc cagtttgagtccttcaggaagcagctgctgagacaggactataagagatacatgaagtag >gi568815597f:51959848_52186755|GENSCAN_predicted_peptide_5|683_aa MNIDAKILNKILANQIQQHIKKLIHHDQVGFIPGMQDQFNISKSINVIQHINRTKDKNHM IISVDAEKAFDKIQQHFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKT GTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFVDDMIVYLENPIVSA QNLLKLISNFGKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTR DVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGTINTVKMAILPKVIYRFNAIPIKLPMT FFTELEKTTLKFIRNQKRARITKSILSQKNKAGGIMLPDFKLYYKATVTKTAWYWYQNRD IDQWNRTEPSEITPHIYNYLIFDKSEINKQWGKDSLFNKWCWENWLAICRKLKLDPFLTS YKKFNSRWTKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIK LKSFCTAKETTIRVNRQPTKWKKIFATYSSDKELISRIYNEHKQIYKKKTNNPIKKQAKD MNRHFSKEDIYVAKRHMKKCSSSLAIREMQIKTTMRYHLIPVRMAIIKKSGNNSSGEHLI DMSDVEENNFKGIVNVHEETEEFFPKMLKINQGIYMGRTHSGGGGGGGKGGSGGGHRRRD SYYDRGYHRGYDRYEDYDYRYRR >gi568815597f:51959848_52186755|GENSCAN_predicted_CDS_5|2052_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaccaaatccagcagcacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaagaccagttcaat ataagcaaatcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatg attatctcagtagatgcagaaaaagcctttgacaaaattcaacaacacttcatgctaaaa actctcaataaattaggtattgatgggacgtatctcaaaataataagagctatctatgac aaaccaacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctg gccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtc aaattgtccctgtttgtagatgacatgattgtatatctagaaaaccccattgtctcagcc caaaatctccttaagctgataagcaacttcggcaaagtctcaggatacaaaatcaatgta caaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagt gaactcccattcacaattgcttcaaagagaataaaatacttaggaatccaacttacaagg gacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggat acaaacaaatggaagaacattccatgctcatgggtaggaacaatcaataccgtgaaaatg gccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgact ttcttcacagaattggaaaaaactactttaaagttcataaggaaccaaaaaagagcccga atcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacctgacttc aaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaatagagat atagatcaatggaacagaacagagccctcagaaataacgccgcatatctacaactatctg atctttgacaaatctgagataaacaagcaatggggaaaggattccctatttaataaatgg tgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacatct tataaaaaatttaattcaagatggactaaagacttaaacgttagacctaaaaccataaaa accctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttcatg tctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctcattaaa ctaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaaaa tggaagaaaattttcgcaacctactcatctgacaaagagctaatatccagaatctacaat gaacacaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagcaggcaaaggat atgaacagacacttctcaaaagaagacatttatgtagccaaaagacacatgaaaaaatgc tcatcatcactggccatcagagaaatgcaaatcaaaactacaatgagataccatctcata ccagttagaatggcgatcattaaaaagtcaggaaacaacagttctggggagcacctcata gacatgagtgatgtagaggagaacaacttcaagggcatagttaatgttcatgaagaaact gaagagttttttccaaaaatgttgaagataaatcaaggcatctatatgggcagaactcat agtggtggtggcggaggtggtggcaaaggcggcagtggaggtggtcacagacgtcgagat tcttactatgatagaggatatcatcgtggatatgacagatatgaagactatgattaccgg tacagaagatga >gi568815597f:51959848_52186755|GENSCAN_predicted_peptide_6|260_aa MRKKQSRKTGNSKKQRVSPPPKERSSSPAMEQTWMENDFDELREEGFRRSNYSKLQEEIQ TKGKEVENFEKNLDKCITRITNTEKCLKELKAKARELREECRSLRSRCDQLEETVSVMED EMNEMKQEGKFREKRIKRNEQSLQEKWDYVKRPNLHLNGVPESDGENGTKLANTLQDIIQ ENFPNLARQANIQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAATEKGRVTHK GKPIRLTADLSAETLRARRE >gi568815597f:51959848_52186755|GENSCAN_predicted_CDS_6|783_bp atgaggaaaaaacagagcagaaaaactggaaactctaaaaagcagcgtgtctctcctcct ccaaaggaacgcagttcctcaccagcaatggaacaaacctggatggagaatgactttgac gagctgagagaagaaggcttcagacgatcaaactactccaagctacaggaggaaattcaa accaaaggcaaagaagttgaaaactttgaaaaaaacttagacaaatgtataactagaata accaatacagagaagtgcttaaaggagctgaaagccaaggctcgagaactacgtgaagaa tgcagaagcctcaggagccgatgcgatcaactggaagaaacggtttcagtgatggaagat gaaatgaatgaaatgaagcaagaagggaagtttagagaaaaaagaataaaaagaaacgaa caaagcctccaagaaaaatgggactatgtgaaaagaccaaatctacatctgaatggtgta cctgaaagtgacggggagaatggaaccaagttggcaaacactctgcaggatattatccag gagaacttccccaatctagcaaggcaggccaacattcagattcaggaaatacagagaacg ccacaaagatactcctcgagaagagcaactccaagacacataattgtcagattcaccaaa gttgaaatgaaggaaaaaatgttaagggcagccacagagaaaggtcgggttacccacaaa gggaagcccatcagactaacagctgatctctcggcagaaactctacgagccagaagagag tag >gi568815597f:51959848_52186755|GENSCAN_predicted_peptide_7|145_aa MTLEELEDREDKFNEEDECAIEMYRQQRLAEWKATKLKNKFEEVLEISGKDYVQEVTKAS EGLWVVLHLYKQGIPLCALINQHLSGVARKFPDVKFIKAISTTCIPSYPDRNLPMIFVYV EGDIKAQFIGPLVFGGINLTRDELQ >gi568815597f:51959848_52186755|GENSCAN_predicted_CDS_7|438_bp atgactttggaagagctggaggatcgtgaagacaagtttaatgaggaggatgaatgtgct attgaaatgtacagacagcagagactggctgagtggaaagcaactaaattgaagaataaa tttgaagaagttttggagatctcagggaaggattatgttcaagaagttaccaaagccagt gagggtttgtgggtcgtcttgcacctttacaaacaaggaattcccctctgtgccctgata aatcagcacctcagtggagttgccaggaagtttcctgatgtcaaatttatcaaagccatt tcaacaacctgcatacccagttatcctgataggaatctgcccatgatatttgtttacgtg gaaggagatatcaaggctcagtttattggtcctctggtgtttggcggcattaacctgaca agagatgagttgcagtga