GENSCAN 1.0	Date run: 3-Nov-116	Time: 14:53:05

Sequence gi568815584f:38109432_38310562 : 201131 bp : 38.02% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4298   4303    6  1  0   64  116    10 0.191   1.76
 1.02 Term +  19662  19835  174  2  0   71   44   167 0.712   7.38
 1.03 PlyA +  20246  20251    6                               1.05

 2.04 PlyA -  24277  24272    6                               1.05
 2.03 Term -  29612  29441  172  1  1   61   47   133 0.360   2.82
 2.02 Intr -  64472  64372  101  2  2   69   47    71 0.144  -0.91
 2.01 Init -  65464  65408   57  1  0   73   91    70 0.830   7.16
 2.00 Prom -  76799  76760   40                              -5.65

 3.00 Prom +  81519  81558   40                              -2.55
 3.01 Init +  81981  82136  156  2  0   94   19    39 0.476  -2.30
 3.02 Intr +  83037  83225  189  2  0  122   33    62 0.794   2.96
 3.03 Term +  83558  83794  237  1  0   33   42   196 0.759   4.68
 3.04 PlyA +  86240  86245    6                               1.05

 4.03 PlyA -  86620  86615    6                               1.05
 4.02 Term -  89880  89584  297  0  0   93   55   139 0.851   5.38
 4.01 Init -  90009  89917   93  0  0   59   75    84 0.918   4.63
 4.00 Prom -  96870  96831   40                              -4.75

 5.00 Prom +  97247  97286   40                             -10.84
 5.01 Init +  98319  98324    6  2  0   69  103     4 0.643   0.79
 5.02 Intr +  99615  99799  185  2  2  101   74    61 0.943   3.66
 5.03 Term + 100003 101134 1132  1  1  121   52  1476 0.953 137.76
 5.04 PlyA + 103462 103467    6                               1.05

 6.02 PlyA - 103662 103657    6                               1.05
 6.01 Sngl - 146591 145119 1473  2  0   49   50  1010 0.998  86.58
 6.00 Prom - 154598 154559   40                              -5.65

 7.05 PlyA - 154715 154710    6                               1.05
 7.04 Term - 159216 158683  534  0  0  103   49   238 0.973  14.66
 7.03 Intr - 161776 161696   81  1  0   44   53   122 0.588   3.22
 7.02 Intr - 168807 168635  173  1  2   18   71    70 0.056  -2.96
 7.01 Intr - 172732 172699   34  1  1   94  106    44 0.040   3.68

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:38109432_38310562|GENSCAN_predicted_peptide_1|59_aa
MKAPQAEKQQRRTEENKLPNVREKQQPDVREKQLDFRGMAWWQDPREEFGWSISLFSHC
>gi568815584f:38109432_38310562|GENSCAN_predicted_CDS_1|180_bp
atgaaggctccacaagctgagaagcagcagagaaggacagaagagaataagctgccgaat
gtcagagagaagcagcagccagatgtcagagagaagcagcttgacttcagagggatggct
tggtggcaggaccccagagaagagtttggctggtctattagtctgttttcacactgctga
>gi568815584f:38109432_38310562|GENSCAN_predicted_peptide_2|109_aa
MESGKGGTKAMKRVKESEQKRLTINQLAIKRKIIISDLRAKHSMLEEYRIESRKESPVHR
ACLYQSYKMYDLALRPVCVTLSHDVGNLAAEMQLHSGDTQPESKQHSDS
>gi568815584f:38109432_38310562|GENSCAN_predicted_CDS_2|330_bp
atggaatcggggaaaggagggacaaaggccatgaagagggtgaaggagagtgagcagaaa
agattgaccattaaccaattagccatcaaaaggaagattataataagtgatctaagagcc
aaacactcaatgctggaggagtacagaattgagagtaggaaagaaagtccagttcatagg
gcttgcttatatcagagttacaaaatgtatgacctggctttgaggccagtctgtgtgacg
ctgagtcacgatgttggcaatttagcagcggagatgcagcttcacagtggtgacactcag
cccgagagtaagcaacacagcgactcctga
>gi568815584f:38109432_38310562|GENSCAN_predicted_peptide_3|193_aa
MDWEGSLPLVFNPCRDAFLIIHPCFKGVRPRRDACLCPSPLAASPTFLGKAQAAPRQAEL
VPNSSSASPPPPYNLFITSPPHTWSGLQFRSVTSPPPPAQQLTLKNVAGAKGIVKHTRTS
KRLNRSGQAFLQNLLPQELATLAGNLATGPRNARSPGFLLSHVLSVWDPTENQTVQLTWQ
PLPEPLELWPKAL
>gi568815584f:38109432_38310562|GENSCAN_predicted_CDS_3|582_bp
atggactgggaaggcagccttcccttggtgtttaatccttgcagggatgcctttctgatt
attcacccatgtttcaaaggtgtcagaccacgcagggacgcctgcctttgtccttcaccc
ttagcggcaagtcccacttttctggggaaggcgcaagctgctcctcgccaggccgagcta
gttcccaattcttcctcagcctctcctcctccaccctataatctttttatcacctcccct
cctcacacctggtccggcttacagtttcgttccgtgactagccctccccctcctgcccag
caacttactcttaaaaatgtggctggagccaaaggcatagtcaagcacacaagaacttcc
aaacgcctgaaccgcagcggccaggcgttcctccagaacctcctcccccaggagcttgct
acacttgccggaaatctggccactgggccaaggaatgcccgcagcccgggattcctccta
agccacgtcctatctgtgtgggaccccactgaaaatcagactgttcaactcacctggcag
ccactcccagagcccctggaactctggcccaaggctctctga
>gi568815584f:38109432_38310562|GENSCAN_predicted_peptide_4|129_aa
MEGGAKRWFPLKVEQRRCFSGTGMDSMAGRQDERTLSGKFVKHNNQQSVAEWGMQPAGQG
YWARHPRAQWVVSSNRVGDAEEPTEEFQRRKKHSQRAQCMIITAPLKEEVGCPQMIFPLP
CRNQRQRKG
>gi568815584f:38109432_38310562|GENSCAN_predicted_CDS_4|390_bp
atggaggggggtgcaaaacgctggttcccactgaaagtggagcagaggcggtgtttctca
ggcactggcatggacagtatggcaggcaggcaggatgagaggacgctcagcgggaagttc
gtgaagcacaacaatcaacaatcagtagctgagtggggaatgcagccagctggccaagga
tattgggcacgccatccaagggcacaatgggtggtctccagcaacagagtgggcgatgct
gaagaacccactgaagaatttcaaaggagaaagaagcactcacagagggcacaatgcatg
attataactgcaccgctcaaggaagaagttggctgtccccaaatgattttccctctcccc
tgcaggaaccaaagacagagaaaggggtga
>gi568815584f:38109432_38310562|GENSCAN_predicted_peptide_5|440_aa
MVLLTRWWCLLIIYQSRAAPVNGCAVRCSHILASPLHGRLCPGTPELQTAEPRQPLGCAP
RRRRPSPGSCGEGGGSRGPGAGAADGMEEPGRNASQNGTLSEGQGSAILISFIYSVVCLV
GLCGNSMVIYVILRYAKMKTATNIYILNLAIADELLMLSVPFLVTSTLLRHWPFGALLCR
LVLSVDAVNMFTSIYCLTVLSVDRYVAVVHPIKAARYRRPTVAKVVNLGVWVLSLLVILP
IVVFSRTAANSDGTVACNMLMPEPAQRWLVGFVLYTFLMGFLLPVGAICLCYVLIIAKMR
MVALKAGWQQRKRSERKITLMVMMVVMVFVICWMPFYVVQLVNVFAEQDDATVSQLSVIL
GYANSCANPILYGFLSDNFKRSFQRILCLSWMDNAAEEPVDYYATALKSRAYSVEDFQPE
NLESGGVFRNGTCTSRITTL
>gi568815584f:38109432_38310562|GENSCAN_predicted_CDS_5|1323_bp
atggtgctgctgacgcgctggtggtgcctattaatcatttaccagtccagagccgcgcca
gttaatggctgtgccgtgcggtgctcccacatcctggcctctcctctccacggtcgcctg
tgcccgggcaccccggagctgcaaactgcagagcccaggcaaccgctgggctgtgcgccc
cgccggcgccgccccagcccgggcagctgcggcgaaggcggcggcagcaggggccccggg
gccggcgctgcggacggcatggaggagccagggcgaaatgcgtcccagaacgggaccttg
agcgagggccagggcagcgccatcctgatctctttcatctactccgtggtgtgcctggtg
gggctgtgtgggaactctatggtcatctacgtgatcctgcgctatgccaagatgaagacg
gccaccaacatctacatcctaaatctggccattgctgatgagctgctcatgctcagcgtg
cccttcctagtcacctccacgttgttgcgccactggcccttcggtgcgctgctctgccgc
ctcgtgctcagcgtggacgcggtcaacatgttcaccagcatctactgtctgactgtgctc
agcgtggaccgctacgtggccgtggtgcatcccatcaaggcggcccgctaccgccggccc
accgtggccaaggtagtaaacctgggcgtgtgggtgctatcgctgctcgtcatcctgccc
atcgtggtcttctctcgcaccgcggccaacagcgacggcacggtggcttgcaacatgctc
atgccagagcccgctcaacgctggctggtgggcttcgtgttgtacacatttctcatgggc
ttcctgctgcccgtgggggctatctgcctgtgctacgtgctcatcattgctaagatgcgc
atggtggccctcaaggccggctggcagcagcgcaagcgctcggagcgcaagatcacctta
atggtgatgatggtggtgatggtgtttgtcatctgctggatgcctttctacgtggtgcag
ctggtcaacgtgtttgctgagcaggacgacgccacggtgagtcagctgtcggtcatcctc
ggctatgccaacagctgcgccaaccccatcctctatggctttctctcagacaacttcaag
cgctctttccaacgcatcctatgcctcagctggatggacaacgccgcggaggagccggtt
gactattacgccaccgcgctcaagagccgtgcctacagtgtggaagacttccaacctgag
aacctggagtccggcggcgtcttccgtaatggcacctgcacgtcccggatcacgacgctc
tga
>gi568815584f:38109432_38310562|GENSCAN_predicted_peptide_6|490_aa
MRPAFALCLLWQALWPGPGGGEHPTADRAGCSASGACYSLHHATMKRQAAEEACILRGGA
LSTVRAGAELRAVLALLRAGPGPGGGSKDLLFWVALERRRSHCTLENEPLRGFSWLSSDP
GGLESDTLQWVEEPQRSCTARRCAVLQATGGVEPAGWKEMRCHLRANGYLCKYQFEVLCP
APRPGAASNLSYRAPFQLHSAALDFSPPGTEVSALCRGQLPISVTCIADEIGARWDKLSG
DVLCPCPGRYLRAGKCAELPNCLDDLGGFACECATGFELGKDGRSCVTSGEGQPTLGGTG
VPTRRPPATATSPVPQRTWPIRVDEKLGETPLVPEQDNSVTSIPEIPRWGSQSTMSTLQM
SLQAESKATITPSGSVISKFNSTTSSATPQAFDSSSAVVFIFVSTAVVVLVILTMTVLGL
VKLCFHESPSSQPRKESMGPPGLESDPEPAALGSSSAHCTNNGVKVGDCDLRDRAEGALL
AESPLGSSDA
>gi568815584f:38109432_38310562|GENSCAN_predicted_CDS_6|1473_bp
atgaggccggcgttcgccctgtgcctcctctggcaggcgctctggcccgggccgggcggc
ggcgaacaccccactgccgaccgtgctggctgctcggcctcgggggcctgctacagcctg
caccacgctaccatgaagcggcaggcggccgaggaggcctgcatcctgcgaggtggggcg
ctcagcaccgtgcgtgcgggcgccgagctgcgcgctgtgctcgcgctcctgcgggcaggc
ccagggcccggagggggctccaaagacctgctgttctgggtcgcactggagcgcaggcgt
tcccactgcaccctggagaacgagcctttgcggggtttctcctggctgtcctccgacccc
ggcggtctcgaaagcgacacgctgcagtgggtggaggagccccaacgctcctgcaccgcg
cggagatgcgcggtactccaggccaccggtggggtcgagcccgcaggctggaaggagatg
cgatgccacctgcgcgccaacggctacctgtgcaagtaccagtttgaggtcttgtgtcct
gcgccgcgccccggggccgcctctaacttgagctatcgcgcgcccttccagctgcacagc
gccgctctggacttcagtccacctgggaccgaggtgagtgcgctctgccggggacagctc
ccgatctcagttacttgcatcgcggacgaaatcggcgctcgctgggacaaactctcgggc
gatgtgttgtgtccctgccccgggaggtacctccgtgctggcaaatgcgcagagctccct
aactgcctagacgacttgggaggctttgcctgcgaatgtgctacgggcttcgagctgggg
aaggacggccgctcttgtgtgaccagtggggaaggacagccgacccttggggggaccggg
gtgcccaccaggcgcccgccggccactgcaaccagccccgtgccgcagagaacatggcca
atcagggtcgacgagaagctgggagagacaccacttgtccctgaacaagacaattcagta
acatctattcctgagattcctcgatggggatcacagagcacgatgtctacccttcaaatg
tcccttcaagccgagtcaaaggccactatcaccccatcagggagcgtgatttccaagttt
aattctacgacttcctctgccactcctcaggctttcgactcctcctctgccgtggtcttc
atatttgtgagcacagcagtagtagtgttggtgatcttgaccatgacagtactggggctt
gtcaagctctgctttcacgaaagcccctcttcccagccaaggaaggagtctatgggcccg
ccgggcctggagagtgatcctgagcccgctgctttgggctccagttctgcacattgcaca
aacaatggggtgaaagtcggggactgtgatctgcgggacagagcagagggtgccttgctg
gcggagtcccctcttggctctagtgatgcatag
>gi568815584f:38109432_38310562|GENSCAN_predicted_peptide_7|273_aa
VFGEQVVFGYMDCRIFEINKRYECNITPDDYSGYNMKICQELCWHRINPLEYVSGKKKES
NSLSSCQWKNVLRLRLENCRSCMRVFGEGPQEVGEEGISEGKAAAPGRGLQIKLPSTWKR
APGGRGGCGHSYSRLKHSCLPALKRAADLPAQCLSSAKGQTASSSCSLTPMHPDLETPPS
MGRQTPHTGELWLASGRCLFGMKLSEEGAGSNLGGSAASASDTQANRVWSGPQQTPADLQ
KMGLTVRGKTNKQKEIRSTSTKNKTSQSFKNFI
>gi568815584f:38109432_38310562|GENSCAN_predicted_CDS_7|822_bp
gtttttggcgaacaggtagtatttggttacatggactgcagaatatttgaaattaataaa
aggtatgagtgcaatataacccctgatgactattcaggatacaacatgaaaatttgccaa
gagctttgctggcacagaattaatcccttggagtacgttagtggaaagaagaaagaaagt
aattctctctcatcatgtcaatggaagaatgttttaaggctgcgacttgagaactgccga
tcttgtatgagggtgtttggagaaggcccacaagaagtaggcgaggagggcatctctgaa
ggaaaggcagcagccccaggcaggggcttacagataaaactcccatctacctggaaaaga
gcacctgggggaaggggcggctgtgggcacagttacagcagacttaaacattcctgcctg
ccagctctgaagagagcagctgatctcccagcacagtgcttgagctctgctaagggacag
actgcctcctcaagttgttccctaacccccatgcatcctgacttggagacacctcctagc
atgggtcgacagacacctcatacaggagagctatggctggcatctggcaggtgccttttt
gggatgaagctttcagaggaaggagcaggcagcaatcttggaggttctgcagcctccgct
agtgatacccaggcaaacagggtctggagtggaccccagcaaactccagcagacctgcag
aagatgggcctgactgttaggggaaaaactaacaaacagaaagaaataagatcaacatca
acaaaaaacaaaacatcacaatcattcaaaaacttcatctga