GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:33:38 Sequence gi568815596f:15842065_16046094 : 204030 bp : 45.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 158 153 6 1.05 1.04 Term - 1602 1442 161 1 2 51 44 116 0.392 1.60 1.03 Intr - 2269 2153 117 2 0 79 39 97 0.637 4.34 1.02 Intr - 4273 4224 50 0 2 84 100 -13 0.246 -2.18 1.01 Init - 4983 4901 83 0 2 88 94 85 0.724 7.55 1.00 Prom - 10496 10457 40 -3.36 2.00 Prom + 27088 27127 40 -3.36 2.01 Sngl + 27875 28192 318 1 0 115 35 366 0.970 29.97 2.02 PlyA + 28200 28205 6 1.05 3.00 Prom + 28246 28285 40 -5.66 3.01 Init + 29870 29924 55 1 1 77 114 -26 0.286 0.35 3.02 Intr + 33429 33611 183 1 0 94 80 17 0.169 1.26 3.03 Intr + 40971 41088 118 1 1 82 67 87 0.062 5.52 3.04 Intr + 56077 56314 238 1 1 -30 94 164 0.003 2.82 3.05 Intr + 70140 70325 186 2 0 21 72 130 0.609 4.59 3.06 Intr + 70426 70537 112 0 1 63 87 62 0.907 3.55 3.07 Term + 71898 72076 179 1 2 82 52 73 0.725 0.95 3.08 PlyA + 74288 74293 6 1.05 4.04 PlyA - 75134 75129 6 1.05 4.03 Term - 83130 83055 76 2 1 80 48 71 0.854 -0.39 4.02 Intr - 85513 85415 99 0 0 126 121 17 0.920 8.03 4.01 Init - 89736 89654 83 0 2 66 119 33 0.804 4.64 4.00 Prom - 98351 98312 40 -5.96 5.00 Prom + 99938 99977 40 -1.46 5.01 Init + 100025 100790 766 1 1 102 103 1008 0.400 98.56 5.02 Term + 103429 104033 605 2 2 41 42 866 0.945 71.98 5.03 PlyA + 104920 104925 6 1.05 6.00 Prom + 118578 118617 40 -4.56 6.01 Sngl + 119310 119921 612 2 0 42 39 175 0.354 4.20 6.02 PlyA + 120325 120330 6 1.05 7.10 PlyA - 121121 121116 6 1.05 7.09 Term - 127810 127794 17 2 2 135 47 -3 0.198 -1.30 7.08 Intr - 148180 148017 164 0 2 41 91 150 0.343 10.22 7.07 Intr - 155545 155467 79 2 1 61 76 59 0.094 0.61 7.06 Intr - 166789 166741 49 1 1 60 97 63 0.132 2.65 7.05 Intr - 171327 171111 217 2 1 35 23 143 0.063 1.01 7.04 Intr - 175702 175545 158 1 2 51 68 152 0.419 8.31 7.03 Intr - 176998 176962 37 0 1 50 98 16 0.137 -2.94 7.02 Intr - 179921 179843 79 0 1 54 107 61 0.075 3.21 7.01 Init - 184687 184648 40 1 1 87 100 8 0.754 2.33 7.00 Prom - 185571 185532 40 -4.56 8.00 Prom + 187800 187839 40 -0.46 8.01 Init + 196715 196788 74 1 2 61 59 137 0.648 7.29 8.02 Intr + 197770 197944 175 1 1 106 63 0 0.550 -0.76 8.03 Intr + 199874 199951 78 1 0 54 110 41 0.675 2.65 8.04 Intr + 200673 200717 45 2 0 98 116 30 0.816 5.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:15842065_16046094|GENSCAN_predicted_peptide_1|136_aa MIPKVVAQVRTLCLWGSMARLNPPPCSSPPALGTHLRARGPRSASVSMETGEGRQCTKVI LRVLIGVTAPHPSVPPEGRDLEAGVGFWQAPASPMGSPDGRVSSAEKGDLRPVLLLGPPW PCWHLRAFSPLIDASD >gi568815596f:15842065_16046094|GENSCAN_predicted_CDS_1|411_bp atgatacctaaggtggtggctcaggtgcggaccctctgcctctggggctcaatggccagg ctgaatcctccaccctgcagcagccctccagcactgggcactcacctccgtgcaagaggc cccaggagtgccagtgtttccatggaaacaggagaaggaaggcagtgcaccaaggtgata ctccgggtgctcattggtgtgaccgccccccatccatcagttccacctgagggcagagac ctggaggcgggagtaggcttctggcaggcaccggccagccccatgggcagcccagatggc agagtgtcttctgccgaaaagggggacctgcggccggtgcttctgctggggcctccgtgg ccctgctggcatctgcgggccttttctcccctcatagatgcttcagactga >gi568815596f:15842065_16046094|GENSCAN_predicted_peptide_2|105_aa MASISRLTCISSDLILHNDEVTVTEDKIDALIKAGGVNVELFWPSLLAKALASVNIGSLI CNVGAGGPAPVAGAAPAEEESGSKERRIRGSDDDMGFGFLTKPLS >gi568815596f:15842065_16046094|GENSCAN_predicted_CDS_2|318_bp atggcctccatctcccggcttacctgcatctcctcggacctcattctgcacaacgatgaa gtgaccgtcacggaggataagatcgatgccctcattaaagcaggcggtgtaaatgttgaa ctgttttggcctagcttgttggcaaaggccctggccagcgtcaacattgggagcctcatc tgcaatgtaggagctggtggacctgctccagtagctggtgctgctccagctgaggaagaa agtggaagcaaagaaagaagaatccgaggatctgatgatgacatgggctttgggtttttg actaaacctctttcataa >gi568815596f:15842065_16046094|GENSCAN_predicted_peptide_3|356_aa MWNCESINPPSFINYPASAPSCIPGLSQDWVGSCERSEHSDVFQELCGPGAKVHQLPRGT GSPGEGEVLSDKCWDHRLLEIFRLINKEGLIEHNHFATLDEIKDLGNDYQKLQKGEKIRP VKQIEPYRQGEHIQTEAKGFFEFEEMKRHIWGNVLEKPREPTFIELNTRVQSAAQKGNFK DLKRVPLRIQLSTYQCVQKVEDKTEKKIGTKMENPRDGSRWPGSWHTALEEAAFPWETAK ESFQDSSWASGEPVPADYCKAAFHAYDIISQKHLLFGNFSFLNISPLDEKQEVKEGQTVP PKCIMEVLPIAKHAWGNQCSTATRDGVLVVNWLMSDQTSKSHTQSASSNPSWHQHL >gi568815596f:15842065_16046094|GENSCAN_predicted_CDS_3|1071_bp atgtggaactgtgagtccattaaccctccttcctttataaattacccagcctcggccccc agttgcatcccagggttgtctcaggactgggtggggtcctgtgagagaagtgagcacagt gatgtcttccaagagctctgcggaccaggagcaaaggtccaccagctgcccagaggaaca ggcagtccaggcgagggagaggtgctcagtgataagtgttgggaccataggctgctggaa atattccggttaataaataaagaaggactaatagaacataaccattttgcgacccttgat gaaataaaagatctaggtaacgactatcaaaaattgcaaaaaggagagaaaatcagacca gtgaaacaaatagagccttacaggcagggagaacacatccagacagaagccaagggcttc tttgagtttgaggagatgaagcggcacatctggggaaatgtcctggaaaaaccaagggag ccaacattcatagaattgaatacccgtgtacagagcgctgcacagaaagggaacttcaaa gatctgaagagagttcctttaagaattcaattgagtacttaccagtgcgtacagaaagtg gaggacaagactgaaaagaagataggaaccaagatggaaaatcccagagacgggagccgc tggccaggttcctggcacacagcgttggaagaagcagccttcccctgggaaaccgccaag gagtcattccaggacagcagctgggcttccggagaaccggtgcccgctgactactgtaag gctgcctttcatgcttacgacataatctctcaaaaacatctcctgtttggtaacttcagt tttctgaacatctcccctttagatgagaagcaagaggtgaaagaagggcaaactgttcca cccaagtgtatcatggaagttttaccaattgcaaagcatgcctgggggaatcagtgctcc actgccacaagggatggggtcctcgtggtcaactggctgatgagtgatcaaacttcaaaa tcccatacacagtctgcttcctcaaacccttcctggcaccaacatctctga >gi568815596f:15842065_16046094|GENSCAN_predicted_peptide_4|85_aa MYGVPKTRLFGMDTFESSYNAYNGHISCGVSASVNLSFLLFKMGIVGTSLLAGLQPLPKA RSQFLEESSYGKWNFAVAPESEIYS >gi568815596f:15842065_16046094|GENSCAN_predicted_CDS_4|258_bp atgtatggagttcctaaaacaaggctgtttgggatggacacgtttgaatcctcttacaat gcttataatggtcatatttcctgcggcgtgtcagcttctgtgaatctcagtttcctcctc tttaaaatgggaatcgtgggcacctctctccttgcaggactgcagccattgccaaaagca aggagtcagttcttggaggaatcttcctatgggaaatggaattttgctgttgctccagaa agtgaaatttacagctga >gi568815596f:15842065_16046094|GENSCAN_predicted_peptide_5|456_aa MPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFA EHSSEPPSWVTEMLLENELWGSPAEEDAFGLGGLGGLTPNPVILQDCMWSGFSAREKLER AVSEKLQHGRGPPTAGSTAQSPGAGAASPAGRGHGGAAGAGRAGAALPAELAHPAAECVD PAVVFPFPVNKREPAPVPAAPASAPAAGPAVASGAGIAAPAGAPGVAPPRPGGRQTSGGD HKALSTSGEDTLSDSDDEDDEEEDEEEEIDVVTVEKRRSSSNTKAVTTFTITVRPKNAAL GPGRAQSSELILKRCLPIHQQHNYAAPSPYVESEDAPPQKKIKSEASPRPLKSVIPPKAK SLSPRNSDSEDSERRRNHNILERQRRNDLRSSFLTLRDHVPELVKNEKAAKVVILKKATE YVHSLQAEEHQLLLEKEKLQARQQQLLKKIEHARTC >gi568815596f:15842065_16046094|GENSCAN_predicted_CDS_5|1371_bp atgccgggcatgatctgcaagaacccagacctcgagtttgactcgctacagccctgcttc tacccggacgaagatgacttctacttcggcggccccgactcgacccccccgggggaggac atctggaagaagtttgagctgctgcccacgcccccgctgtcgcccagccgtggcttcgcg gagcacagctccgagcccccgagctgggtcacggagatgctgcttgagaacgagctgtgg ggcagcccggccgaggaggacgcgttcggcctggggggactgggtggcctcacccccaac ccggtcatcctccaggactgcatgtggagcggcttctccgcccgcgagaagctggagcgc gccgtgagcgagaagctgcagcacggccgcgggccgccaaccgccggttccaccgcccag tccccgggagccggcgccgccagccctgcgggtcgcgggcacggcggggctgcgggagcc ggccgcgccggggccgccctgcccgccgagctcgcccacccggccgccgagtgcgtggat cccgccgtggtcttcccctttcccgtgaacaagcgcgagccagcgcccgtgcccgcagcc ccggccagtgccccggcggcgggccctgcggtcgcctcgggggcgggtattgccgcccca gccggggccccgggggtcgcccctccgcgcccaggcggccgccagaccagcggcggcgac cacaaggccctcagtacctccggagaggacaccctgagcgattcagatgatgaagatgat gaagaggaagatgaagaggaagaaatcgacgtggtcactgtggagaagcggcgttcctcc tccaacaccaaggctgtcaccacattcaccatcactgtgcgtcccaagaacgcagccctg ggtcccgggagggctcagtccagcgagctgatcctcaaacgatgccttcccatccaccag cagcacaactatgccgccccctctccctacgtggagagtgaggatgcacccccacagaag aagataaagagcgaggcgtccccacgtccgctcaagagtgtcatccccccaaaggctaag agcttgagcccccgaaactctgactcggaggacagtgagcgtcgcagaaaccacaacatc ctggagcgccagcgccgcaacgaccttcggtccagctttctcacgctcagggaccacgtg ccggagttggtaaagaatgagaaggccgccaaggtggtcattttgaaaaaggccactgag tatgtccactccctccaggccgaggagcaccagcttttgctggaaaaggaaaaattgcag gcaagacagcagcagttgctaaagaaaattgaacacgctcggacttgctag >gi568815596f:15842065_16046094|GENSCAN_predicted_peptide_6|203_aa MIISIDAEKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANVILNRQKLEAFPLK TGTRQACPLSPLLFNIVLEVLARTIRQEKEIKGIQLEEKEVKLSLFADDMIIYLENPNVS APNLLKLISNFSKVSGYKINVQKSQAFLYTNNRKTESQIMSELPFTIASKKIKHLGMQLT RDEKDLFKENYKPLLNKIKEDTN >gi568815596f:15842065_16046094|GENSCAN_predicted_CDS_6|612_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacagcccttcatgcta aaaactctcaataaactaggtattgatgggatgtatctcaaaataataagagctatttat gacaaacccacagccaatgtcatactgaataggcaaaaactggaagcattccctttgaaa actggcacaagacaggcatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccaggacaatcaggcaggagaaagaaataaagggtattcaattagaagaaaaggaa gtcaaattgtccctgtttgcagatgacatgattatatatttagaaaaccccaacgtctca gccccaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gtgcaaaaatcacaagcattcctatacaccaataacagaaaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagaaaataaaacacctgggaatgcaacttaca agggatgagaaggacctcttcaaggagaactacaaaccactgctcaacaaaataaaagag gacacaaactaa >gi568815596f:15842065_16046094|GENSCAN_predicted_peptide_7|279_aa MAMSFVEISQLIQGNSNPEKAELSKFPQHNGDLGGKKDTSSAMVPSNHKAPKPIALVASG LEQREEAKESIIVREQVAMPVDDNVMATLVHTLKLPETSLCPNARGCGRSTRGAGGVVAV AAALRPDWPSVCKSQQTQQTCISLAGLAGCALGNYQCTSGLCAHGRALWVPSTCTSEGLP SLTYKTDNYIAAEGHTDETWRDQNVNPDLFPLSFTTTGLSPQKTFCFITCTDPKVTRLSP LKELALYALSLMTEAEALCPAEDAGNLYEDDGCIGLPIP >gi568815596f:15842065_16046094|GENSCAN_predicted_CDS_7|840_bp atggccatgtcatttgtagaaatttcacagctgattcaaggaaacagcaacccggagaaa gcagaattgtccaagtttccacagcacaatggtgaccttgggggcaaaaaagacacaagt tctgccatggtcccaagtaaccacaaagccccgaagcccatagctttggtggcctctggc ctggagcagagagaagaagccaaggaaagcatcatagtaagagaacaagttgctatgccc gttgatgacaatgtcatggctacacttgtccatacgttaaaattgcctgagactagcttg tgccccaacgccaggggctgcggccggtctacccggggcgcgggcggcgttgtcgcggtg gcggcggccctgcgccccgactggccatctgtatgcaaatctcagcagacccagcaaact tgcatcagcctggccggcctggcgggctgtgcactcggcaattatcagtgcacatcgggc ctttgtgcccacggccgggctttgtgggtgcccagcacctgcacctcggagggactcccc tccctcacctataaaacagacaactacatcgcagcagaaggtcacacggatgaaacatgg agggaccagaatgtgaatcctgatctgtttccactctcattcactacaacaggcttgagc ccccagaaaaccttctgcttcatcacatgcactgaccccaaagtcaccaggctttcccct ctcaaggaacttgcactctatgccttgtctctcatgacagaagcagaggccttgtgccca gcagaagatgccgggaatctttatgaggatgatggatgcattggcctccctattccctga >gi568815596f:15842065_16046094|GENSCAN_predicted_peptide_8|124_aa MEFSDPFQLFLAAALKLSAARRVAGNILFLDVTALGTICTLCTMEDEHLASFISVPVSIP LAPAPVPDPELVLLLLRPGFIMKVFAWMVMPFTGMEDTRDDERGFWGKRVEIHQNEILQN QDSK >gi568815596f:15842065_16046094|GENSCAN_predicted_CDS_8|372_bp atggagttcagtgacccgttccagctgtttctggcggcagcgctgaagctgtcggcagcc cggcgggtggcaggcaacatattgtttcttgatgtgacagctttaggaacaatctgtaca ctttgcaccatggaagatgaacatttagcttcttttatatcggtccctgtctcaatacca ctggcgccagcgccagtgcctgatcctgaactagttttgctacttctcagacccggcttc atcatgaaggtatttgcctggatggtgatgccattcactggaatggaggacacaagagat gatgagagaggtttttggggaaaaagggtggagattcaccagaatgagatattgcagaac caggattcaaag