GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:03:36 Sequence gi568815586f:27680331_27898163 : 217833 bp : 40.68% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1216 1366 151 1 1 65 57 178 0.895 11.64 1.02 Intr + 2057 2168 112 1 1 64 57 136 0.999 7.13 1.03 Intr + 2285 2373 89 0 2 82 81 113 0.999 8.77 1.04 Intr + 7055 7177 123 1 0 123 57 80 0.984 8.26 1.05 Intr + 7968 8093 126 2 0 56 58 141 0.995 7.86 1.06 Intr + 8685 8873 189 2 0 107 81 136 0.996 13.66 1.07 Intr + 11419 11598 180 0 0 53 99 233 0.987 20.04 1.08 Intr + 12261 12326 66 2 0 83 113 44 0.959 4.58 1.09 Term + 12466 12552 87 0 0 125 47 109 0.999 7.18 1.10 PlyA + 12726 12731 6 -0.45 2.00 Prom + 15195 15234 40 -8.65 2.01 Init + 16233 16913 681 2 0 78 64 575 0.268 49.26 2.02 Intr + 30418 30625 208 0 1 55 66 209 0.830 13.13 2.03 Intr + 31065 31174 110 1 2 52 76 78 0.394 2.08 2.04 Intr + 32102 32245 144 1 0 41 98 50 0.283 0.86 2.05 Intr + 35961 36128 168 2 0 50 116 146 0.434 12.82 2.06 Intr + 39478 39538 61 0 1 52 115 30 0.996 -0.51 2.07 Intr + 43717 43856 140 2 2 70 52 186 0.950 12.46 2.08 Intr + 55117 55226 110 0 2 77 63 96 0.612 4.16 2.09 Intr + 57209 57278 70 2 1 109 106 -13 0.415 0.87 2.10 Term + 74851 75120 270 0 0 107 38 287 0.971 20.10 2.11 PlyA + 75944 75949 6 1.05 3.06 PlyA - 75967 75962 6 1.05 3.05 Term - 83066 82408 659 0 2 114 39 282 0.948 19.02 3.04 Intr - 86469 86335 135 1 0 85 47 89 0.910 4.12 3.03 Intr - 89281 89147 135 2 0 33 67 91 0.536 1.12 3.02 Intr - 90854 90718 137 1 2 71 115 79 0.694 8.19 3.01 Init - 97098 97022 77 0 2 54 40 86 0.594 1.01 3.00 Prom - 97335 97296 40 -3.85 4.00 Prom + 98166 98205 40 -13.40 4.01 Init + 100001 100872 872 1 2 85 99 1117 0.805 106.56 4.02 Intr + 111378 111571 194 0 2 121 107 176 0.964 21.01 4.03 Term + 117385 117836 452 2 2 111 42 530 0.913 45.06 4.04 PlyA + 117857 117862 6 -0.45 5.06 PlyA - 118529 118524 6 1.05 5.05 Term - 119210 119073 138 2 0 136 38 28 0.130 -0.42 5.04 Intr - 123460 123317 144 1 0 86 40 91 0.372 3.66 5.03 Intr - 123955 123868 88 0 1 21 44 101 0.252 -1.95 5.02 Intr - 124083 124021 63 2 0 11 59 144 0.059 0.71 5.01 Init - 134068 133875 194 1 2 62 90 152 0.479 11.29 5.00 Prom - 139473 139434 40 -7.65 6.04 PlyA - 140286 140281 6 1.05 6.03 Term - 144543 144428 116 1 2 52 55 130 0.661 3.85 6.02 Intr - 149525 149408 118 2 1 49 92 61 0.082 1.72 6.01 Init - 158195 157848 348 2 0 50 60 144 0.083 5.03 6.00 Prom - 158347 158308 40 -5.55 7.05 PlyA - 158933 158928 6 1.05 7.04 Term - 163528 163373 156 1 0 44 55 152 0.449 4.45 7.03 Intr - 170835 170709 127 2 1 54 53 106 0.755 3.46 7.02 Intr - 197496 197352 145 1 1 119 43 62 0.003 3.22 7.01 Init - 214169 214115 55 2 1 76 59 64 0.335 3.80 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 173090 172975 116 2 2 67 43 103 0.836 3.43 S.002 Term - 194136 194042 95 1 2 75 49 90 0.826 0.81 S.003 Init - 194342 194222 121 2 1 68 74 95 0.901 6.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:27680331_27898163|GENSCAN_predicted_peptide_1|374_aa XDLDMPFAKWTKEQVCNWLMEQGLGSYLNSGKHWIASGQTLLQASQQDLEKELGIKHSLH RKKLQLALQALGSEEETNHGKLDFNWVTRWLDDIGLPQYKTQFDEGRVDGRMLHYMTVDD LLSLKVVSVLHHLSIKRAIQVLRINNFEPNCLRRRPSDENTIAPSEVQKWTNHRVMEWLR SVDLAEYAPNLRGSGVHGGLMVLEPRFNVETMAQLLNIPPNKTLLRRHLATHFNLLIGAE AQHQKRDAMELPDYVLLTATAKVKPKKLAFSNFGNLRKKKQEDGEEYVCPMELGQASGSA SKKGFKPGLDMRLYEEDDLDRLEQMEDSEGTVRQIGAFSEGINNLTHMLKEDDMFKDFAA RSPSASITDEDSNV >gi568815586f:27680331_27898163|GENSCAN_predicted_CDS_1|1125_bp nntgacttggatatgccatttgccaagtggaccaaggagcaggtttgcaattggctgatg gaacagggcttgggctcgtacctgaattctggcaagcactggattgcatctggccaaacg cttttgcaggcttctcaacaagatctagagaaggaacttggaatcaagcattcacttcat cgaaagaaactccagctagcactccaagccctgggatctgaagaagaaaccaatcatggg aagctggatttcaactgggtcactagatggttggatgacattggcctccctcaatataag acccagtttgatgaaggacgggttgatggtcgaatgctacattacatgactgttgatgac ttactgtctctgaaggttgtaagtgtgctacaccatctcagtatcaaaagggccatccag gtcctgaggatcaataactttgaaccaaactgtctacggaggcggccatctgatgagaat accatcgccccatcagaagttcagaagtggactaaccatcgagtgatggagtggctgcgc tccgtggacttggcagaatatgcgcccaatctcagaggcagtggtgtccatggtgggctc atggttctagagcctcgttttaacgtagaaacaatggctcagttattgaacatcccaccc aataagactttgctgcgaagacatttggccactcatttcaaccttctgattggggctgag gcacagcaccagaagcgagatgccatggagctgccggattatgtacttctaacagctact gccaaagtgaagccaaagaaacttgcctttagcaattttgggaatttgagaaagaagaaa caggaagatggtgaagaatatgtttgtccaatggaattgggacaggcatcaggaagtgca tctaagaaaggatttaaacctggtttggatatgcgcctgtatgaggaagatgatttggac cggttagagcagatggaagattcagaagggacagtgagacagataggtgcattctctgaa ggcatcaacaatctgacgcacatgttaaaagaagatgacatgtttaaagattttgctgcc cgttcccccagtgccagcattacagatgaagactcaaacgtttga >gi568815586f:27680331_27898163|GENSCAN_predicted_peptide_2|653_aa MGQKASQQLALKDSKEVPVVCEVVSEAIVHAAQKLKEYLGFEYPPSKLCPAANTLNEIFL IHFITFCQEKGVDEWLTTTKMTKHQAFLFGADWIWTFWGSNKQIKLQLAVQTLQMSSPPP VESKPCDLSNPESRVEESSWKKSRFDKLEEFCNLIGEDCLGLFIIFGMPGKPKDIRGVVL DSVKSQMVRSHLPGGKAVAQFVLETEDCVFIKELLRNCLSKKDGLREGGASPGSLRLAAP GPPLTLNAACPLRLAVLAAMAAAALPAWLSLQSRARTLRAFSTAVYSATPVPTPSLRVDD LHLTEIVGMLDSVLTPEDSSGKYRFISGEVLCRITGCFTGVRVEAKDLFGGCCSNPNEVM VTWIKVIVEKEVWLYLRYILKALPPRTEKMAVDQDWPSVYPVAAPFKPSAVPLPVRMGYP VKKGVPMAKEGNLELLKIPNFLHLTPVAIKKHCEALKDFCTEWPAALDSDEKCEKHFPIE IDSTDYVSSGPSVRNPRARVVVLRVKLSSLNLDDHAKKKLIKLVGERYCKTTDVLTIKTD RCPLRRQNYDYAVYLLTVLYHESWNTEEWEKSKTEADMEEYIWENSSSERNILETLLQMK AAEKNMEINKEELLGTKEIEEYKKSVVSLKNEEENENSISQYKESVKRLLNVT >gi568815586f:27680331_27898163|GENSCAN_predicted_CDS_2|1962_bp atggggcagaaagcatcgcaacagttggctctgaaggacagcaaagaggtgcccgtcgtc tgtgaggtggtcagtgaagctatagtccatgcagctcagaaactgaaggagtaccttgga tttgaatatcctccaagtaaactctgcccagctgcaaatactctgaatgagatcttctta atccatttcatcactttctgccaagaaaagggagttgatgagtggctgaccaccaccaag atgaccaagcaccaagccttcctgtttggtgcagactggatttggaccttttggggatcc aacaagcaaataaagcttcagctcgcagtacagactctgcagatgtcttcacctcctcct gtggaatctaagccttgtgacctttccaatccagaatcaagggtagaggagtcttcctgg aagaaaagtagatttgataagctggaagaattctgtaacttaataggagaggattgcctg ggtctgtttatcatctttggtatgccaggaaagcctaaagacatcaggggagttgtcctg gacagtgtcaaaagtcagatggtgaggagccatctgccaggagggaaggctgtggctcag tttgtcctggaaactgaagattgtgtgttcatcaaagagctgctcagaaattgtctgagt aagaaagacgggctgagagagggcggtgccagcccagggagcctgcgcttagcggcccca ggcccgcccctgacactgaacgccgcttgtcccctccggcttgccgtcctcgcagccatg gcggccgccgcgctcccagcatggctgtctctgcagtcgagggcaaggactctgcgtgca ttctccactgccgtctactcggccactccggtcccgacacctagcctgcgcgttgatgac ctccatttaacagagattgtagggatgcttgatagcgtgcttaccccggaggactctagt ggaaaatacagatttattagcggagaagtgctatgtaggatcactggctgctttactggg gtaagagtggaagccaaagatctgtttggaggctgttgcagtaatccaaatgaggtgatg gtgacttggattaaagtaatagtggagaaggaggtgtggttgtatttgcgatatattttg aaggcactacctcctaggacagagaaaatggctgttgaccaggactggcctagtgtttac ccagttgcagcaccatttaaaccctctgcagtacctcttcctgttcgaatgggttatcca gtaaaaaagggcgtgcccatggcaaaggagggaaatctagaacttttaaagattcccaat tttctgcatttgactcctgtagcaattaaaaagcactgtgaagcccttaaagatttttgc actgagtggccagccgcactggacagtgacgagaaatgtgagaagcattttccaattgaa attgacagcactgattatgtttcatcaggaccatctgttcggaaccccagagcacgagta gtagtcttaagagtaaagctttccagtttgaatttagatgatcacgcaaagaagaaatta attaaacttgtaggagagcgatactgcaagaccacagatgtgcttaccatcaaaacagat aggtgccctttaaggaggcagaattacgattatgcagtgtatctactaacagtgttatac catgagtcttggaatactgaagaatgggaaaaaagtaagactgaagcagacatggaagag tatatatgggaaaatagctcatcagaaagaaatatcctggaaacgcttctccagatgaaa gctgctgagaaaaatatggaaataaataaagaagagctccttggtactaaagaaattgaa gagtacaaaaagtctgttgttagtcttaaaaatgaggaggaaaatgaaaattccatttct cagtacaaagaatccgtgaagagactattaaatgtgacatga >gi568815586f:27680331_27898163|GENSCAN_predicted_peptide_3|380_aa MRPLYQRDLDLRFFYASTGNNARFSGDCWIRRFPGLLINLEESQKLGAQFLKYYSESTGQ KCSRSCCLRKDDAFNSRKSSLRGDSGGAGRFLEFSRSMNLPDSTAEKLWPLDCRGYFSCN LAVFYHSPIHDNINCLHVHCPTLESCILEPGTSAILYNITDGIDPDLLVFEQSPTYLNTR SSSNRWDRLRILKAMNLDKQTTTINGMLPSTEAPSSTTHQDLVVNTNSTSYSKELTTDFW ARFTSLNESITTKINKVSPSTDFISNPDNKTISPFFEPIDTKLSHMPVPPGLNSSKQLLN KTKGYNSRNHTSANEDEVSVTSKTWLVSVALCTSVIFLGCCIVILASGCCGKQQGQYKPG QRKSGSLQIKNRNHMKENSS >gi568815586f:27680331_27898163|GENSCAN_predicted_CDS_3|1143_bp atgcgtcccctgtatcaaagagacttggacctacgatttttctatgcctcaaccggcaac aatgccagattcagtggagactgctggatccgtcgcttcccaggtcttctaatcaatctg gaggagtctcagaagctgggagcccagttcttgaagtattattctgaaagcactggccag aagtgcagtaggagctgctgtcttcggaaggatgatgcttttaactcccgtaagtcctca ctgaggggagatagcggaggggcaggacgctttcttgaattttcccgtagcatgaattta cctgactccacagcagagaagctctggcctttggactgtcggggatacttttcctgtaac ctggctgtcttctaccacagtcctattcatgacaatatcaactgcctccatgttcactgc ccaacactggagagctgcatattagagcctggaaccagtgccattttgtacaacataaca gacggtatagatccggatttgctggtttttgaacaatctcccacatatctaaatactcgt tcttcatccaatagatgggacagactaaggattctaaaagctatgaatttagataaacaa accaccacgataaatggtatgctgccatccacagaggctccatcctcaaccacgcatcaa gatttggttgtaaacacaaacagtaccagttattctaaggaattaaccacagatttttgg gcaagatttacttccctgaatgagtccattaccacaaagataaataaggtgtcaccaagt actgatttcatcagcaatccagataataagactatttctcctttctttgaacccatagac acaaaactttctcatatgcctgttccacctggactcaacagtagcaaacaattactaaac aaaaccaaaggatacaatagcagaaaccacacatctgcaaatgaagatgaggtatctgtg acttcaaagacttggctggtttctgtggccctttgcacctctgtcatctttctcggctgt tgtatagtcatcctggcatctggatgctgtggaaagcagcagggccagtataaaccagga cagagaaaatcaggatccttgcaaattaaaaaccgtaaccatatgaaggagaactcttca tag >gi568815586f:27680331_27898163|GENSCAN_predicted_peptide_4|505_aa MSAEEMVQIRLEDRCYPVSKRKLIEQSDYFRALYRSGMREALSQEAGGPEVQQLRGLSAP GLRLVLDFINAGGAREGWLLGPRGEKGGGVDEDEEMDEVSLLSELVEAASFLQVTSLLQL LLSQVRLNNCLEMYRLAQVYGLPDLQEACLRFMVVHFHEVLCKPQFHLLGSPPQAPGDVS LKQRLREARMTGTPVLVALGDFLGGPLAPHPYQGEPPSMLRYEEMTERWFPLANNLPPDL VNVRGYGSAILDNYLFIVGGYRITSQEISAAHSYNPSTNEWLQVASMNQKRSNFKLVAVN SKLYAIGGQAVSNVECYNPEQDAWNFVAPLPNPLAEFSACECKGKIYVIGGYTTRDRNMN ILQYCPSSDMWTLFETCDVHIRKQQMVSVEETIYIVGGCLHELGPNRRSSQSEDMLTVQS YNTVTRQWLYLKENTSKSGLNLTCALHNDGIYIMSRDVTLSTSLEHRVFLKYNIFSDSWE AFRRFPAFGHNLLVSSLYLPNKAET >gi568815586f:27680331_27898163|GENSCAN_predicted_CDS_4|1518_bp atgtcggccgaggagatggtgcagatccgcctggaggaccgctgctacccggtgagcaag aggaagctcatcgagcagagcgactacttccgcgccctctaccgctccggcatgcgcgag gccctgagccaggaggccggcggcccggaggtgcagcagctgcgcggcctcagcgcgccg ggcctgcggctggtgctggacttcatcaacgccggcggggcccgcgaaggctggctcctg ggcccgcgcggggaaaagggcggcggggtggacgaggacgaggagatggatgaggtgagc ctgctgtccgagctggtggaggcggcctccttcctgcaggtcacgtccctgctgcagctg ctgctgtcccaggtgcggctcaataactgcctggagatgtaccgcctggcgcaggtgtac gggctgcccgacctgcaggaggcctgcctgcgcttcatggtcgtccacttccacgaggtg ctgtgcaagccccagttccacctcctggggtctcctccccaagctccaggggatgtcagc ctgaagcagaggctgagggaggcccggatgactgggactcctgtcctcgtggccctcggg gacttcctggggggacccctggcccctcacccctaccagggggagcccccgtccatgctc aggtacgaggagatgactgagcgttggttcccgctggccaacaaccttcctcccgacctg gtcaatgtcaggggctatgggtctgccatcctggacaactacctcttcatagtgggcggg tacaggatcactagccaggagatctccgctgcgcattcctacaaccccagcaccaacgag tggctccaggtggcctccatgaaccagaagaggtctaacttcaaacttgtggctgttaat tcaaaactctatgccatcggagggcaggccgtttctaacgttgagtgttacaaccccgag caggatgcgtggaattttgtggcgcccttacccaatcctctggctgagttctctgcctgt gagtgtaagggaaaaatttatgtcattggaggatacactaccagagaccggaacatgaac attttgcagtactgcccctcttccgacatgtggacgctctttgaaacatgtgacgtccac attcgcaagcagcagatggtgtctgtggaagagaccatctacatcgtgggggggtgtctc cacgagctggggcccaaccgcaggagcagccagagcgaggacatgctcaccgtgcagtcc tacaacaccgtcacccgccagtggctctacctcaaggagaacacgtccaaatcgggtctt aacttgacttgtgcgctccataacgacggcatctacatcatgagcagagacgtcaccctg tcgaccagcttggaacaccgagtgttcctcaagtacaacatcttttcagatagttgggaa gcatttcggcgttttccagcttttggacataacttgctggtttcttctctttatctgccc aataaagcagaaacatga >gi568815586f:27680331_27898163|GENSCAN_predicted_peptide_5|208_aa MESYLVMKKEILSSFSMRMSLEDIVSSETSQAQKDTTHSHSYMEAKKVDPIKIESRIVVT GGWKGEGGGGKGGEGEKEEEEEEEIGPWDVVWERQTPMVSDYPKAKREIDAIKLKEEAAT SPPGRSAMERAAAKQDQRLPDWLTRKPTPPLARERIPFKPVTTDFLKVRHMLTEMVLPAV TAERWNPAHSLTDAITKYQIISECFQSK >gi568815586f:27680331_27898163|GENSCAN_predicted_CDS_5|627_bp atggaatcctatttagtcatgaaaaaggagatcctgtcttccttctctatgcggatgagc ctggaggacatcgtgtcaagtgaaacaagtcaggcacagaaagataccacccattctcat tcctatatggaagctaaaaaagttgatcccataaaaatagagagtagaattgtggttacc ggaggctggaaaggagaaggaggaggaggaaaaggaggagaaggagaaaaagaggaagaa gaagaagaggaaataggtccctgggatgtagtctgggaacggcagacccctatggtttca gattatcccaaagctaagagagagattgacgccatcaaactgaaggaagaggctgccact tcacccccaggcaggtcagccatggaacgcgctgctgcaaagcaggaccagaggctccca gactggctcactcggaaacccactcctccactagcaagagaaaggataccctttaaacca gtgaccactgactttcttaaagtgaggcacatgctaactgaaatggttctgcccgctgtt actgcagaaagatggaatccagctcacagtttaactgatgccattacaaaatatcagata atttcagagtgttttcaatcaaaataa >gi568815586f:27680331_27898163|GENSCAN_predicted_peptide_6|193_aa MLKVNQRTVDCLTDGGKSKPVLKWWDAGSRGKLWFEKQIQENSEFKVPVRNEAKALKRKG LSESLHKESLSQTLGPPPCYTVRNSPSSTPPQDTRVFSLATFPQRNYSHEAQGSVGRMCS TMRQVWWKADSLQAKNESSQRWINDLKCFLQGPLPGKSFQAQYNSQGGDRDGRDRGLEEP KKSEFCEPKFPRY >gi568815586f:27680331_27898163|GENSCAN_predicted_CDS_6|582_bp atgctaaaagtaaatcagagaactgtggattgcttaactgatggaggaaagtcaaaacca gtgctgaaatggtgggatgcaggaagtaggggcaagctatggtttgagaagcaaatccag gaaaactcagaattcaaggtacctgttaggaatgaggccaaggcactgaaaagaaaaggc ctgtctgaaagtctacataaggaaagtctatctcagaccctaggcccacccccatgctac actgtcaggaactctccatcttccaccccaccacaagatacaagggttttctccctagca acatttccccagagaaactacagtcatgaagcacagggaagtgtggggaggatgtgctct actatgcggcaggtctggtggaaagctgacagtctccaagccaagaacgaaagcagccag cgctggataaacgacttgaagtgctttctccagggaccccttccaggaaagagttttcaa gcccagtacaacagccagggtggagaccgggacggcagagacagagggctggaagaacct aaaaagagtgaattctgtgaaccaaagtttccacgatattga >gi568815586f:27680331_27898163|GENSCAN_predicted_peptide_7|160_aa MSAETLQARKEWDDMLKVHADLKDTPNKHSQHYAVSGSASWRTQIVSSGINLALSLGFLI DTQYKKRAQNCAEAEMDELTGVVFGKWVITNFAELKDYVLTQCKEAKTMITEEPLPGYAR VYTDKLCGGVARTLKWELGCEASATRAATCDSAASQETRF >gi568815586f:27680331_27898163|GENSCAN_predicted_CDS_7|483_bp atgtcagcagaaaccctgcaggccagaaaagaatgggatgacatgttgaaagtgcatgct gatctcaaggacacacccaataaacattctcagcactacgctgtctcaggttctgcttcc tggagaacccaaattgtgtcatcaggaatcaatttggcattatcccttgggtttttaatc gatacccagtacaaaaaaagggcacagaactgtgctgaagctgagatggatgaactgaca ggagtagtcttcggaaagtgggtaataacaaacttcgctgagctaaaggattatgttcta acccaatgcaaagaagctaagaccatgataacagaagagccattgccgggttatgctcgt gtgtacacagacaagctttgtggtggtgtggcgaggaccctgaagtgggagttgggatgc gaagcttctgcaacaagggctgccacatgtgattcagctgcatctcaagaaacaaggttt tga