GENSCAN 1.0 Date run: 4-Nov-116 Time: 05:53:54 Sequence gi568815575f:137466692_137670067 : 203376 bp : 41.12% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7114 7225 112 0 1 65 37 113 0.181 4.32 1.02 Term + 14920 15095 176 2 2 51 34 135 0.193 1.34 1.03 PlyA + 15743 15748 6 1.05 2.00 Prom + 16181 16220 40 -2.75 2.01 Init + 19346 19424 79 1 1 68 26 77 0.320 0.77 2.02 Intr + 24013 24120 108 2 0 91 97 38 0.558 4.34 2.03 Intr + 27530 27567 38 0 2 97 80 49 0.588 2.06 2.04 Intr + 31366 31531 166 0 1 61 69 122 0.505 6.21 2.05 Intr + 40261 40335 75 2 0 64 116 19 0.107 0.87 2.06 Term + 44237 44370 134 0 2 83 41 95 0.616 1.57 2.07 PlyA + 44517 44522 6 1.05 3.12 PlyA - 45316 45311 6 1.05 3.11 Term - 48682 48460 223 0 1 108 43 158 0.905 8.71 3.10 Intr - 51923 51818 106 0 1 80 31 81 0.090 -0.05 3.09 Intr - 59186 59055 132 0 0 50 34 136 0.024 4.10 3.08 Intr - 64384 64348 37 1 1 61 96 14 0.033 -3.58 3.07 Intr - 77773 77636 138 1 0 62 116 97 0.849 9.64 3.06 Intr - 79209 78967 243 0 0 -7 62 212 0.761 5.87 3.05 Intr - 79339 79256 84 1 0 46 69 94 0.792 2.40 3.04 Intr - 81105 80909 197 1 2 30 36 287 0.708 15.91 3.03 Intr - 81790 81736 55 1 1 94 65 32 0.613 -0.87 3.02 Intr - 83005 82869 137 2 2 96 6 141 0.959 6.07 3.01 Init - 83465 83396 70 2 1 61 42 106 0.855 2.56 3.00 Prom - 85603 85564 40 -9.45 4.00 Prom + 86869 86908 40 -4.25 4.01 Init + 92072 92125 54 1 0 56 77 35 0.018 0.53 4.02 Intr + 96795 97057 263 2 2 86 77 314 0.976 25.46 4.03 Intr + 97278 97382 105 0 0 73 65 75 0.214 2.11 4.04 Intr + 98955 99349 395 0 2 64 60 301 0.792 18.27 4.05 Intr + 99831 101060 1230 1 0 119 110 1047 0.889 96.36 4.06 Intr + 102211 102374 164 2 2 121 86 241 0.961 25.97 4.07 Intr + 103200 103369 170 2 2 47 88 129 0.901 6.62 4.08 Intr + 105669 105778 110 0 2 -2 15 75 0.036 -9.79 4.09 Intr + 107278 107428 151 2 1 102 89 120 0.989 11.80 4.10 Intr + 107911 108024 114 1 0 63 87 84 0.953 4.44 4.11 Intr + 108896 109049 154 2 1 15 94 183 0.500 10.75 4.12 Intr + 110431 110494 64 0 1 70 64 108 0.045 3.97 4.13 Intr + 111978 112048 71 1 2 78 121 27 0.094 2.98 4.14 Intr + 122160 122276 117 2 0 76 34 86 0.550 1.74 4.15 Intr + 134089 134337 249 0 0 89 54 180 0.158 11.41 4.16 Term + 138705 139082 378 2 0 -2 48 253 0.210 6.10 4.17 PlyA + 139614 139619 6 1.05 5.03 PlyA - 141062 141057 6 1.05 5.02 Term - 168324 168215 110 1 2 80 53 63 0.838 -0.31 5.01 Init - 171509 171443 67 2 1 67 88 52 0.745 4.39 5.00 Prom - 174312 174273 40 -3.65 6.00 Prom + 176466 176505 40 -6.15 6.01 Init + 190249 190316 68 0 2 62 95 81 0.881 6.80 6.02 Intr + 192849 192994 146 0 2 25 80 133 0.951 5.21 6.03 Term + 193877 194130 254 0 2 48 46 279 0.927 14.52 6.04 PlyA + 194398 194403 6 -0.45 7.00 Prom + 194877 194916 40 -12.13 7.01 Sngl + 195033 195605 573 2 0 68 43 484 0.999 37.81 7.02 PlyA + 198858 198863 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 91512 91222 291 0 0 76 48 207 0.846 10.90 S.002 Init + 95483 95485 3 1 0 66 101 0 0.956 -0.85 S.003 Term + 110431 110580 150 0 0 70 45 186 0.941 9.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:137466692_137670067|GENSCAN_predicted_peptide_1|95_aa MLKGPCDWRVLNDEENMEGDEVQRGWQGPDVEGIIDQEQPFESMPQGSTFGDGSVNVCRI WPFSYLRKEIKSDIRKGSFFRAAAALKIEARESCY >gi568815575f:137466692_137670067|GENSCAN_predicted_CDS_1|288_bp atgctcaaaggaccctgtgactggagggtcctgaatgatgaggaaaacatggaaggagat gaggttcaaagaggttggcaaggaccagatgtggagggcattatagatcaggaacaaccc tttgagtctatgccacagggctccacatttggtgatggctcagtcaatgtttgtcggatc tggcccttctcttatttgaggaaagaaatcaaaagtgacataaggaaaggtagttttttc agggcagctgcagctctcaagattgaagctcgagaaagctgctattga >gi568815575f:137466692_137670067|GENSCAN_predicted_peptide_2|199_aa MGCFITSYLEKSVPRRKEAGLDAIFFDIKIQSSDHTVLLSGMPFLSLFLCKFLSHILALS PTGWDSYSKPDKGFQNTTDRLLYKEKEFIWLKVLETAELKNMAPAFGKDLCAASFHGRIW NARKRKGEQETFISYCHLLELAACAGPAPECELLEELEKTTLNFIWNQKRACIAKTIRGK KNKVGGIMLPDFKLYYRLQ >gi568815575f:137466692_137670067|GENSCAN_predicted_CDS_2|600_bp atgggctgtttcatcaccagttacttagagaaaagtgtgccaaggcgaaaggaagcaggc ttggatgccattttctttgacataaaaattcagtcctcagatcatactgttttgctgtct ggaatgcccttcctctcccttttcctgtgtaaattcttatcccacattctagccctatct cctacaggatgggactcctacagtaaacccgacaaaggattccaaaataccacagacagg ttgctttataaagaaaaagagtttatttggctcaaagttctggagactgcagagctcaag aacatggcaccagcatttggcaaggacctttgtgctgcatcattccatggaagaatatgg aatgcaaggaagcgcaagggagagcaagaaacattcatctcatattgccacctattagag ttagctgcatgtgcaggccctgcccctgagtgtgagctccttgaagaattggaaaaaact actttaaacttcatatggaaccaaaaaagagcctgcatagccaagacaatccgaggcaag aagaacaaagttggaggcatcatgctacctgacttcaaactatactacaggctacagtaa >gi568815575f:137466692_137670067|GENSCAN_predicted_peptide_3|473_aa MRPRTLLAWLPERPPGPNEAPAHFSRDFPTLPREAVPTPEGPLNDMGLGRAQSPEPEAQL ETPTGRKEPGLPSSSPCGLVPWRAGPAGVNQFKCELDGCSSRFANSCDWEMSTHAHMGDQ LQASGKPVATRTQAEEQKETPAPGLTPEQPLPRLYTINICFQPDRSRFLTQLVPGVLQAY VPLEHDSGTVREPAAAQPGNAHSLRESYRTHLAACESLARGTEAVQKMPYVPELPAMIIG ISRALGSTSSLGLQRARSSVNKGQSSLIQWLIFTRNIRRLPQGDVGQQEDSVLPFISFPM CSPVGKVKRCLLTYYVPPIAVFQQVKHKGSKKRPATMNQNSQTAPGEKEKPLTELRQVSM DVHKVLILLDLAGIICAPAYEGLEGRAYSYYFQSPQYPKWTPADWMVPAHIGEGRSLLSP LIQMPISSGTTFTDISRNNALSGYLNPVKLTPKINHHNYHLDADNCKWLPAFL >gi568815575f:137466692_137670067|GENSCAN_predicted_CDS_3|1422_bp atgaggccgcggacactgctggcctggctccccgagcgcccgcccggccctaacgaggcg ccggcgcatttttcgagggacttccccacccttccgagagaggccgtccccaccccggag gggccactaaatgacatgggccttggcagagctcagagccctgaaccagaggctcagcta gagacgcccacaggcaggaaggaaccggggctcccctcttcttctccctgtggccttgtt ccgtggagagcaggccctgccggcgtaaaccagttcaagtgtgaactcgacggctgtagt agtcggtttgccaacagttgtgactgggagatgagcactcacgcgcatatgggcgaccag ctccaggcctctgggaagcctgtggctacccgaacacaggcggaagagcagaaagaaacg ccagcgccaggtctgacccccgagcagcctctgccgcggttgtacaccattaacatctgc ttccagccagatcgaagtcgcttccttacccagcttgtccccggtgtgctacaagcctac gtgcctctagagcacgactctggcactgtccgggagcccgctgcggcccagccggggaac gcgcatagtctgagggaaagttacaggacgcacctggcagcttgcgagtcgctggcccga gggaccgaggctgttcagaaaatgccttatgttccagagctcccggcgatgattattggt atttcgagggcgcttgggtccacatctagcttgggtcttcaaagggcgaggtcaagtgta aataagggccaatcctccttgatccagtggctcatctttacacggaacatccgaagactg ccccaaggggatgtgggccaacaggaagattctgtccttcccttcatttcctttccaatg tgttctccggtgggtaaagtgaagagatgcttacttacgtactatgtgcctcctattgca gtatttcaacaagttaagcacaaaggaagtaaaaaacggccagcaacaatgaaccagaat tcccagacagctccaggggagaaagagaagcctttgacagaattaagacaagtcagtatg gatgtgcacaaagtccttatcttactggaccttgctggaatcatttgtgcacctgcctat gagggccttgaaggaagggcttattcctactactttcagtctccccagtaccccaaatgg accccagccgattggatggtgcctgctcatattggcgagggtagatctttactgagtcca ctgattcaaatgccaatctcttctggaaccaccttcacagacatatccagaaataatgct ttatctggatatctcaacccagtcaagttgacacctaaaattaaccatcacaactaccat cttgatgcagacaactgcaagtggcttcctgcttttctatag >gi568815575f:137466692_137670067|GENSCAN_predicted_peptide_4|1262_aa MELLENTLASGTKSGYTLNNDTPSEPHSQDGPGCPVKGSIARDFGLAESPQVCPLNKHPP PRPPPPHFSVDDLIELCLFVPGTTEHVLEAINYRNVAVSKRTLGTGLENRCPRDQLSEEI QTFSNQLLSQGRLGLVGGEERICALQLQRGHVLKVVRQRGPKQDWTLSAASPPSAQQWDR SSRGKEPARGGGKPARVPAASRRLPAGAWPQFPAATQARWSPLAAARQPPPHLPGTRRLP NQRGSLGRGCHVLRSALPISLAAANGQSGRRSGDSCSDGKLQPLVAPWGSPRSVQPPPPL SDYGTSEISSFAGTLSHFGRIACAQNVPPMTMLLDGGPQFPGLGVGSFGAPRHHEMPNRE PAGMGLNPFGDSTHAAAAAAAAAAFKLSPAAAHDLSSGQSSAFTPQGSGYANALGHHHHH HHHHHHTSQVPSYGGAASAAFNSTREFLFRQRSSGLSEAASGGGQHGLFAGSASSLHAPA GIPEPPSYLLFPGLHEQGAGHPSPTGHVDNNQVHLGLRGELFGRADPYRPVASPRTDPYA AGAQFPNYSPMNMNMGVNVAAHHGPGAFFRYMRQPIKQELSCKWIDEAQLSRPKKSCDRT FSTMHELVTHVTMEHVGGPEQNNHVCYWEECPREGKSFKAKYKLVNHIRVHTGEKPFPCP FPGCGKIFARSENLKIHKRTHTGEKPFKCEFEGCDRRFANSSDRKKHMHVHTSDKPYICK VCDKSYTHPSSLRKHMKVHESQGSDSSPAASSGYESSTPPAIASANSKDTTKTPSAVQTS TSHNPGLPPNFNECLNAMNRDWDDYTCDEQVTWSVYHSQPRHSDLDLDIEGPLAAVSRRS PGLRGRFAPHRLPASLGRRGCKAGISLCLIRTWGQYEPKQWRTLRSRRVGFCSEGVGDRV ESRFAVPRPRSSWEGGEGRKISGTNSSRLGLRGDQRPNGKVSPLGVVQCHAFGLLKVWGS SRGPPWEGPQCCPAWYPGQSLIPDEELDTDVEQELCSFRQAVEYNNGIVEKALGKKKTSN RRVPCKVTVAQSQGEKVGKNWKRQEKRRKGMQVKQPCGMDILCTWERESTEIVRLCIELS GALSQRKVEPSSISSHLPTEGEFGLALDSGESPIPVVRASALASLNVVWSALGPYVNINK IDKPLAIPRKKRENTQINNIRDEKRDITTDTAEMQGIISGQFQLYASKLEKLEEMDKFPD TYNLPRLNHEEIQNLNIPITGNEIKAIIKIISVKKSTGLDGFTGEFYQTFKEELIPILLK LF >gi568815575f:137466692_137670067|GENSCAN_predicted_CDS_4|3789_bp atggagcttttggaaaacactttggcaagtggcacaaagtcaggctacactttgaacaac gacaccccctcagaaccgcactctcaggatgggcccggctgtcccgtgaagggctccatt gcccgggattttggcctggctgaatcgcctcaggtctgcccccttaacaagcatccaccc ccccggcccccacctccccacttctcggttgatgatctaattgaattatgcctttttgtt ccagggactacagagcacgtgctggaggcaattaattatcggaatgtggccgtatcaaaa cggacactgggcaccggtctcgagaaccgctgtccgcgggaccagctcagtgaggaaatc caaactttttcaaaccaactacttagccaagggcggttggggttggtgggaggtgaagag aggatctgcgccctgcagctacagaggggacatgttctcaaggtggtgaggcagcggggg cccaagcaggactggacgttgagcgccgcgtctcctccatcggcgcagcagtgggaccga tcctcgcggggcaaagagcccgcgaggggtgggggaaagcccgcgagagtcccggcggcg tcccgccgtttgccggctggagcctggccccagttccccgcagccactcaagcccggtgg agcccactcgcggccgcccggcagccgccgcctcacctgccaggcacccggcggctgccc aatcagcgggggtcgctcggccgcggctgccatgttctccgctccgcgctgccaatcagc ctcgcggcggccaatgggcagtccggccggaggtcaggagactcttgcagtgacggaaag ttgcagcccctggtagcgccttgggggtctccccgcagtgtccaaccgccgccacccctt tccgactacggcacttcggagatctcctccttcgccggtaccctctctcacttcggccgg atcgcctgtgcccagaacgtcccacccatgacgatgctcctggacggaggcccgcagttc cctgggctgggagtgggcagcttcggcgcgccgcgccaccacgagatgcccaaccgtgag ccggcaggcatggggctgaatcccttcggggactcaacccacgccgccgccgccgccgcc gccgccgctgccttcaagctgagccctgccgcggcgcacgatctatcttcaggccagagc tcggctttcacgccgcagggttcgggctacgccaacgccctgggccaccatcaccaccac catcaccatcatcaccacaccagccaggtgcccagctacggtggcgctgcctctgccgcc ttcaactcaacgcgcgagtttctgttccgccagcgcagctccgggctcagtgaggcggcc tcgggtggcgggcagcacgggctcttcgccggctcggcgagcagcctgcatgctccagct ggcatccccgagccccctagctacttgctgtttcccgggctgcatgagcagggcgctggg cacccgtcgcccacagggcacgtggacaacaaccaggtccacctggggctgcgtggggag ctgttcggccgtgctgacccataccgcccagtggccagcccgcgcacggacccctacgcg gccggcgctcagtttcctaactacagccccatgaacatgaacatgggagtgaacgtggcg gcccaccacgggcccggcgccttcttccgttatatgcggcagcctatcaagcaggagctg tcgtgcaagtggatcgacgaggctcagctgagccggcccaagaagagctgcgaccggacc ttcagcaccatgcatgagctggtgacacatgtcaccatggagcatgtggggggcccggag cagaacaaccacgtctgctactgggaggagtgcccccgggagggcaagtctttcaaggcg aagtacaaactggtcaaccacatccgagtgcacacgggcgagaagcccttcccatgcccc ttcccgggctgcgggaagatctttgcccgttctgagaacctcaagatccacaagaggacc cacacaggtgagaaacctttcaaatgtgaatttgaaggctgtgacagacgctttgccaac agcagcgaccgtaagaagcacatgcatgtgcatacctcggacaagccctatatctgcaaa gtgtgcgacaagtcctacacgcacccgagctccctgcgcaaacacatgaaggttcatgaa tctcaagggtcagattcctcccctgctgccagttcaggctatgaatcttccactccaccc gctatagcttctgcaaacagtaaagataccactaaaaccccttctgcagttcaaactagc accagccacaaccctggacttcctcctaattttaacgaatgcctgaatgccatgaacaga gattgggatgactacacatgtgatgagcaggtaacctggtcagtctatcactcccaaccc agacattccgatcttgaccttgacattgaggggcccttggcggccgtaagccggcggtcg cccgggctccgaggacgcttcgcgccccatcgcctcccagcctcattaggccggcggggc tgtaaagcaggaatcagcctctgcctaatccggacctggggtcaatacgagcccaaacaa tggcgcacacttcgcagccgtcgagtcggattttgcagcgagggggttggggatagagtt gaaagccgctttgcagtgccgcggcctcggtcgagctgggagggaggggaggggaggaaa ataagcggcacgaactcttccaggttgggtcttcgtggagaccaacggccaaacggaaaa gtcagccctcttggggttgtccagtgtcacgcatttgggctcctgaaggtgtggggcagt tcccgtgggccaccttgggaagggccccagtgttgtcctgcttggtatccgggacagtct ctaattcctgacgaagaacttgatactgacgttgaacaagaactgtgttctttcagacag gcagtggagtataataatggaattgttgaaaaagccctaggaaagaaaaaaacatctaac agaagagtcccatgtaaggtgactgtagctcagtcacaaggcgagaaagtaggcaagaac tggaagagacaagagaaaaggagaaaagggatgcaagtcaaacaaccctgtggcatggac attctgtgtacttgggagagggagagcacagagattgtgaggctttgcattgaactcagt ggtgccctgtcacagcggaaagtagaaccaagcagtattagttcacatctgcccacagag ggagaatttggactggccctagacagtggggagtctcccatcccagtggtcagagcttca gctctggcaagcctgaatgtggtctggagtgctctagggccctacgtgaatataaataaa attgacaaacctttagccataccaagaaaaaaaagagagaatacccaaataaataacatc agagatgaaaaacgtgacattacgactgatactgcagaaatgcaagggatcattagtggc caattccaactatatgccagcaagttggaaaagctagaggaaatggataaattcccagat acatataacctaccaagactgaaccatgaagaaatccaaaacctgaacataccaataaca ggtaacgagatcaaagccataataaaaattatctcagtaaagaaaagcacaggacttgat ggcttcactggtgaattctatcaaacatttaaagaagaactaataccaatcctactgaag ctattctga >gi568815575f:137466692_137670067|GENSCAN_predicted_peptide_5|58_aa MKVGAKGTGMVMKAVFTVEVLEVHFEEKVQSLHSIAVRHTFHTLKEGNSKLVLKCLAD >gi568815575f:137466692_137670067|GENSCAN_predicted_CDS_5|177_bp atgaaagtgggagcaaagggaactggcatggtaatgaaggcagtctttactgtggaagta ttagaagttcattttgaagaaaaagttcaaagtctacattctattgctgtcagacatact tttcacaccttgaaggaagggaattcaaaattggttttgaagtgtcttgcagactga >gi568815575f:137466692_137670067|GENSCAN_predicted_peptide_6|155_aa MASADGVNFMGSEAFHEAGAMACLIQAKALTLFNSMKSERSEEAAEEKMEASRGRFMKFK ERSHVHNIKVQRTLAVSHDPVKLVTTSPLPAEMDIPLFSGSITQVRSPVGSAMSLIPEDG LPPILISTGVKGDYAVEDKPSQIYFRNAAVGRRWP >gi568815575f:137466692_137670067|GENSCAN_predicted_CDS_6|468_bp atggcatctgcagatggtgtgaacttcatgggcagtgaggcttttcatgaagctggagca atggcctgcctaatccaggccaaggccctaactctcttcaattctatgaagtctgagaga agtgaggaagctgcagaagaaaagatggaagctagcagaggtcgcttcatgaagtttaag gaaagaagccatgtccataacataaaagtacaaagaaccctggctgtgtcacacgaccca gtcaagctagtaactaccagtcctctaccagcagagatggatattcctctattctctggc agtataactcaggtcagaagtcctgttggaagtgcaatgagccttattcctgaagatggc cttcctcctattctcatctccactggtgtaaaaggagactatgctgtggaagataaacca tcacagatttatttcagaaatgcagcagttggaagacggtggccctga >gi568815575f:137466692_137670067|GENSCAN_predicted_peptide_7|190_aa MKDFTITRGKAHAEDPQEHIHTQWVDDDKNVSKGVVSPIDGKSMETITNVKIFYGSECKA NGKVIRWTEVFFPENPDQHNCLSDPADHSRLTEHVTKAFCLALCPHLKLLKEDGMTKLGL RVTLDSDQVGYQAGRNGQPLPSQCMNDLDSALVPVIHGGACQLSEGSVVMELIVIFKSCH GRLNHQQLAS >gi568815575f:137466692_137670067|GENSCAN_predicted_CDS_7|573_bp atgaaggacttcaccatcacccgtgggaaggcgcacgcagaggatccccaggagcacatc cacacccaatgggtggatgatgacaagaacgttagcaagggcgtcgtaagtcctatagat gggaagtccatggagactataacaaatgtgaagatattctatggatcagaatgtaaagca aatggaaaagtcatcagatggacagaggtgtttttcccagaaaatcctgaccagcacaat tgcctcagtgatcctgcagatcacagtagattgactgagcatgttaccaaggctttttgt cttgctctctgtcctcacctgaagcttctgaaggaagatggaatgaccaaactgggacta cgtgtgacacttgattcagatcaggtcggctatcaagcagggagaaatggccagcccctt ccctcgcagtgcatgaatgatctggacagcgccttggtgccggtgatccatggaggggcc tgtcagctcagtgagggctctgtcgtcatggaactcattgttatttttaaaagttgccac ggccgcctcaaccatcagcaactagcatcctga