GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:10:10 Sequence gi568815576f:39847107_40071081 : 223975 bp : 44.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 26979 27018 40 -3.06 1.01 Sngl + 28158 29009 852 2 0 26 44 713 0.640 56.50 1.02 PlyA + 29146 29151 6 1.05 2.00 Prom + 29731 29770 40 -4.86 2.01 Init + 31052 31226 175 1 1 72 39 153 0.201 8.31 2.02 Intr + 46227 46301 75 1 0 51 53 115 0.910 3.79 2.03 Intr + 46638 46784 147 1 0 90 115 29 0.958 6.01 2.04 Intr + 50130 50222 93 1 0 107 34 51 0.055 1.54 2.05 Intr + 60737 60780 44 0 2 64 86 39 0.009 -0.74 2.06 Intr + 73544 73612 69 1 0 91 89 22 0.049 1.98 2.07 Intr + 80743 80838 96 0 0 75 95 43 0.147 3.91 2.08 Intr + 89477 89537 61 1 1 2 116 61 0.112 -1.39 2.09 Intr + 99987 100078 92 1 2 73 100 101 0.211 9.51 2.10 Intr + 108713 108804 92 1 2 86 108 68 0.979 7.39 2.11 Intr + 112949 113068 120 2 0 116 84 123 0.668 14.31 2.12 Term + 113123 113138 16 2 1 87 49 16 0.409 -4.69 2.13 PlyA + 114744 114749 6 1.05 3.00 Prom + 117022 117061 40 -6.96 3.01 Init + 117269 117434 166 1 1 94 44 314 0.579 27.39 3.02 Intr + 118796 119052 257 0 2 39 85 63 0.478 -1.74 3.03 Intr + 120936 121166 231 2 0 81 70 190 0.694 14.47 3.04 Intr + 122305 122427 123 0 0 106 39 54 0.885 3.08 3.05 Term + 123799 123978 180 0 0 82 37 259 0.990 17.81 3.06 PlyA + 126032 126037 6 1.05 4.00 Prom + 130132 130171 40 -6.36 4.01 Init + 131038 131084 47 0 2 63 61 62 0.373 1.15 4.02 Term + 137069 137189 121 2 1 139 44 122 0.997 10.85 4.03 PlyA + 144217 144222 6 1.05 5.00 Prom + 145889 145928 40 -7.86 5.01 Init + 147120 147187 68 2 2 68 81 14 0.354 -0.75 5.02 Intr + 147407 147611 205 2 1 83 44 113 0.421 5.60 5.03 Intr + 147871 148425 555 0 0 53 102 1029 0.765 93.84 5.04 Intr + 172062 172229 168 2 0 92 114 269 0.982 30.04 5.05 Intr + 172781 172902 122 1 2 83 109 116 0.999 12.49 5.06 Intr + 174184 174857 674 1 2 88 99 811 0.896 73.52 5.07 Intr + 175568 175710 143 0 2 84 80 46 0.838 3.47 5.08 Intr + 176674 176791 118 0 1 -12 107 73 0.163 -0.76 5.09 Intr + 180970 181102 133 2 1 44 49 79 0.073 -0.70 5.10 Intr + 181768 181952 185 1 2 73 41 139 0.473 7.13 5.11 Intr + 182273 182409 137 0 2 61 98 7 0.444 -0.71 5.12 Intr + 189641 189700 60 1 0 62 81 81 0.458 3.73 5.13 Intr + 197730 197892 163 2 1 43 101 102 0.482 6.55 5.14 Intr + 223870 223935 66 2 0 48 82 76 0.000 1.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 50130 50239 110 1 2 107 37 74 0.837 2.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:39847107_40071081|GENSCAN_predicted_peptide_1|283_aa MQQIYTVKEIRSVAARSGPFAPVLSATSRGVAGALRPLVQATVPATPEQPVLDLKRPFLS RESLSGQAVRRPLVASVGLNVPASVCYSHTDVKVPDFSEYRRLEVLDSTKSSRESTEARK GFSYLVTGVTTVGVAYAAKNAVTQFVSSMSASADVLALAKIEIKLSDIPEGKNMAFKWRG KPLFVRHRTQKEIKQEAAVELSQLRDPQHDLDRVKKPEWVILIGVCTHLGCVPIANAGDF GGYYCPCHGSHYDASGRIRLGPATLNLEVPTYEFTSDDMVIVG >gi568815576f:39847107_40071081|GENSCAN_predicted_CDS_1|852_bp atgcaacaaatctatacagttaaagaaattcggtcggtcgcagcccgctcgggcccgttc gcgcccgtcctgtcggccacgtcccgcggggtggcgggcgcgctgcggcccttggtgcag gccacggtgcccgccaccccggagcagcctgtgttggacctgaagcggcccttcctcagc cgggagtcgctgagcggccaggccgtgcgccggcctttggtcgcctccgtgggcctcaat gtccctgcttctgtttgttattcccacacagacgtcaaggtgcctgacttctctgaatac cgccgccttgaagttttagatagtacgaagtcttcaagagaaagcaccgaggctaggaaa ggtttctcctatttggtaactggagtaactactgtgggtgtcgcatatgctgccaagaat gccgtcacccagttcgtttccagcatgagtgcttctgctgatgtgttggccctggcgaaa atcgaaatcaagttatccgatattccagaaggcaagaacatggctttcaaatggagaggc aaacccctgtttgtacgtcatagaacccagaaggaaattaagcaggaagctgcagttgaa ttatcacagttgagggacccacagcatgatctagatcgagtaaagaaacctgaatgggtt atcctgataggtgtttgcactcatcttggttgtgtacccattgcaaatgcaggagatttt ggtggttattactgcccttgccatgggtcacactatgatgcatctggcaggatcagattg ggtcctgctactctcaaccttgaagtccccacgtatgagttcaccagtgacgatatggtg attgttggttag >gi568815576f:39847107_40071081|GENSCAN_predicted_peptide_2|359_aa MKNAFGRLSGRLDTAEERIFELEDISIETTKTEKQREKPNDNNNNKNRISKNYGTTIKGP FLGILVFIHTTPTSDDDCVPDEPGYSTYSSQAQGPRPVPTSASSLSGLVQFLSGRQARWL QIPELEGTKGNEGSQAMNLGVTLTPVFVSYSSHDIHELTFEIQEWEKQGLESFRDLLKRQ AKQACLGKNNREEDADSPTPRFVFTAANCSQACLPGFDKNYFLTGTMDTTSGDEKEKNLE KAEYIVGKVDEGKASRYSMEAVAKFDFTASGEDELSFHTGDVLKILSNQEEWFKAELGSQ EGYVPKNFIDIQFPKWFHEGLSRHQAENLLMGKEVGFFIIRASQSSPGDFSISVRMLKP >gi568815576f:39847107_40071081|GENSCAN_predicted_CDS_2|1080_bp atgaagaatgcctttggcaggctttctgggagactggacacagctgaggaaagaatcttt gagctggaggacatctcaatagaaaccactaaaactgaaaagcaaagagaaaaaccgaat gacaacaacaataacaagaacagaatatccaagaactatgggacaactataaaagggcct ttcctcggcatcctcgtcttcattcatacaacccccacttcagatgacgactgtgtacca gatgagccaggatattccacctactccagccaggctcaagggcccaggccggtccccact tccgcttcctcactttctggcctggtgcagttcctgtcagggcgtcaagcaagatggttg cagatacctgagctggagggtaccaaggggaatgaaggttctcaggccatgaacctggga gtcaccttaactcctgtgtttgtctcttatagctctcatgatatccatgagcttaccttt gaaatacaagagtgggaaaagcaaggcttggagagtttccgggacttgctcaagaggcag gctaaacaagcttgccttggaaaaaataacagggaagaagatgctgactctccaaccccc agatttgtcttcacagctgcaaattgctctcaggcctgccttcctggatttgacaaaaac tattttcttactggtactatggataccacaagtggagatgagaaagaaaagaacttggaa aaagcagaatatattgtggggaaagttgatgaagggaaagcttcacgttacagcatggaa gctgttgccaagtttgatttcactgcttcaggtgaggatgaactgagctttcacactgga gatgttttgaagattttaagtaaccaagaggagtggtttaaggcggagcttgggagccag gaaggatatgtgcccaagaatttcatagacatccagtttcccaaatggtttcacgaaggc ctctctcgacaccaggcagagaacttactcatgggcaaggaggttggcttcttcatcatc cgggccagccagagctccccaggggacttctccatctctgtcagaatgctgaagccatga >gi568815576f:39847107_40071081|GENSCAN_predicted_peptide_3|318_aa MSSHEGGKKKALKQPKKQAKEMDEEEKAFKQKQKEEQKKLEVLKAKVVGKGPLATENSTI PDVSSLEVVTLSPCNIQFCFALISRHEDDVQHFKVMRDNKGNYFLWTEKFPSLNKLVDYY RTNSISRQKQIFLRDRTREDQGHRGNSLDRRSQGGPHLSGAVGEEIRPSMNRKLSDHPPT LPLQQHQHQPQPPQYAPAPQQLQQPPQQRYLQHHHFHQERRGGSLDINDGHCGTGLGSEM NAALMHRRHTDPVQLQAAGRVRWARALYDFEALEDDELGFHSGEVVEVLDSSNPSWWTGR LHNKLGLFPANYVAPMTR >gi568815576f:39847107_40071081|GENSCAN_predicted_CDS_3|957_bp atgtccagccacgaaggtggcaagaagaaggcactgaaacagcccaagaagcaggccaag gagatggacgaggaagagaaggctttcaagcagaaacaaaaagaggagcagaagaaactc gaggtgctaaaagcgaaggtcgtggggaaggggcctctggccacagagaactccaccatc ccagatgtgagcagcctggaggtggtgacattatcaccgtgtaacatacaattttgtttt gccttgatctccaggcatgaggatgacgttcaacacttcaaggtcatgcgagacaacaag ggtaattactttctgtggactgagaagtttccatccctaaataagctggtagactactac aggacaaattccatctccagacagaagcagatcttccttagagacagaacccgagaagac cagggtcaccggggcaacagcctggaccggaggtcccagggaggcccacacctcagtggg gctgtgggagaagaaatccgaccttcgatgaaccggaagctgtcggatcaccccccgacc cttcccctgcagcagcaccagcaccagccacagcctccgcaatatgccccagcgccccag cagctgcagcagcccccacagcagcgatatctgcagcaccaccatttccaccaggaacgc cgaggaggcagccttgacataaatgatgggcattgtggcaccggcttgggcagtgaaatg aatgcggccctcatgcatcggagacacacagacccagtgcagctccaggcggcagggcga gtgcggtgggcccgggcgctgtatgactttgaggccctggaggatgacgagctggggttc cacagcggggaggtggtggaggtcctggatagctccaacccatcctggtggaccggccgc ctgcacaacaagctgggcctcttccctgccaactacgtggcacccatgacccgataa >gi568815576f:39847107_40071081|GENSCAN_predicted_peptide_4|55_aa MGKRHGQLMEEDIDGKQCGEVEKSQALSQKHVGCSPDSATYAISLDKFLDHSELI >gi568815576f:39847107_40071081|GENSCAN_predicted_CDS_4|168_bp atgggcaaaagacatggacagctcatggaagaagatatagatggcaagcaatgtggagag gtggaaaagtcacaagctttgagtcagaaacacgtgggctgcagtccagactctgctact tacgctatcagcctggacaagtttcttgaccactctgagctgatctag >gi568815576f:39847107_40071081|GENSCAN_predicted_peptide_5|933_aa MSLAAYLAQTHTERHHTPYFKLSRPPHQHRTLDGEGVSSAAPEATGPEHVLRTPRARKMN SYPRRGRRGRMAARAGQRTFTPRRSRGRSPHVRLWDLGPRRGRGQGRGRGRGAMAESQLN CLDEAHVNEKVTEAQAAFYYCERRRAALEALLGGGEQAYRERLKEEQLRDFLSSPERQAL RAAWSPYEDAVPAANARGKSKAKAKAPAPAPAESGESLAYWPDRSDTEVPPLDLGWTDTG FYRGVSRVTLFTHPPKDEKAPHLKQVVRQMIQQAQKVIAVVMDLFTDGDIFQDIVDAACK RRVPVYIILDEAGVKYFLEMCQDLQLTDFRIRNIRVRSVTGVGFYMPMGRIKGTLSSRFL MVDGDKVATGSYRFTWSSSHVDRNLLLLLTGQNVEPFDTEFRELYAISEEVDLYRQLSLA GRVGLHYSSTVARKLINPKYALVSGCRHPPGEMMRWAARQQREAGGNPEGQEEGASGGES AWRLESFLKDLVTVEQVLPPVEPIPLGELSQKDGRMVSHMHRDLKPKSREAPSRNGMGEA ARGEAAPARRFSSRLFSRRAKRPAAPNGMASSVSTETSEVEFLTGKRPNENSSADISAVS GVAFQVAGGGAFLISASLSQGTFHGGVSPSIAVEPSDENWALAGQRERGNSEGEGTDSQE GPGMPGTIIPFLVRPLDEHRAASSAEPPQTCSAPLRKVTPSALAMEPLGRSLLPKGGGTP LMRVQLDLCLGGLWRKELELDREGQVLALSLAGSGSCGHLSGRMRQPGRGTREKDRSLTL QAVELHVLAGNQQLGRETVTPAPQLEMWTAHRVGTEAESTKRFTYILPRLCPLPLQVALE RSLGELGEAHAAGSIMFLGSPCCCKAKREGVTPAPLSPPYDVASIVCGEAETARLASQRN AAHSEPARLGGFTTNEFQKSFGFQKFLDFGIVX >gi568815576f:39847107_40071081|GENSCAN_predicted_CDS_5|2799_bp atgtctctggcagcctatcttgcccaaacacacacggagagacaccacacaccctacttc aaattaagcaggcccccccaccaacaccggaccctcgacggggaaggggtcagctcggcg gccccagaggccactgggccagaacacgtgctccggactccacgtgcgcggaaaatgaac tcgtacccgcgacgcggccggcgggggcggatggcagcgagggccggacagcgaaccttt accccgcggaggagtcggggccggagcccgcacgtgcggctgtgggacctcggaccgcgg cggggccggggccagggccggggccggggccggggcgccatggccgagtcccagctgaac tgcctggacgaggcgcacgtgaacgagaaggtgaccgaggcgcaggccgccttctactac tgcgagcggcggcgggccgcgctggaggcgctgctgggcggcggcgagcaggcctaccgc gagcggctcaaggaggagcagctgcgggacttcctctccagcccggagcgccaggccctg cgggccgcctggagcccctacgaggacgccgtccccgccgccaacgcccggggcaagagc aaggccaaggccaaggcccccgcgccggcgccggctgagtccggcgagtccctggcctac tggcccgaccgttccgacaccgaggtgcctcctctggacctgggctggacggacactggt ttctaccgcggcgtgagccgggtcacgctcttcacccacccgcccaaggacgagaaggcg ccgcacctcaagcaggtggtcaggcagatgatccaacaggcccagaaggtcattgctgtg gtcatggacctcttcactgatggtgatatctttcaagacattgtggatgctgcctgtaag cgccgggtcccagtgtacatcatcctggacgaggcaggagtgaagtatttcctggagatg tgtcaggacctgcagctcactgacttccggattcggaacatccgtgtccgctctgtgaca ggcgtcggcttctacatgcccatggggaggatcaaggggaccctgtcatcaaggttcctg atggtggacggtgacaaagtggccactggatcttacaggttcacctggagttcctcccat gtggacagaaacctcctcctgctcctgacaggacagaacgtagagccctttgacacggag ttccgggagctgtacgccatctccgaggaggtggacttgtaccggcagctgagcctggcg ggcagggttggcctccattactcctccactgtggctcgaaagcttatcaaccccaagtac gccttggtgtcaggctgccgccacccgcctggggagatgatgcgctgggctgcccggcaa cagcgggaggcgggcggcaacccggaggggcaggaggagggcgccagcggtggcgagtcg gcctggcgcctggagagcttcctgaaagacctggttacggtggagcaggtgctgcccccc gtggagcccatccccttgggagagctgagccagaaggatggcaggatggtctctcacatg cacagagacctgaagcccaaatcccgagaggcacccagccgaaacggcatgggagaagcg gcccggggggaggccgcccccgccaggcgcttcagcagcaggctcttcagtcgccgagcc aagaggcctgcggcgcccaatggcatggccagctctgtctccaccgagacctctgaagtg gagtttctgacggggaagaggcccaacgagaattccagtgctgacatctcagcagtctct ggggtggcctttcaggtggctggtggcggtgcctttctcatctctgcatccctgagtcag gggactttccatggtggggtttctccctccatcgctgtggagcccagtgatgagaactgg gctctggctggccagagagagagagggaatagtgagggcgagggcacggacagccaggaa ggtccggggatgcccggaaccatcatcccgttccttgtccgtcctttagacgaacacaga gcagcctcctccgcagagccgccccagacgtgctcggcaccactgcggaaagtgaccccc agcgccttggccatggagcccttggggcgcagcctcctgcccaaaggtgggggaacaccc ttaatgagggtgcagctggatttgtgcctaggcggcctgtggcggaaggagctggaactg gaccgggaaggccaggttctggcactgtcgctggctggctctgggagctgtgggcacctc agcggtagaatgcggcagcccgggaggggaaccagagagaaggaccgcagtcttactttg caggcggtggagctgcatgtactagctggtaatcagcagcttgggagggagacagtgacc ccagcccctcaactggagatgtggacagcacacagggtaggaactgaagctgaatccacc aagcgtttcacttacatccttccccgcctctgtcccttgcctctgcaggtggccctggag agaagccttggagaacttggagaagcccacgcggctggcagcatcatgtttttggggtcc ccgtgctgctgcaaggcgaaacgggagggagtgacccctgcgcccctcagtcccccctac gacgtcgcctccatagtctgcggagaagcggagactgcgcgcctcgcctcacagcgaaac gccgcgcactcggagccggcacggctcggcggtttcaccaccaatgaattccagaaaagt tttggttttcagaagtttttggattttggaattgtggnn