GENSCAN 1.0    Date run: 5-Nov-116    Time: 05:31:48

Sequence gi568815597f:21978479_22190029 : 211551 bp : 48.72% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2346   2443   98  2  2   66   80   151 0.989  10.91
 1.02 Intr +   2560   2694  135  1  0   93   92   196 0.979  20.18
 1.03 Intr +   5216   5352  137  2  2   98   54   145 0.977  12.31
 1.04 Intr +   5711   5853  143  0  2  104   82   169 0.530  17.97
 1.05 Intr +   8053   8205  153  0  0   87   78   187 0.913  17.87
 1.06 Term +  20477  20581  105  1  0   88   48    31 0.009  -2.49
 1.07 PlyA +  21169  21174    6                               1.05

 2.00 Prom +  21811  21850   40                              -5.06
 2.01 Init +  23200  23239   40  0  1   60   91    88 0.647   4.99
 2.02 Intr +  24525  24610   86  1  2  121  110    34 0.987   8.44
 2.03 Intr +  26969  27066   98  1  2   66   80   154 0.991  11.21
 2.04 Intr +  27184  27318  135  1  0  102   92   156 0.979  17.08
 2.05 Intr +  28400  28536  137  2  2   98   54   137 0.999  11.51
 2.06 Intr +  28895  29037  143  0  2  127   92   159 0.941  20.27
 2.07 Intr +  31227  31379  153  2  0   92   78   238 0.912  23.47
 2.08 Intr +  46814  46894   81  1  0   95  103    20 0.282   4.03
 2.09 Term +  51958  52077  120  0  0   69   55    66 0.329  -0.13
 2.10 PlyA +  52740  52745    6                               1.05

 3.00 Prom +  67791  67830   40                              -4.66
 3.01 Init +  74023  74194  172  0  1   86   96   117 0.659   9.90
 3.02 Intr +  74216  74264   49  0  1   55  105    69 0.430   3.04
 3.03 Intr +  96520  96613   94  1  1   47   77    33 0.009  -1.93
 3.04 Intr + 103244 103316   73  1  1  115   67    96 0.998   9.18
 3.05 Intr + 107961 108070  110  1  2  113  121     7 0.993   6.50
 3.06 Intr + 108191 108388  198  1  0   29  110   124 0.605   8.25
 3.07 Intr + 112950 113023   74  2  2  104   89    94 0.782   9.30
 3.08 Intr + 134839 134974  136  1  1   60   69    78 0.263   3.67
 3.09 Intr + 135931 136102  172  0  1   33  105    65 0.332   2.22
 3.10 Term + 136759 136838   80  2  2   99   44    18 0.173  -3.57
 3.11 PlyA + 137551 137556    6                               1.05

 4.08 PlyA - 140552 140547    6                               1.05
 4.07 Term - 142039 141572  468  1  0  125   47   721 0.998  66.67
 4.06 Intr - 142875 142733  143  1  2   45   67   180 0.842  11.67
 4.05 Intr - 143098 142967  132  2  0  142   73   126 0.929  17.12
 4.04 Intr - 151373 151138  236  1  2   91  100   496 0.996  48.43
 4.03 Intr - 152625 152555   71  0  2   12   84    60 0.299  -4.02
 4.02 Intr - 157919 157793  127  1  1  107  100    47 0.712   8.48
 4.01 Init - 164444 164368   77  2  2   99   87   141 0.921  13.98
 4.00 Prom - 177542 177503   40                              -5.46

 5.06 PlyA - 178485 178480    6                               1.05
 5.05 Term - 180672 180375  298  2  1   71   52   139 0.255   3.34
 5.04 Intr - 184855 184728  128  1  2   73   15   131 0.481   3.78
 5.03 Intr - 198063 197984   80  1  2   98   46    42 0.222   0.27
 5.02 Intr - 207290 207237   54  0  0   85   82    65 0.412   4.55
 5.01 Init - 211126 210649  478  1  1   93   86   200 0.736  15.90

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  90390  90260  131  1  2   36   42   149 0.931   3.44
S.002 Init -  91458  91392   67  0  1   63   82    55 0.919   1.73
S.003 Init + 100001 100105  105  1  0   69   93    39 0.964   2.72

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:21978479_22190029|GENSCAN_predicted_peptide_1|256_aa
VSLQYEKSGSFYHTCGGSLIAPDWVVTAGHCISSSRTYQVVLGEYDRAVKEGPEQVIPIN
SGDLFVHPLWNRSCVACGNDIALIKLSRSAQLGDAVQLASLPPAGDILPNETPCYITGWG
RLYTNGPLPDKLQEALLPVVDYEHCSRWNWWGSSVKKTMVCAGGDIRSGCNGDSGGPLNC
PTEDGGWQVHGVTSFVSAFGCNTRRKPTVFTRVSAFIDWIEEVLEGGSSSGISLKLLLGQ
KTLGELISRATASVSP

>gi568815597f:21978479_22190029|GENSCAN_predicted_CDS_1|771_bp
gtttccctgcagtatgagaaaagcggaagcttctaccacacctgtggcggtagcctcatc
gcccccgactgggttgtgactgccggccactgcatctcgagctcccggacctaccaggtg
gtgttgggcgagtacgaccgtgctgtgaaggagggccccgagcaggtgatccccatcaac
tctggggacctctttgtgcatccactctggaaccgctcgtgtgtggcctgtggcaatgac
atcgccctcatcaagctctcacgcagcgcccagctgggagacgccgtccagctcgcctca
ctccctccggctggtgacatccttcccaacgagacaccctgctacatcaccggctggggc
cgtctctataccaacgggccactcccagacaagctgcaggaggccctgctgccggtggtg
gactatgaacactgctccaggtggaactggtggggttcctccgtgaagaagaccatggtg
tgtgctggaggggacatccgctccggctgcaatggtgactctggaggacccctcaactgc
cccacagaggatggtggctggcaggtccatggcgtgaccagctttgtttctgcctttggc
tgcaacacccgcaggaagcccacggtgttcactcgagtctccgccttcattgactggatt
gaggaggtgttggaaggaggttcatcttcagggatttcactcaagctactcctaggccag
aaaaccctcggagagttaatcagccgtgcaactgccagtgtcagtccataa

>gi568815597f:21978479_22190029|GENSCAN_predicted_peptide_2|330_aa
MLRLLSSLLLVAVASGYGPPSSHSSSRVVHGEDAVPYSWPWQVSLQYEKSGSFYHTCGGS
LIAPDWVVTAGHCISRDLTYQVVLGEYNLAVKEGPEQVIPINSEELFVHPLWNRSCVACG
NDIALIKLSRSAQLGDAVQLASLPPAGDILPNKTPCYITGWGRLYTNGPLPDKLQQARLP
VVDYKHCSRWNWWGSTVKKTMVCAGGYIRSGCNGDSGGPLNCPTEDGGWQVHGVTSFVSA
FGCNFIWKPTVFTRVSAFIDWIEELGKLRLRKIKNFSGAKKLLRMYPKLRTVQIPREPSG
LPPETLSEDSVEDFKTMLESLPAEVQEVLL

>gi568815597f:21978479_22190029|GENSCAN_predicted_CDS_2|993_bp
atgctccggctgctcagttccctcctccttgtggccgttgcctcaggctatggcccacct
tcctctcactcttccagccgcgttgtccatggtgaggatgcggtcccctacagctggccc
tggcaggtttccctgcagtatgagaaaagtggaagcttctaccacacgtgtggcggtagc
ctcatcgcccccgattgggttgtgactgccggccactgcatctcgagggatctgacctac
caggtggtgttgggtgagtacaaccttgctgtgaaggagggccccgagcaggtgatcccc
atcaactctgaggagctgtttgtgcatccactctggaaccgctcgtgtgtggcctgtggc
aatgacatcgccctcatcaagctctcacgcagcgcccagctgggagatgccgtccagctc
gcctcactccctcccgctggtgacatccttcccaacaagacaccctgctacatcaccggc
tggggccgtctctataccaatgggccactcccagacaagctgcagcaggcccggctgccc
gtggtggactataagcactgctccaggtggaactggtggggttccaccgtgaagaaaacc
atggtgtgtgctggagggtacatccgctccggctgcaacggtgactctggaggacccctc
aactgccccacagaggatggtggctggcaggtccacggtgtgaccagctttgtttctgcc
tttggctgcaacttcatctggaagcccacggtgttcactcgagtctccgccttcatcgac
tggattgaggagttgggcaaactgcggctcagaaagataaagaacttctccggagcgaaa
aagctgctaagaatgtatccaaaactccgcacggtccagattccacgagagccttctggg
ctcccaccagagaccctcagtgaagactctgttgaagactttaagaccatgctagaaagc
ctcccggcagaggttcaggaagttctgctatga

>gi568815597f:21978479_22190029|GENSCAN_predicted_peptide_3|385_aa
MQSIAPNSRAAPARACATGGRSQGSPGTALTRPSLFPFPVPPTSAGTQLCVSCALTSAEE
TPRSAANAPVEKLRVKAEGTSLSFQLKIESGLFGKLAVISPVSQKVFDNYAVTVMIGGEP
YTLGLFDTAGQEDYDRLRPLSYPQTDVFLVCFSVVSPSSFENVKEKWVPEITHHCPKTPF
LLVGTQIDLRDDPSTIEKLAKNKQKPITPETAEKLARDLKAVKYVECSALTQKGLKNVFD
EAILAALEPPEPKKSRREAMSVAFSSRSFKEATVPDVATKSIVDPYIWQEACEDLRKKGS
PWAWEQWLPFWGSQGTWPECALSGTVSSSAGPGPPLNLEKALGYWCCSWRTTEPLSKQGD
FPWTGCSQEEAEKEVYFVTAFSRLS

>gi568815597f:21978479_22190029|GENSCAN_predicted_CDS_3|1158_bp
atgcagagcatagccccgaactcacgagctgcgccggcccgcgcgtgcgcgacaggcggg
cggagccagggcagtccaggcaccgccttgacccgccccagcctcttccccttccctgtt
cctcccacttccgcgggcacccaactgtgcgtctcctgcgcgctgacgtcagccgaggag
accccgcgcagtgctgccaacgccccggtggagaagctgagggttaaagcagagggaact
tcattatcattccaattgaagattgaaagtggcctgtttggtaaactggctgtcatctct
cctgtttctcagaaggtttttgacaactatgcagtcacagttatgattggtggagaacca
tatactcttggactttttgatactgcagggcaagaggattatgacagattacgaccgctg
agttatccacaaacagatgtatttctagtctgtttttcagtggtctctccatcttcattt
gaaaacgtgaaagaaaagtgggtgcctgagataactcaccactgtccaaagactcctttc
ttgcttgttgggactcaaattgatctcagagatgacccctctactattgagaaacttgcc
aagaacaaacagaagcctatcactccagagactgctgaaaagctggcccgtgacctgaag
gctgtcaagtatgtggagtgttctgcacttacacagaaaggcctaaagaatgtatttgac
gaagcaatattggctgccctggagcctccagaaccgaagaagagccgcagggaggccatg
tctgttgctttctcaagtagaagcttcaaggaggcaactgttcctgatgtcgctacaaag
tccattgttgacccttatatctggcaggaggcctgtgaggacttaagaaaaaaagggagt
ccttgggcttgggaacagtggctgcctttttggggcagccagggcacctggccagagtgt
gctctgtcgggcactgtcagctcttcagcaggccccggtcctcctcttaacctagagaag
gccctgggttattggtgttgttcttggcggaccaccgagcccctcagcaaacagggagac
ttcccatggactggttgttctcaggaagaggctgagaaggaggtctactttgtcactgca
ttttcacggctctcctag

>gi568815597f:21978479_22190029|GENSCAN_predicted_peptide_4|417_aa
MSPRSCLRSLRLLVFAVFSAAASNWLFPQPSREDSSMGQWPQDMKGCGDTGRMGLAKEEL
SAGKLGDKAVKPGKCQVQVLLNLTGCVALHMRYLAKLSSVGSISEEETCEKLKGLIQRQV
QMCKRNLEVMDSVRRGAQLAIEECQYQFRNRRWNCSTLDSLPVFGKVVTQGTREAAFVYA
ISSAGVAFAVTRACSSGELEKCGCDRTVHGVSPQGFQWSGCSDNIAYGVAFSQSFVDVRE
RSKGASSSRALMNLHNNEAGRKAILTHMRVECKCHGVSGSCEVKTCWRAVPPFRQVGHAL
KEKFDGATEVEPRRVGSSRALVPRNAQFKPHTDEDLVYLEPSPDFCEQDMRSGVLGTRGR
TCNKTSKAIDGCELLCCGRGFHTAQVELAERCSCKFHWCCFVKCRQCQRLVELHTCR

>gi568815597f:21978479_22190029|GENSCAN_predicted_CDS_4|1254_bp
atgagtccccgctcgtgcctgcgttcgctgcgcctcctcgtcttcgccgtcttctcagcc
gccgcgagcaactggctgttccctcagcccagcagggaggacagttcaatgggtcagtgg
ccacaggacatgaaaggttgtggggacacaggtcggatgggtctagcaaaggaagagctc
tctgctggcaaattgggggacaaggctgtgaagccaggcaaatgccaggttcaagtcctg
ctcaacctcactggctgtgtagccctgcacatgaggtacctggccaagctgtcgtcggtg
gggagcatctcagaggaggagacgtgcgagaaactcaagggcctgatccagaggcaggtg
cagatgtgcaagcggaacctggaagtcatggactcggtgcgccgcggtgcccagctggcc
attgaggagtgccagtaccagttccggaaccggcgctggaactgctccacactcgactcc
ttgcccgtcttcggcaaggtggtgacgcaagggactcgggaggcggccttcgtgtacgcc
atctcttcggcaggtgtggcctttgcagtgacgcgggcgtgcagcagtggggagctggag
aagtgcggctgtgacaggacagtgcatggggtcagcccacagggcttccagtggtcagga
tgctctgacaacatcgcctacggtgtggccttctcacagtcgtttgtggatgtgcgggag
agaagcaagggggcctcgtccagcagagccctcatgaacctccacaacaatgaggccggc
aggaaggccatcctgacacacatgcgggtggaatgcaagtgccacggggtgtcaggctcc
tgtgaggtaaagacgtgctggcgagccgtgccgcccttccgccaggtgggtcacgcactg
aaggagaagtttgatggtgccactgaggtggagccacgccgcgtgggctcctccagggca
ctggtgccacgcaacgcacagttcaagccgcacacagatgaggacctggtgtacttggag
cctagccccgacttctgtgagcaggacatgcgcagcggcgtgctgggcacgaggggccgc
acatgcaacaagacgtccaaggccatcgacggctgtgagctgctgtgctgtggccgcggc
ttccacacggcgcaggtggagctggctgaacgctgcagctgcaaattccactggtgctgc
ttcgtcaagtgccggcagtgccagcggctcgtggagttgcacacgtgccgatga

>gi568815597f:21978479_22190029|GENSCAN_predicted_peptide_5|345_aa
MATSKMAQVAQSLNSHIRGPPECQGPCWILEHHAAPELIPEVLNQLGLSGRTQVSELEAG
TTAQLALVIRKLQSEVEHFWYSNYSPAKLSSASSWDPASKSSVQRTPWQAPAERPALKEE
AKVLCHDPTFKEVLCPPHPSWLLPQDNGSKITPTKAPGQGEVNSVQEVKRIAKNLTTGPA
KVPLSSCQHLLLTSQGHVLHFARELLDVTHDAVTRQREDDSCSHPGGCSDNETSAVPAWD
STLTDRGELNQVTPEQISACTLASIAAKATPQGSPTMGARRAFAKAAQTVSLDLHWCWGS
DQHRDLSTVRAGKSMGVGAGRDLGRNLHFILNSLCDLDQSVPQLL

>gi568815597f:21978479_22190029|GENSCAN_predicted_CDS_5|1038_bp
atggccacctccaagatggcacaggtagctcaatctctcaatagccatatacgagggcct
cctgagtgtcagggcccatgctggatactggagcaccacgctgcaccagaactcatccca
gaagtgctcaaccagttgggactcagtggaagaacccaggtctctgagttggaagctggg
accacagcccagctggccctggtgatacggaagctgcagtccgaggtcgagcatttttgg
tattccaattactctccagccaagctcagctctgcaagctcatgggacccagccagcaaa
tcctctgtgcaacgcacaccttggcaagcacctgccgagcgcccagcactcaaggaagaa
gccaaggtgctgtgccatgatcccacctttaaggaagtcctctgtcctccacatccctcc
tggctccttccccaagacaatggaagcaagataacacccacgaaagccccaggccaaggt
gaggtgaatagtgttcaagaggtaaagcggattgccaaaaacctcaccactgggcctgcc
aaagtcccactctccagctgccagcacctgctcctcacctcacagggccacgtccttcac
tttgctcgggagttactggacgttactcatgacgcagtgacgcggcagcgtgaggatgac
agctgtagccaccccgggggctgctctgacaatgaaactagcgcagtgccggcatgggac
tcaacactgactgaccgaggggagctcaaccaagtcaccccagagcagatttcagcctgc
actctggccagcatagccgccaaagccacgccccagggctcccctactatgggagcaagg
agagcctttgccaaagctgctcagactgtctcactggacctccactggtgctggggaagt
gaccagcacagagacctgagcactgtccgtgcaggaaagagcatgggagttggggcaggc
agggacctaggccggaatctgcacttcatccttaattcactctgtgacctcgaccagtca
gtccctcagcttctctga