GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:53:17

Sequence gi568815593f:134435858_134679314 : 243457 bp : 47.89% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    415    474   60  0  0  114   57    14 0.501   2.41
 1.02 Intr +    649    793  145  0  1   55  105    31 0.538   1.36
 1.03 Intr +  24752  24997  246  0  0   90   84    67 0.170   3.93
 1.04 Term +  26218  26252   35  2  2   85   42    38 0.102  -3.35
 1.05 PlyA +  26737  26742    6                                1.05

 2.00 Prom +  39772  39811   40                               -2.06
 2.01 Init +  50936  51012   77  1  2   81   89    40 0.927   3.96
 2.02 Intr +  55548  55676  129  0  0  132   -2    86 0.460   3.81
 2.03 Intr +  55927  56050  124  1  1   82   49     2 0.261  -3.71
 2.04 Term +  56272  56394  123  0  0  141   38    55 0.515   4.18
 2.05 PlyA +  57608  57613    6                                1.05

 3.00 Prom +  64266  64305   40                               -4.56
 3.01 Init +  72961  73028   68  0  2   93   61    56 0.202   3.94
 3.02 Intr +  73129  73215   87  1  0  119   95    18 0.421   4.69
 3.03 Intr + 102132 102226   95  0  2  141   90   168 0.971  21.91
 3.04 Intr + 108288 108498  211  1  1   20   46   165 0.549   3.47
 3.05 Intr + 108609 108681   73  0  1   60  105   -25 0.396  -4.19
 3.06 Intr + 116195 116352  158  1  2  122   83   225 0.706  24.21
 3.07 Intr + 119474 119559   86  2  2   99   72    34 0.422   2.36
 3.08 Intr + 120283 120335   53  2  2   96   80   -13 0.371  -2.67
 3.09 Intr + 123780 123898  119  2  2   80   53    58 0.589   0.76
 3.10 Intr + 123973 124133  161  1  2  124   69   219 0.994  23.23
 3.11 Intr + 124889 125100  212  0  2  126   80   412 0.999  42.83
 3.12 Intr + 126343 126510  168  0  0  118   90   179 0.646  21.24
 3.13 Intr + 128637 128753  117  2  0   67   91   141 0.993  12.96
 3.14 Intr + 130259 130723  465  1  0  128   96   899 0.996  88.02
 3.15 Intr + 134014 134079   66  0  0  113   82     7 0.736   1.70
 3.16 Intr + 137788 137905  118  0  1   70   80    68 0.875   4.24
 3.17 Intr + 140911 141039  129  2  0   79   92   120 0.950  12.17
 3.18 Term + 142637 143460  824  0  2  108   35   823 0.612  72.26
 3.19 PlyA + 143932 143937    6                                1.05

 4.03 PlyA - 144720 144715    6                               -0.45
 4.02 Term - 145893 145633  261  0  0   75   53   124 0.528   3.13
 4.01 Init - 147250 147179   72  1  0   83   60    48 0.275   2.75
 4.00 Prom - 168033 167994   40                               -4.96

 5.05 PlyA - 170015 170010    6                                1.05
 5.04 Term - 171209 171093  117  2  0   95   41   117 0.999   6.24
 5.03 Intr - 172646 172515  132  2  0   80   93   101 0.998  10.64
 5.02 Intr - 173817 173714  104  1  2   47   95    62 0.938   2.69
 5.01 Init - 176618 176597   22  2  1   71  116    12 0.426   2.39
 5.00 Prom - 198977 198938   40                               -6.26

 6.00 Prom + 200100 200139   40                               -4.66
 6.01 Init + 206745 206747    3  2  0  108   81     0 0.741   1.30
 6.02 Intr + 212579 212737  159  1  0   49   41   134 0.199   4.98
 6.03 Intr + 213151 213316  166  0  1   87  113    69 0.317   8.83
 6.04 Intr + 231020 231139  120  0  0   50  115    55 0.355   4.87
 6.05 Intr + 235952 236029   78  0  0   76   72    90 0.817   5.72
 6.06 Intr + 238841 238918   78  0  0   49   97    62 0.754   2.72
 6.07 Intr + 239198 239360  163  0  1   59   61    94 0.285   2.83
 6.08 Intr + 240166 240268  103  1  1   54   68    89 0.320   3.68

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  65605  65453  153  1  0  124   32    38 0.806  -0.28
S.002 Intr -  66217  66110  108  1  0   56   68   117 0.880   6.98

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:134435858_134679314|GENSCAN_predicted_peptide_1|161_aa
MPSQHEAFHLLLPIPYGFLWLWKPKPCSHLKHALSLSARFHLGNLSDSDVQAAGRRTVNR
GACDELRGAIAKSSRLLRVSQRASQLSQHCPSLTPSFHPMSWPIPTGHSTPGKTEQATCQ
APSAQSAIAMALLGVYRSPQLPYQDRDSGQVLCPDDSNLLD
>gi568815593f:134435858_134679314|GENSCAN_predicted_CDS_1|486_bp
atgcccagccagcatgaagcctttcatttgctgctacccattccgtatggctttctctgg
ctctggaaaccaaaaccctgcagccacctaaagcatgccctgagtctctcagccaggttt
catctgggaaatctgagtgacagtgacgtgcaggcagctggaagacgcacagtaaaccga
ggagcctgtgatgagctcagaggagccattgccaagagcagcagactcctgagggtgagt
caacgggcaagccagctgagccagcactgtccctcccttacgccatccttccaccccatg
tcctggcccatccccactggccattccactccggggaaaacagagcaagccacttgccag
gccccttctgcccagagtgccattgccatggctctcctgggggtgtaccgctccccgcaa
cttccgtatcaagatagggactctggccaggtgctgtgcccagatgattccaacctgttg
gactaa
>gi568815593f:134435858_134679314|GENSCAN_predicted_peptide_2|150_aa
MRGSFLRSFHKVDAGSIMLHVQPAECSHQGATTTAELPFQGLSTLDAAGNYRTSPSLHLR
SVEQASPPCQSLEAESWPPLTNQFQLANSLLSITGPLSGRHLNSDPPIYRVKWHSNAFQE
DIAEGPNCEDIAHQSWSLHTKLGFSLFWTS
>gi568815593f:134435858_134679314|GENSCAN_predicted_CDS_2|453_bp
atgagaggaagcttcctgaggtccttccataaagtagatgctggcagcatcatgcttcat
gtacagcctgcagaatgctcccatcagggcgccaccactacagcagaactgccttttcaa
ggactctccacgttagatgcagcgggcaattatcggacctcaccttccttgcacctccgt
agcgtggaacaggcttctcctccctgccagtctcttgaagctgagtcatggccgcccctt
accaatcagtttcagctggccaactccctgctgtccataactggcccgctttcaggtaga
catttgaattctgaccccccaatctatagggtgaagtggcacagtaacgcattccaggag
gacatcgctgagggccccaattgtgaggacattgcccatcaatcttggagcctccacacc
aagctgggattctctctcttctggacttcatag
>gi568815593f:134435858_134679314|GENSCAN_predicted_peptide_3|1069_aa
MALAAPQFGMSIFRNSGNKCLLLWSLTAFEPHHLLGLPLCVSPVLDTYSADLSCDIYIRI
KMLQTAQQHQVGLAPTERKEALRGSQSPQKSASGQRPGSLDRELLMLIKISKCLRVPSGP
SSVLGTESTGPVPGELVLCQQDAGATQEGVCYGRELMICRCDSFPRLMEPWGVKGTPEVF
RTDLITAMKIPDSYQLSPDDYYILADPWRQEWEKGVQVPAGAEAIPEPVVRVKTPSPPID
PSQLTPSSQLTSAVLATAEALSHSFWQPCPRPQDASQSSESLQGGQMVEEEVKEGVEGGG
PCVVSKHTCPSAPELDWILPPLEGPPAQASPSSTMLGEGSQPDWPGGSRYDLDEIDAYWL
ELINSELKEMERPELDELTLERVLEELETLCHQNMARAIETQEGLGIEYDEDVVCDVCRS
PEGEDGNEMVFCDKCNVCVHQACYGILKVPTGSWLCRTCALGVQPKCLLCPKRGGALKPT
RSGTKWVHVSCALWIPEVSIGCPEKMEPITKISHIPASRWALSCSLCKECTGTCIQCSMP
SCVTAFHVTCAFDHGLEMRTILADNDEVKFKSFCQEHSDGGPRNEPTSEPTEPSQAGEDL
EKVTLRKQRLQQLEEDFYELVEPAEVAERLDLAEALVDFIYQYWKLKRKANANQPLLTPK
TDEVDNLAQQEQDVLYRRLKLFTHLRQDLERGSPNPLVPLPHPSPVTTSSCSQVRNLCYM
VTRRERTKHAICKLQEQIFHLQMKLIEQDLCRERSGRRAKGKKSDSKRKGCEGSKGSTEK
KEKVKAGPDSVLGQLAGLSTSFPIDGTFFNSWLAQSVQITAENMAMSEWPLNNGHREDPA
PGLLSEELLQDEETLLSFMRDPSLRPGDPARKARGRTRLPAKKKPPPPPPQDGPGSRTTP
DKAPKKTWGQDAGSGKGGQGPPTRKPPRRTSSHLPSSPAAGDCPILATPESPPPLAPETP
DEAASVAADSDVQVPGPAASPKPLGRLRPPRESKVTRRLPGARPDAGMGPPSAVAERPKV
SLHFDTETDGYFSDGEMSDSDVEAEDGGVQRGPREAGAEEVVRMGVLAS
>gi568815593f:134435858_134679314|GENSCAN_predicted_CDS_3|3210_bp
atggctttagcagcacctcaatttgggatgtctattttccggaacagcgggaataagtgt
ctgctgctctggtctctcactgccttcgagccccatcacctgctcgggctccccctctgt
gtgtccccagttctggacacttattcagcagacctgtcatgcgacatctacatccgcatc
aagatgctccaaactgcccagcagcaccaagtcgggctggccccgacagaacgaaaagaa
gccctccgaggaagtcagagtccccagaagtcagccagtgggcagaggcctgggtctctg
gaccgggagctgctgatgctgatcaaaatcagcaagtgcttaagagtacctagtgggcct
agctcagtgctggggacagaaagcacaggccctgtccctggggagcttgtcctttgtcag
caggatgcaggtgctacacaggaaggtgtctgttatgggagggagttgatgatctgtaga
tgtgactccttcccgagactcatggagccctggggggtgaagggaaccccagaggttttc
cggacagacttgatcacagccatgaagatcccggactcataccagctcagcccggatgac
tactacatcctggcagacccatggcgacaggaatgggagaaaggtgtgcaggtgcctgcc
ggggcagaggccatcccagagcccgtggtgagggtaaagaccccctcccctcccatagat
cccagccagctgacgccctcctcccagctgacctccgccgtcctggctacagctgaggcg
ctgagtcacagcttctggcagccttgcccccgaccccaggatgcctctcaatcatctgag
tctctgcagggtgggcagatggtggaggaggaagtgaaggagggggtggaaggaggtggg
ccctgtgtggtgtcaaagcacacttgtccatcagctcctgagctggactggatcctccca
ccactggaaggcccccctgcccaggcatccccgagcagcaccatgcttggtgagggctcc
cagcctgattggccagggggcagccgctatgacttggacgagattgatgcctactggctg
gagctcatcaactcggagcttaaggagatggagaggccggagctggacgagctgacatta
gagcgtgtgctggaggagctggagaccctgtgccaccagaatatggccagggccattgag
acgcaggaggggctgggcatcgagtacgacgaggatgttgtctgcgacgtgtgtcgctct
cctgagggcgaggatggcaacgagatggtcttctgtgacaagtgcaacgtctgtgtgcat
caggcatgctacgggatcctcaaggtgcccacgggcagctggctgtgccggacgtgtgcc
ctgggtgtccagccaaagtgcctgctctgccccaagcgaggaggagccttgaagcccact
agaagtgggaccaagtgggtgcatgtcagctgtgccctatggattcctgaggtcagcatc
ggctgcccagagaagatggagcccatcaccaagatctcgcatatcccagccagccgctgg
gctctgtcctgcagcctctgcaaggaatgcacaggcacctgcatccagtgttccatgcct
tcctgcgtcacagcgttccatgtcacatgcgcctttgaccacggcctggaaatgcggact
atattagcagacaacgatgaggtcaagttcaagtcattctgccaggagcacagtgacggg
ggcccacgtaatgagcccacatctgagcccacggaacccagccaggctggcgaggacctg
gaaaaggtgaccctgcgcaagcagcggctgcagcagctagaggaggacttctacgagctg
gtggagccggctgaggtggctgagcggctggacctggctgaggcactggtcgacttcatc
taccagtactggaagctgaagaggaaagccaatgccaaccagccgctgctgacccccaag
accgacgaggtggacaacctggcccagcaggagcaggacgtcctctaccgccgcctgaag
ctcttcacccatctgcggcaggacctagagaggggctcccccaacccattagtgcccttg
cctcacccaagcccagtcaccacctccagctgctcccaggttagaaatctgtgctacatg
gtgacaaggcgcgagagaacgaaacacgccatctgcaaactccaggagcagatattccac
ctgcagatgaaacttattgaacaggatctgtgtcgagagcggtctgggaggagagcaaag
ggcaagaagagtgactcgaagaggaagggctgcgagggctccaagggcagcactgagaag
aaagagaaagtgaaggcggggcctgactcagtcctggggcagctggcaggcctgtccacc
tcattccccatcgatggcaccttcttcaacagctggctggcacagtcggtgcagatcaca
gcagagaacatggccatgagcgagtggccactgaacaatgggcaccgcgaggaccctgct
ccagggctgctgtcagaggaactgctgcaggacgaggagacactgctcagcttcatgcgg
gacccctcgctgcgacctggtgaccctgctaggaaggcccgaggccgcacccgcctgcct
gccaagaagaaaccaccaccaccaccaccgcaggacgggcctggttcacggacgactcca
gacaaagcccccaagaagacctggggccaggatgcaggcagtggcaaggggggtcaaggg
ccacctaccaggaagccaccacgtcggacatcttctcacttgccgtccagccctgcagcc
ggggactgtcccatcctagccacccctgaaagccccccgccactggcccctgagaccccg
gacgaggcagcctcagtagctgctgactcagatgtccaagtgcctggccctgcagcaagc
cctaagcctttgggccggctccggccaccccgcgagagcaaggtaacccggagattgccg
ggtgccaggcctgatgctgggatgggaccaccttcagctgtggctgagaggcccaaggtc
agcctgcattttgacactgagactgatggctacttctctgatggggagatgagcgactca
gatgtagaggccgaggacggtggggtgcagcggggtccccgggaggcaggggcagaggag
gtggtccgcatgggcgtactggcctcctaa
>gi568815593f:134435858_134679314|GENSCAN_predicted_peptide_4|110_aa
MGLGTMQQRQHRIMQGTAGTGHTQQPASLTDIWVLQDTGSATVQSKATANEEGPESRGQQ
CEQRLLEEHQPARGKGPDSLHPLPNSPPTPATREAQPLPQDGVKGQGAKC
>gi568815593f:134435858_134679314|GENSCAN_predicted_CDS_4|333_bp
atggggctgggaactatgcagcagcggcaacacagaatcatgcagggcacagcgggcact
ggccacacccagcagccagccagcctaacagacatctgggttctgcaggacacaggctca
gcaacagtccagtccaaagcaacagccaatgaagaagggccagaatccagaggccaacaa
tgtgagcagaggctcttggaagaacaccagccagcccgaggcaaggggccagactcgttg
caccctctccctaattctccacccacccctgccaccagggaggcccagcccctgccccag
gatggtgtgaaggggcaaggggccaagtgctga
>gi568815593f:134435858_134679314|GENSCAN_predicted_peptide_5|124_aa
MGFRHVQARRVWKNYLPAINGIVFLVDCADHERLLESKEELDSLMTDETIANVPILILGN
KIDRPEAISEERLREMFGLYGQTTGKGSISLKELNARPLEVFMCSVLKRQGYGEGFRWMA
QYID
>gi568815593f:134435858_134679314|GENSCAN_predicted_CDS_5|375_bp
atgggatttcgccacgttcaagctcgaagagtgtggaaaaactaccttcctgctatcaat
ggcattgtatttctggtggattgtgcagaccacgaaaggctgttagagtcaaaagaagaa
cttgattcactaatgacagatgaaaccattgctaatgtgcctatactgattcttgggaat
aagatcgacagacctgaagccatcagtgaagagaggttgcgagagatgtttggtttatat
ggtcagacaacaggaaaggggagtatatctctgaaagaactgaatgcccgacccttagaa
gttttcatgtgtagtgtgctcaaaagacaaggttacggagaaggcttccgctggatggca
cagtacattgattaa
>gi568815593f:134435858_134679314|GENSCAN_predicted_peptide_6|290_aa
MVIFTKPEERNCQSHPTKAATESQIAPRSWLCERGHVHNGGNFRSGDTRHFRPAQWSFSS
LLVRCCRPRPAPSNPVIMSQPGIPASGGAPASLQAQNGAALASGSPYTNGAPHGPPPAGG
PPPVRALTPLTSSYRDVPQPLFNSAVNQEGITSNTNNGSMVVHSSYDEIEGGGLLEHNTT
WCNWSTTLFLELPKWATSLYSGANHLTTSMSGLSLQPEGLRVVNLLQERNMLPSTPLKPP
VPNLHEDIQKLNCNPELFRCTLTSIPQTQALLNKAKLPLGLLLHPFKDLV
>gi568815593f:134435858_134679314|GENSCAN_predicted_CDS_6|870_bp
atggtaatcttcacaaaacccgaggagaggaactgtcagtcccaccccacgaaagcggct
actgaatctcagattgccccacgatcctggctctgcgagcgaggtcacgttcacaacggc
gggaatttccgtagcggtgacacacggcacttccggccggcccagtggtctttcagctct
cttcttgtgcgctgttgtcgaccccgaccagccccttccaacccagtcatcatgtcccag
ccgggaataccggcctccggcggcgccccagccagcctccaggcccagaacggagccgcc
ttggcctcggggtctccctacaccaacggagctcctcatgggccccctccagctggaggc
ccacccccagtgagggccctcacgcccctgacatcatcatatagagatgtaccccagccc
ttatttaattcagctgtcaaccaagaaggtattacatcaaataccaataacggatctatg
gtggtccacagtagttacgacgagattgaaggaggtggcttattggaacacaacaccacc
tggtgcaactggagtaccaccctcttccttgaattacccaagtgggccacaagcctttac
tcaggtgctaatcatttaaccacaagcatgagtggattaagtctacaaccagagggtcta
agagttgtcaatcttcttcaagaaagaaacatgcttccgtcaacacctttgaagcctcca
gttccaaatttgcatgaagacatccagaaactcaactgtaacccagagttatttcgatgc
acgctgactagcattcctcagacgcaggccttattgaataaagccaaacttcctttgggg
ctgctgcttcatcctttcaaagacttagtg