GENSCAN 1.0 Date run: 7-Nov-116 Time: 15:34:44 Sequence gi568815581f:45862338_46124168 : 261831 bp : 45.84% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1628 1765 138 1 0 64 81 34 0.453 0.96 1.02 Term + 6273 6467 195 2 0 -12 55 192 0.512 3.31 1.03 PlyA + 6719 6724 6 1.05 2.00 Prom + 22941 22980 40 -2.86 2.01 Init + 27321 27381 61 2 1 108 57 18 0.319 2.11 2.02 Intr + 32186 32349 164 0 2 50 115 124 0.370 10.99 2.03 Intr + 32434 32555 122 0 2 129 77 -14 0.615 0.89 2.04 Intr + 33071 33236 166 2 1 29 57 143 0.626 5.26 2.05 Intr + 52051 52184 134 0 2 22 84 101 0.039 2.54 2.06 Intr + 67548 67639 92 0 2 42 82 62 0.024 0.64 2.07 Intr + 73237 73375 139 2 1 74 55 62 0.083 1.02 2.08 Intr + 74416 74499 84 1 0 49 89 78 0.118 2.94 2.09 Intr + 78053 78143 91 2 1 69 63 41 0.066 -0.30 2.10 Intr + 82149 82259 111 2 0 22 44 116 0.079 1.18 2.11 Intr + 90633 90786 154 2 1 55 80 178 0.407 13.35 2.12 Intr + 99303 99420 118 1 1 94 68 31 0.040 1.22 2.13 Intr + 100000 100133 134 1 2 23 96 69 0.033 1.49 2.14 Intr + 109522 109608 87 2 0 55 86 58 0.656 2.24 2.15 Intr + 112048 112140 93 2 0 79 98 88 0.408 8.84 2.16 Intr + 116038 116103 66 2 0 29 86 87 0.727 1.48 2.17 Intr + 120478 121593 1116 2 0 45 66 575 0.477 40.66 2.18 Intr + 127659 127738 80 1 2 66 116 24 0.442 2.27 2.19 Intr + 129123 129249 127 2 1 91 99 53 0.998 6.95 2.20 Intr + 134062 134327 266 2 2 127 100 313 0.984 33.63 2.21 Intr + 147973 148065 93 0 0 77 65 58 0.782 2.56 2.22 Intr + 149655 149681 27 2 0 112 85 2 0.600 0.61 2.23 Intr + 151906 152002 97 0 1 143 100 56 0.980 11.88 2.24 Intr + 156281 156393 113 0 2 75 98 133 0.969 13.10 2.25 Intr + 161619 161826 208 2 1 142 9 313 0.836 27.45 2.26 Term + 162161 162333 173 0 2 73 48 106 0.442 3.09 2.27 PlyA + 165974 165979 6 1.05 3.13 PlyA - 167600 167595 6 1.05 3.12 Term - 169366 169139 228 1 0 53 53 151 0.916 4.73 3.11 Intr - 169895 169710 186 2 0 60 110 50 0.700 4.29 3.10 Intr - 171123 171066 58 2 1 115 80 40 0.969 4.79 3.09 Intr - 171948 171824 125 0 2 80 75 53 0.941 2.58 3.08 Intr - 176349 176201 149 1 2 83 123 0 0.811 2.95 3.07 Intr - 176878 176690 189 2 0 103 97 85 0.997 10.46 3.06 Intr - 177547 177365 183 2 0 99 100 82 0.930 10.26 3.05 Intr - 188367 188196 172 0 1 91 95 83 0.865 8.82 3.04 Intr - 204395 204200 196 1 1 46 71 149 0.694 8.42 3.03 Intr - 205330 205212 119 1 2 95 113 -6 0.684 1.86 3.02 Intr - 220205 220104 102 2 0 119 111 -2 0.800 5.57 3.01 Intr - 232364 232223 142 1 1 82 116 113 0.969 13.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 240781 240790 10 0 1 64 96 -3 0.888 -1.22 S.002 Intr + 244998 245120 123 1 0 84 113 112 0.993 13.86 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:45862338_46124168|GENSCAN_predicted_peptide_1|110_aa GLCPVVASTGKLFPQPQIAGSFLPCIAQLKVHLCREASLTVQSNNKSAGKVNPLDGYTVA QHRTQCTHNAARQNPTFIPGPGNSIFEESPELKQLVKGLSTNYERGPGQD >gi568815581f:45862338_46124168|GENSCAN_predicted_CDS_1|333_bp ggcctttgtcctgtggtcgcctccactggcaagctcttcccacagcctcagatagctggc tccttcttaccgtgcatagctcagcttaaagttcacctctgcagggaggcctccctgact gtgcagagtaacaacaagtcagcaggcaaggtgaacccactggacggttacactgttgct cagcacagaactcagtgtacccacaacgctgcacgccagaatcccaccttcataccaggt ccaggtaacagcatctttgaggagagcccagagctgaagcaattagtcaaaggtttaagt accaattatgagagaggccctggacaggactga >gi568815581f:45862338_46124168|GENSCAN_predicted_peptide_2|1371_aa MTGSLYPHLCQSLNVVSLQQGTRPLRRRAPSQSPPPTSSGTNSSAAATAHLLPPPPQPPS PPPLSSPVLASVDYQTGASCAGHRSGPPQGPSLPPLLGGWGQGGLERDLSKGCTHADASS AVPENSARIPLCTLAKAGFTWPGDVDGLGGRWSPFAGTHSRSHARMGAGLQKSGKHQDGT GEISVASDGMERSVCTAAGAPQILWVSEKSSEEQGSGSLKFRLPGEQIMGIQFLQYPPSG KPSSFLAESQWCHGSLGSRSPDAGSWPQISLIRALPECLVSTIAFDQWCPFAWQRPSPKG VINDPHLDIFQSLLLFQANRRVLICELGNLDRIYPIGRSEDVLNYPKWLHRMRKWHRHIV SPGFMGELQVQREAKQGALQAQLWARVGVTSQTPGNHNTYDDDDDDDDDDTYLKDCPEGS QRCLQGTCMEQAPLLWQVLVVPSPLKRKSKLLGSSIDFFNEVERELGITSRAHLTCLTWM AEPRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPGS ETSDAKSTPTAEDVTAPLVDEGAPGKQAAAQPHTEIPEGTTGEAEEAGIGDTPSLEDEAA GHVTQAINAPLLTFCYRCLFKPEELRVPGRQRKAPERPLANEISAHVQPGPCGEASGVSG PCLGEKEPEAPVPLTASLPQHRPVCPAPPPTGGPQEPSLEWGQKGGDWAEKGPAFPKPAT TAYLHTEPESGKVVQEGFLREPGPPGLSHQLMSGMPGAPLLPEGPREATRQPSGTGPEDT EGGRHAPELLKHQLLGDLHQEGPPLKGAGGKERPGSKEEVDEDRDVDESSPQDSPPSKAS PAQDGRPPQTAAREATSIPGFPAEGAIPLPVDFLSKVSTEIPASEPDGPSVGRAKGQDAP LEFTFHVEITPNVQKEQAHSEEHLGRAAFPGAPGEGPEARGPSLGEDTKEADLPEPSEKQ PAAAPRGKPVSRVPQLKEPPSSPKYVSSVTSRTGSSGAKEMKLKGADGKTKIATPRGAAP PGQKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPSLPTPP TREPKKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQPGGGKVQIIN KKLDLSNVQSKCGSKDNIKHVPGGGSFVCCEGTEEVQIVYKPVDLSKVTSKCGSLGNIHH KPGSPVEGGGQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAK TDHGAEIVYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQEEGEGE ALKAASGGFQGTGGANHLWPCCGGVTEAVAATKDLKLGVFVEPQADDVNLV >gi568815581f:45862338_46124168|GENSCAN_predicted_CDS_2|4116_bp atgactgggtctttatacccccacctctgtcagtcactcaacgtggtctccctgcaacaa ggaacgcgccctcttcgccggcgcgcgccctcgcagtcaccgccacccaccagctccggc accaacagcagcgccgctgccaccgcccaccttctgccgccgccaccacagccaccttct cctcctccgctgtcctctcccgtcctcgcctctgtcgactatcagactggggcttcgtgc gccgggcatcggtcggggccaccgcagggcccctccctgcctcccctgctcgggggctgg ggccagggcggcctggaaagggacctgagcaagggatgcacgcacgcagacgcgagcagc gccgtgcctgagaacagtgcgcggatcccactgtgcacgctcgcaaaggcagggttcacc tggcctggcgatgtggacggactcggcggccgctggtccccgttcgcgggcacgcacagc cgcagccacgcacggatgggcgcggggctgcagaagtctggaaaacatcaggatggaact ggtgaaataagtgtggcctctgacggaatggagcggtccgtctgcactgctgcgggtgcc cctcagatcctgtgggtcagtgagaaaagcagtgaggaacaaggcagtggatccctgaag ttcaggctcccaggggagcagataatgggtatccagttcctgcaatatccaccctctggc aagccaagttccttcctggctgagagccagtggtgccatggttccttagggagccggtcc cctgatgccggctcctggccccaaatctctctgatccgggctcttccagaatgtcttgtc tccaccatcgcctttgaccaatggtgtccctttgcctggcagaggccttcccccaagggc gtcattaacgatccacatctggacatcttccaaagccttcttctgtttcaggccaaccgc agagtcctcatctgtgagttggggaatctggacagaatctaccccatagggcgtagtgag gatgtgttgaattatcccaagtggctacacagaatgcgcaagtggcatcgccacatcgtg agtcctggcttcatgggtgagctccaggtccaacgagaagccaagcagggggcccttcaa gctcagctttgggcccgggtcggggtaacttcccagactcctgggaatcataacacctat gatgatgatgatgatgatgatgatgatgacacctacctcaaggattgccctgaagggtca cagagatgcctgcaaggcacctgcatggagcaagcgccccttctctggcaggtgctggtc gtgccctcaccgttaaagagaaagagcaaactgctgggcagcagcattgatttttttaat gaagtggaaagagagctgggaataacaagtcgggcccacctcacctgcctcacctggatg gctgagccccgccaggagttcgaagtgatggaagatcacgctgggacgtacgggttgggg gacaggaaagatcaggggggctacaccatgcaccaagaccaagagggtgacacggacgct ggcctgaaagaatctcccctgcagacccccactgaggacggatctgaggaaccgggctct gaaacctctgatgctaagagcactccaacagcggaagatgtgacagcacccttagtggat gagggagctcccggcaagcaggctgccgcgcagccccacacggagatcccagaaggaacc acaggtgaggctgaagaagcaggcattggagacacccccagcctggaagacgaagctgct ggtcacgtgacccaagcgattaatgcgcccttgctaaccttttgctatcgctgcctcttc aaaccagaggagttgagagttccgggccggcagaggaaggcgcctgaaaggcccctggcc aatgagattagcgcccacgtccagcctggaccctgcggagaggcctctggggtctctggg ccgtgcctcggggagaaagagccagaagctcccgtcccgctgaccgcgagccttcctcag caccgtcccgtttgcccagcgcctcctccaacaggaggccctcaggagccctccctggag tggggacaaaaaggcggggactgggccgagaagggtccggcctttccgaagcccgccacc actgcgtatctccacacagagcctgaaagtggtaaggtggtccaggaaggcttcctccga gagccaggccccccaggtctgagccaccagctcatgtccggcatgcctggggctcccctc ctgcctgagggccccagagaggccacacgccaaccttcggggacaggacctgaggacaca gagggcggccgccacgcccctgagctgctcaagcaccagcttctaggagacctgcaccag gaggggccgccgctgaagggggcagggggcaaagagaggccggggagcaaggaggaggtg gatgaagaccgcgacgtcgatgagtcctccccccaagactcccctccctccaaggcctcc ccagcccaagatgggcggcctccccagacagccgccagagaagccaccagcatcccaggc ttcccagcggagggtgccatccccctccctgtggatttcctctccaaagtttccacagag atcccagcctcagagcccgacgggcccagtgtagggcgggccaaagggcaggatgccccc ctggagttcacgtttcacgtggaaatcacacccaacgtgcagaaggagcaggcgcactcg gaggagcatttgggaagggctgcatttccaggggcccctggagaggggccagaggcccgg ggcccctctttgggagaggacacaaaagaggctgaccttccagagccctctgaaaagcag cctgctgctgctccgcgggggaagcccgtcagccgggtccctcaactcaaagagccacct tcctctcctaaatacgtctcttctgtcacttcccgaactggcagttctggagcaaaggag atgaaactcaagggggctgatggtaaaacgaagatcgccacaccgcggggagcagcccct ccaggccagaagggccaggccaacgccaccaggattccagcaaaaaccccgcccgctcca aagacaccacccagctctggtgaacctccaaaatcaggggatcgcagcggctacagcagc cccggctccccaggcactcccggcagccgctcccgcaccccgtcccttccaaccccaccc acccgggagcccaagaaggtggcagtggtccgtactccacccaagtcgccgtcttccgcc aagagccgcctgcagacagcccccgtgcccatgccagacctgaagaatgtcaagtccaag atcggctccactgagaacctgaagcaccagccgggaggcgggaaggtgcagataattaat aagaagctggatcttagcaacgtccagtccaagtgtggctcaaaggataatatcaaacac gtcccgggaggcggcagttttgtctgctgtgaggggacagaagaggtgcaaatagtctac aaaccagttgacctgagcaaggtgacctccaagtgtggctcattaggcaacatccatcat aaaccaggtagccctgtggaaggaggtggccaggtggaagtaaaatctgagaagcttgac ttcaaggacagagtccagtcgaagattgggtccctggacaatatcacccacgtccctggc ggaggaaataaaaagattgaaacccacaagctgaccttccgcgagaacgccaaagccaag acagaccacggggcggagatcgtgtacaagtcgccagtggtgtctggggacacgtctcca cggcatctcagcaatgtctcctccaccggcagcatcgacatggtagactcgccccagctc gccacgctagctgacgaggtgtctgcctccctggccaagcaggaagagggagaaggagag gctctgaaagctgcttctgggggatttcaagggactgggggtgccaaccacctctggccc tgttgtgggggtgtcacagaggcagtggcagcaacaaaggatttgaaacttggtgtgttc gtggagccacaggcagacgatgtcaaccttgtgtga >gi568815581f:45862338_46124168|GENSCAN_predicted_peptide_3|616_aa XRRRSEWKWAADRAAIVSRWNWLQAHVSDLEYRIRQQTDIYKQIRANKGLIVLGEVPPPE HTTDLFLPLSSEVKTDHGTDKLIESVSQPLENHGAPIIGHISESLSTKSCGALRPVNGVI NTLQPVLADHIPGDSSDAEEQLHKKQRLNLVSSSSDGTCVAARTRPVLSCKKRRLVRPNS IVPLSKKVHRNSTIRPGCDVNPSCALCGSGSINTMPPEIHYEAPLLERLSQLDSCVHPVL AFPDDVPTSLHFQSMLKSQWQNKPFDKIKPPKKLSLKHRAPMPGSLPDSARKDRHKLVSS FLTTAKLSHHQTRPDRTHRQHLDDVGAVPMVERVTAPKAERLLNPPPPVHDPNHSKMRLR DHSSERSEVLKHHTDMSSSSYLAATHHPPHSPLVRQLSTSSDSPAPASSSSQVTASTSQQ PVRRRRGESSFDINNIVIPMSVAATTRVEKLQYKEILTPSWREVDLQSLKGSPDEENEEP ASPDVSSSHSLSEYSHGQSPRSPISPELHSAPLTPVARDTPRHLASEDTRCSTPELGLDE QSVQPWERRTFPLAHSPQAECEDQLDAQERAARCTRRTSGSKTGRETEAAPTSPPIVPLK SRHLVAAATAQRPTHR >gi568815581f:45862338_46124168|GENSCAN_predicted_CDS_3|1851_bp nngagacgcaggtcagaatggaaatgggctgcagaccgggcagctattgtcagccgctgg aactggcttcaggctcatgtttctgacttggaatatcgaattcgtcagcaaacagacatt tacaaacagatacgtgctaataaggggttgatagttcttggggaggtacctcccccagag catacaacagacttatttcttccacttagttctgaggtgaagacagatcatgggactgat aaattgattgagtctgtttctcagccattggaaaaccatggtgcccctattattggtcat atttcagagtcactgtctaccaaatcatgtggagcactcagacctgtcaatggagttatt aacactcttcagcctgtcttggcagaccacattccaggtgacagctctgatgctgaggaa caattacataagaagcaacgactgaatctcgtctcttcatcatctgatggcacctgtgtg gcagcccggacacgtcctgtactgagctgtaagaagcggaggcttgttcgacccaacagc atcgttcctctttccaagaaggttcaccggaacagcacaatccgccctggctgtgatgtg aatccctcctgcgcactgtgtggttcaggcagcatcaacaccatgcctcccgaaattcac tatgaagcccctctgttggaacgtctttcccagttggactcttgtgttcatcctgttcta gcatttccagatgatgttcccacaagcctgcatttccagagcatgctgaaatctcagtgg cagaacaagccttttgacaaaatcaaacctcccaaaaagttatcgcttaagcacagagca cccatgccgggcagtctgccagattcagctcgtaaggacaggcacaaattggtcagctcc ttcctaacaacagccaagctgtcccatcaccaaacccggcctgacaggacccacaggcag cacttagacgatgtgggggccgtgcccatggtggagcgagtgacagcgccaaaagcagag cgcttgctcaacccaccaccacccgtgcatgacccaaaccacagcaaaatgagattgcga gaccattcatctgagagaagtgaagtgttgaagcatcacacagacatgagcagttcgagc tacttggcagccacccaccatcctccacacagtcccttggtgcgacagctctccacctcc tcagattcccctgcacccgccagctctagctcacaggttacagccagcacatcgcagcag ccagtaaggaggagaaggggagagagctcatttgatattaacaacattgtcatcccaatg tctgttgctgcaacaactcgcgtagagaaactgcaatacaaggaaatccttacgcccagc tggcgggaggttgatcttcagtctctgaaggggagtcctgatgaggagaatgaagagcct gcctcccctgatgtcagcagtagccactctttgtcagaatactcccatggtcagtcccct aggagccccattagcccggaactgcactcagcacccctcacccctgtggctcgggacact ccgcgacacttagccagtgaggatacccgttgttccacaccagagctggggctggatgaa cagtctgtccagccctgggagcggcggaccttccccctggcgcacagtccccaggcggag tgtgaggaccagctggatgcacaggagcgagcagcccgctgcactcgacgcacctcaggc agcaagactggccgggagacagaggcagcgcccacctcgcctcccattgtccccctcaag agtcggcatctggtggcagcagccacagctcagcgcccgactcacagatga