GENSCAN 1.0 Date run: 3-Nov-116 Time: 05:07:13 Sequence gi568815576f:28887076_29153619 : 266544 bp : 45.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 PlyA - 28 23 6 1.05 1.08 Term - 12458 12368 91 1 1 78 53 61 0.431 -1.21 1.07 Intr - 14362 14292 71 1 2 82 50 74 0.348 0.98 1.06 Intr - 15678 15608 71 1 2 38 93 41 0.145 -1.50 1.05 Intr - 20569 20407 163 1 1 29 97 70 0.270 1.55 1.04 Intr - 21400 21238 163 0 1 101 115 40 0.468 7.98 1.03 Intr - 45113 45056 58 0 1 79 64 76 0.214 2.24 1.02 Intr - 56307 56191 117 1 0 -54 87 143 0.203 0.44 1.01 Init - 64975 64828 148 1 1 48 88 119 0.611 8.15 1.00 Prom - 74187 74148 40 -6.26 2.00 Prom + 75647 75686 40 -4.96 2.01 Init + 75920 75926 7 1 1 50 95 0 0.606 -1.98 2.02 Intr + 77596 77651 56 2 2 63 98 71 0.242 4.30 2.03 Intr + 100001 100126 126 1 0 91 121 54 0.664 9.88 2.04 Intr + 119981 120089 109 1 1 41 90 10 0.001 -3.64 2.05 Intr + 138771 138878 108 1 0 50 42 85 0.241 0.36 2.06 Intr + 142841 142988 148 0 1 45 95 92 0.672 4.89 2.07 Intr + 144305 144392 88 2 1 78 73 33 0.490 0.77 2.08 Intr + 146557 146619 63 0 0 44 99 76 0.457 3.21 2.09 Intr + 149433 149529 97 2 1 101 71 -12 0.045 -2.02 2.10 Intr + 155345 155373 29 0 2 58 116 44 0.896 2.03 2.11 Intr + 155420 155494 75 1 0 75 115 73 0.995 8.41 2.12 Intr + 156224 156355 132 1 0 99 46 223 0.997 20.04 2.13 Intr + 157705 157815 111 0 0 116 72 108 0.998 12.58 2.14 Intr + 159641 159808 168 1 0 84 80 168 0.937 15.74 2.15 Intr + 161314 161416 103 0 1 83 61 112 0.999 7.75 2.16 Intr + 162122 163873 1752 0 0 115 94 1673 0.950 157.92 2.17 Intr + 166080 166174 95 1 2 119 83 -42 0.868 -1.92 2.18 Term + 166504 166602 99 0 0 151 38 73 0.933 6.73 2.19 PlyA + 166663 166668 6 1.05 3.03 PlyA - 167980 167975 6 1.05 3.02 Term - 172107 171667 441 0 0 83 49 299 0.984 20.76 3.01 Init - 172646 172608 39 2 0 80 91 57 0.295 5.43 3.00 Prom - 174995 174956 40 -6.06 4.00 Prom + 175203 175242 40 -6.06 4.01 Init + 186056 186152 97 1 1 76 117 220 0.754 22.17 4.02 Term + 186282 187099 818 1 2 108 44 312 0.742 22.10 4.03 PlyA + 187840 187845 6 1.05 5.03 PlyA - 188385 188380 6 1.05 5.02 Term - 193762 193606 157 0 1 65 41 172 0.633 7.61 5.01 Init - 213106 213093 14 1 2 112 73 23 0.002 1.41 5.00 Prom - 213342 213303 40 -3.56 6.00 Prom + 213440 213479 40 -3.46 6.01 Init + 223714 223766 53 0 2 89 47 54 0.017 2.03 6.02 Intr + 232764 232840 77 0 2 36 94 62 0.012 0.76 6.03 Intr + 234282 234406 125 1 2 94 59 42 0.627 2.20 6.04 Intr + 238188 238341 154 2 1 71 73 131 0.690 9.55 6.05 Intr + 250267 250599 333 2 0 108 116 367 0.997 37.04 6.06 Intr + 251549 251707 159 0 0 103 69 229 0.969 22.46 6.07 Intr + 253207 253291 85 2 1 93 89 79 0.776 7.38 6.08 Intr + 260936 261054 119 2 2 55 22 85 0.358 -1.29 6.09 Term + 263717 263892 176 0 2 40 49 171 0.613 6.32 6.10 PlyA + 265822 265827 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 205013 205121 109 1 1 55 25 110 0.837 1.78 S.002 Intr + 207183 207345 163 1 1 102 110 55 0.987 8.13 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:28887076_29153619|GENSCAN_predicted_peptide_1|293_aa MHSEKRKPDIPMLAQCSLLHGEEEQVEASRILIPLINTGEEVHANGMGAETTGKQQEVIV KGSVAKGSKCRGSKHVPIKVQQLQKNHKDLFFNPFNLGYKQALNDDHRLLVSDSRISPPK ETCTRPNASSRCVSRYYCLLYSMIGMQHTPRTKGGHQDSQEKRKKLVPVIKSKYAAGLEN QMKAKSYQQIPMEPDHERHGHHLGTVGNHAATTCSLGKVYLHFTDAETEIQRGKAIFPQQ DYRNQKQQITIAEGEALEKGEQLRAFPSKNKVGMATNAPVTVVPNHIRRGLAC >gi568815576f:28887076_29153619|GENSCAN_predicted_CDS_1|882_bp atgcactcggagaaaagaaagccagatatacccatgctggctcagtgctccttgctccac ggggaagaggagcaagtggaggccagcaggattttaatacctctcatcaacacaggggaa gaggtacatgcaaacgggatgggagcagagacaacaggcaaacagcaagaggtcatcgtc aaaggatcggttgctaagggaagcaagtgtagaggcagcaagcatgtccccatcaaagtc cagcagctgcagaagaaccacaaagacctcttcttcaatcctttcaaccttggttacaaa caggcactcaatgatgaccacaggttgctagttagtgactctcgcatatccccaccaaaa gagacttgcacacggcccaacgctagttcaagatgtgtttctagatattactgccttctc tacagcatgattggcatgcaacatacaccaaggacaaaaggaggccaccaagactcccag gagaagagaaaaaagctagtaccagtaatcaaatctaaatatgctgctgggttggaaaac caaatgaaagcaaaatcttatcaacaaatccccatggaaccagaccatgaaaggcacgga caccatctgggcactgtggggaaccatgcggccaccacctgctcccttggaaaagtttat ctccattttacagatgcagaaactgagatccagagaggcaaagccatctttccacagcag gactacagaaatcagaagcagcaaatcacaattgctgaaggagaagcattagagaagggg gaacaactcagagctttcccttccaaaaacaaggttggtatggcaaccaatgccccagtc accgtggtccctaaccacatccgccggggcttggcatgctga >gi568815576f:28887076_29153619|GENSCAN_predicted_peptide_2|1121_aa MDGSDEDELLYSSPLRIELQKMHPLGLCNNNDEEDLYEYGWVGVVKLEQPELDPKPCLTV LGKYPRRELLSDDHFAANAGRRGQNDLNNVCGHAAGLGRVDCKPLTAFQPERGIDGIVNT GQKVVLMIIVTPRMRVAICIEGEWSFLKFLVADYIRFGLKWDFWKLNQDNDFQVLPKAYL PGIGRIFSPVTRVLLVCDLISYGMKSFCRKTGPLGGSSGDEKTMDGFGSQNQQNLLINRP PNPEIKSVDKFACIVCASLSILSFMQPSLIAKANFEKAKRAVQRGATAVIFDVSENPEAI DQLNQGSEDPLKRPVVYVKGADAIKLMNIVNKQKVARARIQHRPPRQPTEYFDMGIFLAF FVVVSLVCLILLVKIKLKQRRSQNSMNRLAVQALEKMETRKFNSKSKGRREGSCGALDTL SSSSTSDCAICLEKYIDGEELRVIPCTHRFHRKCVDPWLLQHHTCPHCRHNIIEQKGNPS AVCVETSNLSRGRQQRVTLPVHYPGRVHRTNAIPAYPTRTSMDSHGNPVTLLTMDRHGEQ SLYSPQTPAYIRSYPPLHLDHSLAAHRCGLEHRAYSPAHPFRRPKLSGRSFSKAACFSQY ETMYQHYYFQGLSYPEQEGQSPPSLAPRGPARAFPPSGSGSLLFPTVVHVAPPSHLESGS TSSFSCYHGHRSVCSGYLADCPGSDSSSSSSSGQCHCSSSDSVVDCTEVSNQGVYGSCST FRSSLSSDYDPFIYRSRSPCRASEAGGSGSSGRGPALCFEGSPPPEELPAVHSHGAGRGE PWPGPASPSGDQVSTCSLEMNYSSNSSLEHRGPNSSTSEVGLEASPGAAPDLRRTWKGGH ELPSCACCCEPQPSPAGPSAGAAGSSTLFLGPHLYEGSGPAGGEPQSGSSQGLYGLHPDH LPRTDGVKYEGLPCCFYEEKQVARGGGGGSGCYTEDYSVSVQYTLTEEPPPGCYPGARDL SQRIPIIPEDVDCDLGLPSDCQGTHSLGSWGGTRGPDTPRPHRGLGATREEERALCCQAR ALLRPGCPPEEAGAVRANFPSALQDTQESSTTATEAADLRLGNTPAPSEITYTPSVHFCH PIAKLLCLKDRDLTQQTAAAREPELRRNSYLEIGNCMETPN >gi568815576f:28887076_29153619|GENSCAN_predicted_CDS_2|3366_bp atggatggttctgatgaggatgaacttctgtactccagccccctcagaatagagctgcag aagatgcacccactgggcctatgtaataacaatgacgaagaggacttgtatgaatatggc tgggtaggagtggtgaagctggaacagccagaattggacccgaaaccatgcctcactgtc ctaggcaagtatccccggagagaattgttatcagatgatcattttgctgctaatgcaggg aggaggggacagaacgacttgaataacgtgtgtggtcatgcagctggtctgggcagagtg gattgcaagcccctcactgctttccagcccgaaaggggcattgatgggattgtgaacaca ggacagaaagtagtattaatgattattgttacaccccggatgcgtgttgccatatgcatt gaaggggagtggtcctttttgaagtttctagttgctgactatataagatttggcctgaag tgggacttctggaaactcaaccaagataacgacttccaggttcttccaaaagcttatctt ccaggcataggcaggattttctcaccggttacccgtgtgctcttggtgtgtgacctcatt tcctatggaatgaaaagtttctgtcggaaaacagggcccttgggtggcagcagtggggat gaaaaaacgatggatggatttggaagtcagaatcagcaaaatttgctcatcaacagaccg cctaacccagaaattaaatctgttgacaaatttgcctgtattgtctgtgccagtctatct atcttgtcatttatgcagccaagcctgattgccaaggccaactttgaaaaggccaagcga gcagtacagcggggagctactgcagtcatctttgatgtgtctgaaaacccagaagctatt gatcagctgaaccagggctctgaagacccgctcaagaggccggtggtgtatgtgaagggt gcagatgccattaagctgatgaacatcgtcaacaagcagaaagtggctcgagcaaggatc cagcaccgccctcctcgacaacccactgaatactttgacatggggattttcctggctttc ttcgtcgtggtctccttggtctgcctcatcctccttgtcaaaatcaagctgaagcagcga cgcagtcagaattccatgaacaggctggctgtgcaggctctagagaagatggaaaccaga aagttcaactccaagagcaaggggcgccgggaggggagctgtggggccctggacacactc agcagcagctccacgtccgactgtgccatctgtctggagaagtacattgatggagaggag ctgcgggtcatcccctgtactcaccggtttcacaggaagtgcgtggacccctggctgctg cagcaccacacctgcccccactgtcggcacaacatcatagaacaaaagggaaacccaagc gcggtgtgtgtggagaccagcaacctctcacgtggtcggcagcagagggtgaccctgccg gtgcattaccccggccgcgtgcacaggaccaacgccatcccagcctaccctacgaggaca agcatggactcccacggcaaccccgtcaccttgctgaccatggaccggcacggggagcag agcctctattccccgcagacccccgcctacatccgcagctacccacccctccacctggac cacagcctggccgctcaccgctgcggcctggagcaccgggcctactccccagcccacccc ttccgcaggcccaagttgagtggccgcagcttctccaaggcagcttgcttctcccagtat gagaccatgtaccagcactactacttccagggcctcagctacccggagcaggaggggcag tccccacctagcctcgcaccccggggcccggcccgtgcctttcctccgagcggcagtggc agcctgctcttccccaccgtggtgcacgtggccccgccctcccacctggagagcggcagc acgtccagcttcagctgctatcacggccaccgctcggtgtgcagtggctacctggccgac tgcccaggcagcgacagcagcagcagcagcagctccggccagtgccactgttcctccagt gactctgtggtagactgcactgaggtcagcaaccagggcgtgtacgggagctgctccacc ttccgcagctccctcagcagcgactatgaccccttcatctaccgcagccggagcccctgt cgtgccagtgaggcggggggctcgggcagctcgggccggggacctgccctgtgcttcgag ggctccccgcctcccgaggagctcccggcggtgcacagtcatggtgctgggcggggcgag ccttggccgggccctgcctctccctcgggggatcaggtgtccacctgcagcctggagatg aactacagcagcaactcctccctggagcacagggggcccaatagctctacctcagaagtg gggctcgaggcttctcctggggccgcccctgacctcaggaggacctggaaggggggccac gagttgccgtcgtgtgcctgctgctgcgagccccagccctccccagccgggcctagcgcc ggagcagctggcagcagcaccttgttcctggggccccacctctacgagggctctggcccg gcgggtggggagccccagtcaggaagctcccagggcttgtacggccttcaccccgaccat ttgcccaggacagatggggtgaaatacgagggtctgccctgctgcttctatgaagagaag caggtggcccgcgggggcggagggggcagcggctgctacactgaggactactcggtgagt gtgcagtacacgctcaccgaggaaccaccgcccggctgctaccccggggcccgggacctg agccagcgcatccccatcattccagaggatgtggactgtgatctgggcctgccctcggac tgccaagggacccacagcctcggctcctggggtgggacgcgaggcccggataccccacgg ccccacaggggcctgggagcaacccgggaagaggagcgggctctgtgctgccaggctagg gccctactgcggcctggctgccctccggaggaggcgggtgctgtcagggccaacttccct agtgccctccaggacactcaggagtccagcaccactgccactgaggctgcagatcttagg ttaggtaacactccagccccctctgagatcacctacacaccttctgttcacttctgtcat cctattgctaaattactttgtctcaaggaccgagatctcactcagcagacagcagcagcc cgggagcctgagctcaggaggaactcttacctggaaattgggaactgtatggagactcca aactga >gi568815576f:28887076_29153619|GENSCAN_predicted_peptide_3|159_aa MENLAYEGLIGIQSKESSKEKKLTVRQDLEDRYAEHVAATQALPQDSGTAAWKGRVLLPE TQKRQQLSEDTLTIHGLPTEGYQALYHAVVEPMLWNPSGTPKRYSLELGKAIKQKLWEAL CSQGAISEGAQRDRFPGRKQPGVHEEPVLKKWPKLKSKK >gi568815576f:28887076_29153619|GENSCAN_predicted_CDS_3|480_bp atggagaacttggcctacgaaggcctgattggaatccagagtaaagaaagttcaaaggag aaaaaactaacagtccgccaagatcttgaggacagatatgctgaacatgtggctgccacc caagcgctaccccaggacagtgggacagcagcctggaagggccgagtgttgcttcctgaa acccaaaagagacagcagttgtcggaggacacgctaaccatccatggtctccccacagag ggttaccaggctctgtaccacgctgtggtggagccaatgctgtggaatccttcagggacc cccaagaggtacagcctggagctgggcaaggccattaaacaaaagctctgggaggctctt tgcagtcagggtgccatctctgaaggtgctcagagggaccggttccctggcaggaagcag ccaggtgtccacgaggagcctgtactcaagaaatggcccaagttaaagagcaaaaaatag >gi568815576f:28887076_29153619|GENSCAN_predicted_peptide_4|304_aa MAPPAARLALLSAAALTLAARPAPSPGLGPGPGDAPQAGMVPSWDPGYPQARHRRPRARY CPPAAGPGAPQRRPSAQEALSALSLPRPSATPPGLGHLLCLGIWDPLPESLSDPNQAGTP PLPGTSWDPVPSPQRLPPGRDVLCSPVPPRIPSQNPQRLPTGQEAPCSPVPPGIPSPSPQ RPLPGRDDPCSPVPPGIPPQVLHRRTLHRDDSPATRGYTPLSETSSDIPPLGQGPLPEPH CDAPASPSPLLPPTPVEPGASPRRAERRREEPALSPFSRVLGCTRLADIITSASRGTGWT CSPC >gi568815576f:28887076_29153619|GENSCAN_predicted_CDS_4|915_bp atggcgccgccagccgcccgcctcgccctgctctccgccgcggcgctcacgctggcggcc cggcccgcgcctagccccggcctcggccccggacccggcgacgcccctcaggccgggatg gtcccttcctgggacccgggctacccccaggcccgtcatcgacgcccccgggcccggtac tgtcccccggctgcaggacccggtgctcctcagcgacgcccctcagcccaggaggccctc tccgccttgagtcttccaagaccctctgcgacgccccccgggctgggacatcttctctgt ctcgggatctgggacccgctgcccgagtccctcagcgaccccaaccaggccgggacgccc cctctccccggtacctcctgggatcccgtcccaagtccccagcgacttcccccgggccgg gacgtcctctgctccccggtacctcctaggatcccgtcccaaaatccccagcgactcccc acgggccaggaggccccctgctccccggtacctcctgggatcccgtccccaagtccccag cgacccctcccgggccgggacgacccctgctccccagtacctcctgggatcccgccccaa gtccttcatcgacgcaccttgcaccgggacgactcccccgctacaagaggctatacgccc ctctccgagacctccagcgacatccctcccctgggccaaggtcccctccctgagcctcac tgcgacgcccccgcgtcccccagtcctctcctcccgcctacaccggtggaacccggcgcc tccccgcgcagagcagagcggaggcgggaggagccggcgctcagccccttttcccgagtc ctcggctgcacccgcttggcggacattataacttctgcctcgcgaggaacgggatggact tgttcgccctgctag >gi568815576f:28887076_29153619|GENSCAN_predicted_peptide_5|56_aa MAPLQMFSSSIPGLYRLDASSTSLVVTAKMSPSIAKCPLGAKSPPVEITELEEMVK >gi568815576f:28887076_29153619|GENSCAN_predicted_CDS_5|171_bp atggcgccgttgcagatgtttagcagcagcatccctggcctctaccggctagatgccagt agtacctctttagttgtgacagccaaaatgtctccaagtattgcaaaatgtcccctgggg gcaaaatctcctccagttgaaatcactgagttagaagaaatggtcaaataa >gi568815576f:28887076_29153619|GENSCAN_predicted_peptide_6|426_aa MAAVRFGTEVADEQHRVQICEYVTLDGIGDFADMMKDFVIDYPVPGNLGCYKDHGNPPPL TGTSKTSNKLTIQTCISFCRSQRFKFAGMESGYACFCGNNPDYWKYGEAASTECNSVCFG DHTQPCGGDGRIILFDTLVGACGGNYSAMSSVVYSPDFPDTYATGRVCYWTIRVPGASHI HFSFPLFDIRDSADMVELLDGYTHRVLARFHGRSRPPLSFNVSLDFVILYFFSDRINQAQ GFAVLYQAVKEELPQERPAVNQTVAEVITEQANLSVSAARSSKVLYVITTSPSHPPQTVP GWTVYGLATLLILTVTAIVAKILLHVTFKIEVGVKNEEVSGMVLRYLTWVPERREGPLFD IRNVKGARLRTVYQTLCQVTNRHYRIHLCRNPTRELRVREGVQPGAAKVRTQPGTTGSKA GGPYSC >gi568815576f:28887076_29153619|GENSCAN_predicted_CDS_6|1281_bp atggcagctgtcagatttggaacggaagttgcagatgaacagcacagggtgcaaatctgt gaatacgttaccttagatggcataggagattttgcagatatgatgaaagattttgtgata gattatccagtgcctggaaaccttggctgctacaaggatcatggaaacccacctcctcta actggcaccagtaaaacgtccaacaaactcaccatacaaacttgcatcagtttttgtcgg agtcagaggttcaagtttgctgggatggagtcaggctatgcttgcttctgtggaaacaat cctgattactggaagtacggggaggcagccagtaccgaatgcaacagcgtctgcttcggg gatcacacccaaccctgtggtggcgatggcaggatcatcctctttgatactctcgtgggc gcctgcggtgggaactactcagccatgtcttctgtggtctattcccctgacttccccgac acctatgccacggggagggtctgctactggaccatccgggttccgggggcctcccacatc cacttcagcttccccctatttgacatcagggactcggcggacatggtggagcttctggat ggctacacccaccgtgtcctagcccgcttccacgggaggagccgcccacctctgtccttc aacgtctctctggacttcgtcatcttgtatttcttctctgatcgcatcaatcaggcccag ggatttgctgttttataccaagccgtcaaggaagaactgccacaggagaggcccgctgtc aaccagacggtggccgaggtgatcacggagcaggccaacctcagtgtcagcgctgcccgg tcctccaaagtcctctatgtcatcaccaccagccccagccacccacctcagactgtccca ggatggacagtctatggtctggcaactctcctcatcctcacagtcacagccattgtagca aagatacttctgcacgtcacattcaaaatagaggtgggggttaagaatgaggaggtgtct gggatggtgctccggtacctgacatgggtgcctgagcggagagaggggccactcttcgac ataaggaatgtaaaaggagccaggttacgtactgtgtaccagacgctgtgccaagtgacc aacagacattaccgaattcatctttgccgcaacccaacaagggaactgagggtcagagaa ggggtccagccaggggccgccaaggttcggactcagcctgggaccactggctccaaagct gggggtccttactcctgttga