GENSCAN 1.0 Date run: 4-Nov-116 Time: 13:39:59 Sequence gi568815596r:237475022_237686102 : 211081 bp : 49.66% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2768 2897 130 1 1 97 63 52 0.620 3.87 1.02 Intr + 6057 6218 162 1 0 85 -7 107 0.439 0.85 1.03 Intr + 8219 8356 138 0 0 75 77 42 0.667 2.24 1.04 Intr + 11246 11288 43 0 1 89 82 56 0.829 2.40 1.05 Intr + 11660 11804 145 2 1 74 50 56 0.495 0.68 1.06 Intr + 12377 12416 40 1 1 141 117 8 0.624 6.80 1.07 Intr + 22546 22723 178 2 1 61 27 160 0.176 6.18 1.08 Intr + 26086 26102 17 1 2 122 81 9 0.020 -1.21 1.09 Intr + 28768 28789 22 2 1 110 91 29 0.000 2.00 1.10 Intr + 35553 35774 222 0 0 48 99 237 0.006 18.04 1.11 Intr + 35968 36080 113 1 2 87 94 184 0.996 18.92 1.12 Term + 37174 37313 140 2 2 70 41 72 0.642 -1.17 1.13 PlyA + 38828 38833 6 1.05 2.03 PlyA - 39244 39239 6 1.05 2.02 Term - 40317 40088 230 1 2 46 39 180 0.561 5.69 2.01 Init - 40940 40874 67 2 1 36 73 98 0.958 2.36 2.00 Prom - 41648 41609 40 1.34 3.00 Prom + 42448 42487 40 -14.38 3.01 Init + 43499 43627 129 1 0 84 106 141 0.984 15.65 3.02 Intr + 44889 45008 120 2 0 49 82 123 0.823 8.49 3.03 Intr + 45047 45076 30 1 0 97 94 10 0.385 0.83 3.04 Intr + 48282 48306 25 2 1 50 91 -10 0.201 -6.90 3.05 Intr + 50590 50784 195 2 0 78 86 79 0.580 6.09 3.06 Intr + 52356 52495 140 1 2 32 97 146 0.943 10.08 3.07 Intr + 59543 59626 84 1 0 117 87 92 0.997 12.02 3.08 Intr + 62805 62970 166 2 1 69 55 87 0.942 3.03 3.09 Intr + 64620 64693 74 1 2 128 47 34 0.876 2.43 3.10 Intr + 65327 65512 186 1 0 24 94 285 0.927 22.59 3.11 Intr + 65781 65936 156 2 0 127 53 174 0.999 18.01 3.12 Intr + 67546 67638 93 0 0 131 117 73 0.997 14.66 3.13 Intr + 71585 71662 78 1 0 108 95 36 0.805 6.05 3.14 Intr + 74200 74257 58 0 1 81 116 20 0.956 2.56 3.15 Term + 77316 77440 125 1 2 97 55 128 0.758 8.95 3.16 PlyA + 80270 80275 6 1.05 4.03 PlyA - 81034 81029 6 1.05 4.02 Term - 82650 82518 133 2 1 92 48 91 0.758 3.06 4.01 Init - 84345 84266 80 0 2 78 69 115 0.683 7.13 4.00 Prom - 88045 88006 40 -5.06 5.00 Prom + 88691 88730 40 -5.16 5.01 Init + 91553 91652 100 1 1 55 73 291 0.941 22.72 5.02 Term + 91991 92154 164 0 2 94 48 157 0.967 10.40 5.03 PlyA + 92238 92243 6 1.05 6.09 PlyA - 92599 92594 6 1.05 6.08 Term - 100107 99998 110 1 2 109 43 84 0.963 4.67 6.07 Intr - 100610 100366 245 1 2 45 92 269 0.900 20.14 6.06 Intr - 100887 100841 47 0 2 73 94 34 0.916 -0.29 6.05 Intr - 102361 102236 126 1 0 134 86 200 0.980 25.28 6.04 Intr - 103134 102983 152 1 2 113 97 194 0.990 22.68 6.03 Intr - 111136 110977 160 1 1 85 87 64 0.191 5.56 6.02 Intr - 116463 116389 75 0 0 91 96 53 0.739 6.11 6.01 Init - 120470 120384 87 2 0 68 96 43 0.176 2.03 6.00 Prom - 128082 128043 40 -4.36 7.00 Prom + 129245 129284 40 -7.06 7.01 Init + 131269 131578 310 0 1 67 79 120 0.625 6.48 7.02 Intr + 136427 136559 133 0 1 91 87 24 0.194 2.30 7.03 Intr + 138158 138232 75 2 0 94 107 20 0.403 3.13 7.04 Intr + 152189 152336 148 2 1 100 19 116 0.364 6.14 7.05 Intr + 152606 152719 114 1 0 2 83 197 0.199 11.24 7.06 Intr + 164821 164963 143 0 2 69 7 78 0.134 -3.05 7.07 Intr + 165417 165573 157 0 1 89 86 107 0.152 10.61 7.08 Term + 169690 169752 63 0 0 117 49 24 0.064 -0.71 7.09 PlyA + 171201 171206 6 -0.45 8.05 PlyA - 171245 171240 6 1.05 8.04 Term - 172759 172738 22 0 1 111 48 15 0.031 -2.42 8.03 Intr - 173225 172969 257 2 2 68 108 49 0.532 1.14 8.02 Intr - 177121 176865 257 2 2 84 22 192 0.257 9.36 8.01 Init - 189430 189358 73 1 1 81 72 32 0.334 2.22 8.00 Prom - 189644 189605 40 -4.66 9.00 Prom + 189723 189762 40 -2.66 9.01 Init + 195153 195237 85 2 1 61 74 50 0.071 1.90 9.02 Term + 208126 208274 149 2 2 66 50 166 0.325 8.66 9.03 PlyA + 210781 210786 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 18406 18515 110 0 2 90 70 160 0.979 14.09 S.002 Init - 32787 32595 193 0 1 66 85 160 0.844 12.63 S.003 Init + 35515 35774 260 0 2 75 99 217 0.953 18.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:237475022_237686102|GENSCAN_predicted_peptide_1|449_aa FPEQLEKLALIPGPATEFSHALSVTQTCQPSKSLPKWIAAVFPTWAHEGGSGFEFLHLLF TAFLLLAVPGPMNQSPLLWLLPAPGAVEIQAQSPLDLGSLSTCREEAKPKATPQWDLPDQ LSLELGYAQAVKLKGASLKPIFGDRKTELQKRFLENRIRAARGCGTLLVLMPGRIEGVRV RSLQQKALGLKSYLLFGVATYWSVALVLDQGLSVPRRGSGGVDAVQGHRGNKHRKKALVL GTKIGSAEELAKIPEIHDETEDNSYNNEPPSLSFIIANLASPMPSCGDAYRWALKGKIKK ESSKRELLSDTAHLNETHCARCLQPYQLLVNSKRQCLECGLFTCKSCGRVHPEEQGWICD PCHLARVVKIGSLEWYYEHVKARFKRFGSAKVIRSLHGRLQGGGLPSGETPYSCLSLKAS SLIEEAEVGIDTFEHSSSRRACCRHFKAS >gi568815596r:237475022_237686102|GENSCAN_predicted_CDS_1|1350_bp ttccctgaacaactagagaagcttgcattaatacctggacctgctacagaattttctcac gctctaagcgtcactcaaacctgccagccttccaaatcacttcccaaatggattgcagca gtattcccaacatgggcccatgaggggggctctggttttgaatttctacatctgctcttc acagctttcctgcttctggctgtgccaggccccatgaaccagtcacccctgctgtggctg ttgcctgccccaggcgctgtggaaatccaggctcagagccccttagacctgggttcccta tccacgtgcagagaggaggctaagcccaaggccacccctcagtgggacttgcctgaccag ttatccttggaactggggtatgcccaggcagtgaaattaaagggtgcttctctgaagccc atctttggtgataggaaaactgaattgcaaaagcggttcctggagaacagaatcagggct gcccgcggctgcgggaccctcctggtcctgatgcctggcagaatcgaaggggtgagggta cgctctctgcagcagaaagccctgggcttaaagtcctatttattatttggcgttgccact tactggtcggtggctttggtgctggaccagggactgagcgtcccccggagagggtccggt ggtgtggacgctgtgcaaggccacagaggaaacaagcatagaaaaaaggctctggttctt ggcaccaaaattggttctgcagaggagctggcgaaaataccagaaatccatgatgaaacg gaagataactcttacaataatgaacctccttccctgtctttcatcattgctaatctcgct tctccaatgcccagctgtggggatgcttaccggtgggcgttgaagggcaagattaagaag gaaagctccaagagggagctgctttccgacactgcccatctgaacgagacccactgcgcc cgctgcctgcagccctaccagctgcttgtgaatagcaaaaggcagtgcctggaatgtggc ctcttcacctgcaaaagctgtggccgcgtccacccggaggagcagggctggatctgtgac ccctgccatctggccagagtcgtgaagatcggctcactggagtggtactatgagcatgtg aaagcccgcttcaagaggttcggaagtgccaaggtcatccggtccctccacgggcggctg cagggtggaggccttccttctggagagacaccttattcgtgcctgtctcttaaagcctca agtttgattgaagaagcagaagttggcatagacacctttgagcattcaagctctcgccgg gcctgctgcaggcatttcaaggcatcttag >gi568815596r:237475022_237686102|GENSCAN_predicted_peptide_2|98_aa MWELALHTTPLPCLLLCEGSARRQPRTRHLASIRRMYSINHRRIFQSLLRDGDLTVHLLS SAPECRTPSSVMSTPAYNSTIDWALGSSGFSGSVLSQQ >gi568815596r:237475022_237686102|GENSCAN_predicted_CDS_2|297_bp atgtgggagctggccctgcacaccacacctctgccttgcctgctcctctgtgagggctcg gctcgcagacagcccaggacccggcacctggcgagcattcggcggatgtactcaatcaat caccgacgcatcttccaaagcctcctgcgagatggagacctcactgtgcacctcctttca tccgcccccgagtgccgtacaccctcctccgtgatgtccacaccagcttacaattctacc atcgactgggccttgggaagctctgggttctcaggctctgtcctgagccagcagtga >gi568815596r:237475022_237686102|GENSCAN_predicted_peptide_3|552_aa MQVSIAAGPELISEERSGDSDQTDEDGEPGSEAQAQAQPFGSKKKRLLSVHDFDFEGDSD DSTQPQGHSLHLSSVPEARDSPQEPEWQVLRPQGLAKVTVIDESCSEKAAPHKAEGLEEA DTGASGCHSHPEEQPTSISPSRHGALAELCPPGGSHRMALGTAAALGSNVIRNEQLPLQY LADVDTSDEESIRAHVMASHHSKRRGRASSESQIFELNKHISAVECLLTYLENTVVPPLA KPSSVSGGDGSQRKGPRTALKFPRWHLCAALASHFKVDFMGPGESPRNVGWMSERADKVA SGTRPFKDFTEIIELSSYNLQGLGAGVRTEADVEEEALRRKLEELTSNVSDQETSSEEEE AKDEKAEPNRDKSVGPLPQADPEVGTAAHQTNRQEKSPQDPGDPVQYNRTTDEELSELED RVAVTASEVQQAESEVSDIESRIAALRAAGLTVKPSGKPRRKSNLPIFLPRVAGKLGKRP EDPNADPSSEAKAMAVPYLLRRKFSNSLKSQGKDDDSFDRKSVYRGSLTQRNPNARKGMA SHTFAVKFSLIL >gi568815596r:237475022_237686102|GENSCAN_predicted_CDS_3|1659_bp atgcaggtatctattgcagctgggcctgaactgatatctgaagagagaagtggagacagc gaccagacagatgaggatggagaacctggctcagaggcccaggcccaggcccagcccttt ggcagcaaaaaaaagcgcctcctctccgtccacgacttcgacttcgagggagactcagat gactccactcagcctcaaggtcactccctgcacctgtcctcagtccctgaggccagggac agcccacaggaacctgagtggcaggtgctcaggccccaggggttggctaaggtaacagta attgatgagtcctgctcagagaaggcagcccctcacaaggctgagggcctggaggaggct gatactggggcctctgggtgccactcccatccggaagagcagccgaccagcatctcacct tccagacacggcgccctggctgagctctgcccgcctggaggctcccacaggatggccctg gggactgctgctgcactcgggtcgaatgtcatcaggaatgagcagctgcccctgcagtac ttggccgatgtggacacctctgatgaggaaagcatccgggctcacgtgatggcctcccac cattccaagcggagaggccgggcgtcttctgagagtcagatctttgagctgaataagcat atttcagctgtggaatgcctgctgacctacctggagaacacagttgtgcctcccttggcc aagccctccagtgtcagtggaggtgatggctcccagaggaagggaccaaggacggcacta aagttccccaggtggcatctctgcgcggccctcgcctcacacttcaaagtcgacttcatg gggccaggggagagccctcggaatgtgggatggatgagtgaacgagcagacaaggtggcc agtggaactagaccctttaaggactttacagagattattgaactgagttcttataacttg cagggtctaggtgctggagtgcgcacggaggccgatgtagaggaggaggccctgaggagg aagctggaggagctgaccagcaacgtcagtgaccaggagacctcgtccgaggaggaggaa gccaaggacgaaaaggcagagcccaacagggacaaatcagttgggcctctcccccaggcg gacccggaggtgggcacggctgcccatcaaaccaacagacaggaaaaaagcccccaggac cctggggaccccgtccagtacaacaggaccacagatgaggagctgtcagagctggaggac agagtggcagtgacggcctcagaagtccagcaggcagagagcgaggtttcagacattgaa tccaggattgcagccctgagggccgcagggctcacggtgaagccctcgggaaagccccgg aggaagtcaaacctcccgatatttctccctcgagtggctgggaaacttggcaagagacca gaggacccaaatgcagacccttcaagtgaggccaaggcaatggctgtgccctatcttctg agaagaaagttcagtaattccctgaaaagtcaaggtaaagatgatgattcttttgatcgg aaatcagtgtaccgaggctcgctgacacagagaaaccccaacgcgaggaaaggaatggcc agccacaccttcgcggtaaagttttctctcattctctga >gi568815596r:237475022_237686102|GENSCAN_predicted_peptide_4|70_aa MGRSWLWPSLPFAALGLGLQLANCCNRKTLHSLMGEVTLGIGSHRVAAVEQPRQEQYSVT RKAVSAVDGF >gi568815596r:237475022_237686102|GENSCAN_predicted_CDS_4|213_bp atgggccgcagctggctttggcccagcctgccgtttgctgccctgggtctaggtctccag ctggcaaactgttgcaacaggaagacactacacagcctgatgggagaggtaaccttggga attggaagccatcgagttgcagctgtggaacagccacggcaagagcaatattcagtcacc agaaaagctgtttctgctgtagatggcttctga >gi568815596r:237475022_237686102|GENSCAN_predicted_peptide_5|87_aa MKVLRAWLLCLLMLGLALRGAASRTHRHSMEIRTPDINPAWYASRGIRPVGRFGRRRATL GDVPKPGLRPRLTCFPLEGGAMSSQDG >gi568815596r:237475022_237686102|GENSCAN_predicted_CDS_5|264_bp atgaaggtgctgagggcctggctcctgtgcctgctgatgctgggcctggccctgcgggga gctgcaagtcgtacccatcggcactccatggagatccgcacccctgacatcaatcctgcc tggtacgccagtcgcgggatcaggcctgtgggccgcttcggtcggaggagggcaaccctg ggggacgtccccaagcctggcctgcgaccccggctgacctgcttccccctggaaggcggt gctatgtcgtcccaggatggctga >gi568815596r:237475022_237686102|GENSCAN_predicted_peptide_6|333_aa MLSLFLAATWASRASSGRGPLDSSPASGQGGGTTVVAELISQQPRGMRPKREVRGMAQAH RTPQPRAAPSQPRVFKLVLLGSGSVGKSSLALRYVKNDFKSILPTVGCAFFTKVVDVGAT SLKLEIWDTAGQEKYHSVCHLYFRGANAALLVYDITRKDSFLKAQQWLKDLEEELHPGEV LVMLVGNKTDLSQEREVTFQRRAVLENEGSPEEGPRSAKKHNLDTKRGGVRAPRLQLSRI LLLEVNQVSLGNSLINSAGFCGCFLQEGKEFADSQKLLFMETSAKLNHQVSEVFNTVAQE LLQRSDEEGQALRGDAAVALNKGPARQAKCCAH >gi568815596r:237475022_237686102|GENSCAN_predicted_CDS_6|1002_bp atgctctccctcttcttggctgccacgtgggcttctcgtgcctcctccgggagaggtcct ctggattcttcccctgcctctgggcagggcggaggaaccactgtggtggcggagcttatt tcccagcagccccgcgggatgagacccaaacgagaagttaggggcatggcacaggcacac aggaccccccagcccagggctgcccccagccagccccgtgtgttcaagctggttctcctg ggaagtggctccgtgggtaagtccagcttggctcttcggtacgtgaagaacgacttcaag agtatcctgcctacggtgggctgtgcgttcttcacaaaggtggtggatgtgggtgccacc tctctgaagcttgagatctgggacacagctggccaggagaagtaccacagcgtctgccac ctctacttcaggggtgccaacgctgcgcttctggtgtacgacatcaccaggaaggattcc ttcctcaaggctcagcagtggctgaaggacctggaggaggagctgcacccaggagaagtc ctggtgatgctggtgggcaacaagacggacctcagccaggagcgggaggtgaccttccag aggagagctgtgttggaaaatgagggttccccagaagaagggcccaggagtgccaagaaa cacaacttggacacgaaacgtggaggagtgcgggctccacggctgcagctttctcggatc ttactgttggaagtgaaccaggtgtccttaggaaactctcttataaacagcgctggattt tgtggctgtttccttcaggaagggaaggagtttgccgacagccagaagttgctgttcatg gaaacttcggccaaactgaaccaccaggtgtcggaggtgttcaatacagtggcccaagag ctactgcagagaagcgacgaggagggccaggctctacggggggatgcagctgtggctctg aacaaggggcccgcgaggcaggccaaatgctgcgcccactag >gi568815596r:237475022_237686102|GENSCAN_predicted_peptide_7|380_aa MRSSWIQVSPKSNDWLPHEKREIWTQKRTQREEAMGRQRQKLGDGPQAKQRPGPPGAPEA RERPGMNSPSEPPEGATSADTLIQSSGLQDLSQYISVISHTACGKVERTKGLLKTHLTKL SHQLKKDWTILLPLSLLRSQTCPQNAARWNPRHLLLVEASLGLTWHSTLQIARSPLNRCP PPWRASLRDYRPAKQSPQSPPSGAPGPSGPACPALGIRSVPVLRATRSMDMGTQGSGRKR LPNRERLTAEDDALNQIAREDFERHPWLLPSRRQELLAIPSCDHQTCFQTLSNGPWGTRY PTEDLCVRLLKIQVWLQEKGQGRPSTLVSPVEVGLETQSSFTPQDHPPDEKGDWMLDNRK DPSGIHIAFCHQFTLDSSNP >gi568815596r:237475022_237686102|GENSCAN_predicted_CDS_7|1143_bp atgaggtcatcctggattcaggtgagccctaaatccaatgactggcttcctcatgagaaa agagagatttggacacagaaacgcacacagagggaagaggccatgggaaggcaaaggcag aaattgggtgatggtccacaggccaagcagcgcccaggaccaccaggagcaccagaggcc agagagaggccaggaatgaattcaccctcggagcctccagaaggagccacgtctgctgac accttgattcagtcttctggcctgcaagacttgagccaatacatttctgttataagccac acagcttgtggaaaagtagaacggactaaaggtcttttaaaaacacacctcaccaagctc agccaccaacttaaaaaggactggacaatacttttaccactttcccttctcagaagtcag acctgtcctcagaatgctgcaagatggaacccccggcatctgctcctagtagaggccagt ctgggcctgacctggcattccaccctgcagatagcgaggtccccactcaaccggtgccca ccgccctggcgcgcgtcgctccgcgattaccggcccgccaaacagtccccgcaaagtcca ccctccggggctcccggtccctcaggtcccgcgtgcccggccctgggcatccgcagcgtc cccgtgctgcgggcgacgcggtcgatggacatgggcacccagggatcggggcgcaagcgg ctccccaaccgggagcggctcacggcggaggacgacgcgctcaaccagatcgcgcgggag gattttgaacggcatccctggcttctgcccagtaggcgccaggagctcctcgctattccc agctgtgaccaccaaacatgcttccagacattgtcaaatggcccatggggaacaagatac ccaactgaggatctctgtgtcagattactcaaaatccaggtgtggctgcaggagaaggga cagggaagaccctcaactttagtctcacctgtggaggtggggctggagacccagtcgtcc ttcaccccccaggaccaccccccagatgagaagggggactggatgttggacaacagaaag gatccatctgggatccacattgcattttgtcatcagttcaccttggactcctccaatcca tga >gi568815596r:237475022_237686102|GENSCAN_predicted_peptide_8|202_aa MPAAEHTATPPRPRHLLMDLSPVRESGGATAFMMDKFMMERSAVTTCTYFSLNMGRPLRA SQCHTTGSTQHHLGAKGTPESGGDAGAAQPNAATGSARACELKLRGYPCKNTGAELAARG RARAQGCRVHSWLRSTKVIQMELNVETREWSVRIQVKQARQNQAKRLVDSYTASISGLEA TAVGSCTDYLEYRRNNEWPATG >gi568815596r:237475022_237686102|GENSCAN_predicted_CDS_8|609_bp atgccagcagctgaacacactgccacaccgccgaggccccgtcatttgctgatggacctc agtccagtcagagagagtgggggcgccactgctttcatgatggacaagtttatgatggaa agatccgccgtcaccacctgcacctacttttcactaaacatgggacgaccactgagggcc tcccagtgccacacaacaggaagcacacagcaccacctaggagcaaaaggtacgccagaa tctggtggagacgctggagctgcacagcccaatgcagccacgggcagcgcccgagcctgt gaactaaagctgagaggctacccgtgcaaaaacacgggtgctgagttggctgctagggga agggctagggctcagggctgccgggtccacagctggctgagaagcaccaaagtcattcaa atggaattgaatgtggaaacacgcgagtggagtgtcagaatccaagtaaagcaagcaagg caaaaccaagcaaaaagactcgtggactcatacactgcctctatttcaggtttggaggcc actgctgtcggaagctgtactgattatttagaatatcgaagaaacaatgagtggccggcc acggggtga >gi568815596r:237475022_237686102|GENSCAN_predicted_peptide_9|77_aa MRTQLAGQKAVSINRRSSCTGTSVPRHLGFPKKPDSTFDTDATNGVETRGEGFSGISATK IKPTVRSEKHLESLQCV >gi568815596r:237475022_237686102|GENSCAN_predicted_CDS_9|234_bp atgagaacacagctggcagggcagaaagcagtgtcgatcaacaggaggtcgtcctgcacc gggacgtcagtgccccgacacctcgggtttcccaaaaagcccgacagcacatttgataca gatgccaccaacggggtggagacaaggggcgagggcttcagtggaatctcagcaactaaa atcaagccgacggtcaggtcagagaagcatttggaaagcctccagtgcgtttag