GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:55:08 Sequence gi568815596f:237466574_237667172 : 200599 bp : 50.05% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1105 1146 42 0 0 105 80 2 0.043 1.56 1.02 Intr + 4801 4971 171 0 0 11 58 113 0.119 0.74 1.03 Intr + 11216 11345 130 1 1 97 63 52 0.369 3.87 1.04 Intr + 14505 14666 162 1 0 85 -7 107 0.247 0.85 1.05 Intr + 16667 16804 138 0 0 75 77 42 0.405 2.24 1.06 Intr + 19694 19736 43 0 1 89 82 56 0.590 2.40 1.07 Intr + 20108 20252 145 2 1 74 50 56 0.372 0.68 1.08 Intr + 20825 20864 40 1 1 141 117 8 0.496 6.80 1.09 Intr + 30994 31171 178 2 1 61 27 160 0.176 6.18 1.10 Intr + 34534 34550 17 1 2 122 81 9 0.020 -1.21 1.11 Intr + 37216 37237 22 2 1 110 91 29 0.000 2.00 1.12 Intr + 44001 44222 222 0 0 48 99 237 0.006 18.04 1.13 Intr + 44416 44528 113 1 2 87 94 184 0.996 18.92 1.14 Term + 45622 45761 140 2 2 70 41 72 0.642 -1.17 1.15 PlyA + 47276 47281 6 1.05 2.03 PlyA - 47692 47687 6 1.05 2.02 Term - 48765 48536 230 1 2 46 39 180 0.561 5.69 2.01 Init - 49388 49322 67 2 1 36 73 98 0.958 2.36 2.00 Prom - 50096 50057 40 1.34 3.00 Prom + 50896 50935 40 -14.38 3.01 Init + 51947 52075 129 1 0 84 106 141 0.984 15.65 3.02 Intr + 53337 53456 120 2 0 49 82 123 0.823 8.49 3.03 Intr + 53495 53524 30 1 0 97 94 10 0.385 0.83 3.04 Intr + 56730 56754 25 2 1 50 91 -10 0.201 -6.90 3.05 Intr + 59038 59232 195 2 0 78 86 79 0.580 6.09 3.06 Intr + 60804 60943 140 1 2 32 97 146 0.943 10.08 3.07 Intr + 67991 68074 84 1 0 117 87 92 0.997 12.02 3.08 Intr + 71253 71418 166 2 1 69 55 87 0.942 3.03 3.09 Intr + 73068 73141 74 1 2 128 47 34 0.876 2.43 3.10 Intr + 73775 73960 186 1 0 24 94 285 0.927 22.59 3.11 Intr + 74229 74384 156 2 0 127 53 174 0.999 18.01 3.12 Intr + 75994 76086 93 0 0 131 117 73 0.997 14.66 3.13 Intr + 80033 80110 78 1 0 108 95 36 0.805 6.05 3.14 Intr + 82648 82705 58 0 1 81 116 20 0.956 2.56 3.15 Term + 85764 85888 125 1 2 97 55 128 0.758 8.95 3.16 PlyA + 88718 88723 6 1.05 4.03 PlyA - 89482 89477 6 1.05 4.02 Term - 91098 90966 133 2 1 92 48 91 0.758 3.06 4.01 Init - 92793 92714 80 0 2 78 69 115 0.683 7.13 4.00 Prom - 96493 96454 40 -5.06 5.00 Prom + 97139 97178 40 -5.16 5.01 Init + 100001 100100 100 1 1 55 73 291 0.941 22.72 5.02 Term + 100439 100602 164 0 2 94 48 157 0.967 10.40 5.03 PlyA + 100686 100691 6 1.05 6.09 PlyA - 101047 101042 6 1.05 6.08 Term - 108555 108446 110 1 2 109 43 84 0.963 4.67 6.07 Intr - 109058 108814 245 1 2 45 92 269 0.900 20.14 6.06 Intr - 109335 109289 47 0 2 73 94 34 0.916 -0.29 6.05 Intr - 110809 110684 126 1 0 134 86 200 0.980 25.28 6.04 Intr - 111582 111431 152 1 2 113 97 194 0.990 22.68 6.03 Intr - 119584 119425 160 1 1 85 87 64 0.191 5.56 6.02 Intr - 124911 124837 75 0 0 91 96 53 0.739 6.11 6.01 Init - 128918 128832 87 2 0 68 96 43 0.176 2.03 6.00 Prom - 136530 136491 40 -4.36 7.00 Prom + 137693 137732 40 -7.06 7.01 Init + 139717 140026 310 0 1 67 79 120 0.625 6.48 7.02 Intr + 144875 145007 133 0 1 91 87 24 0.194 2.30 7.03 Intr + 146606 146680 75 2 0 94 107 20 0.403 3.13 7.04 Intr + 160637 160784 148 2 1 100 19 116 0.364 6.14 7.05 Intr + 161054 161167 114 1 0 2 83 197 0.199 11.24 7.06 Intr + 173269 173411 143 0 2 69 7 78 0.134 -3.05 7.07 Intr + 173865 174021 157 0 1 89 86 107 0.152 10.61 7.08 Term + 178138 178200 63 0 0 117 49 24 0.064 -0.71 7.09 PlyA + 179649 179654 6 -0.45 8.05 PlyA - 179693 179688 6 1.05 8.04 Term - 181207 181186 22 0 1 111 48 15 0.031 -2.42 8.03 Intr - 181673 181417 257 2 2 68 108 49 0.532 1.14 8.02 Intr - 185569 185313 257 2 2 84 22 192 0.256 9.36 8.01 Init - 197878 197806 73 1 1 81 72 32 0.306 2.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 26854 26963 110 0 2 90 70 160 0.980 14.09 S.002 Init - 41235 41043 193 0 1 66 85 160 0.844 12.63 S.003 Init + 43963 44222 260 0 2 75 99 217 0.953 18.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:237466574_237667172|GENSCAN_predicted_peptide_1|520_aa MDLVWISPVLRVLMVCSEEICDYAHLRHYGKSDATTRQSLKLIEIETGRAPEQSRVDGHV QNRDSPPAAKLFPEQLEKLALIPGPATEFSHALSVTQTCQPSKSLPKWIAAVFPTWAHEG GSGFEFLHLLFTAFLLLAVPGPMNQSPLLWLLPAPGAVEIQAQSPLDLGSLSTCREEAKP KATPQWDLPDQLSLELGYAQAVKLKGASLKPIFGDRKTELQKRFLENRIRAARGCGTLLV LMPGRIEGVRVRSLQQKALGLKSYLLFGVATYWSVALVLDQGLSVPRRGSGGVDAVQGHR GNKHRKKALVLGTKIGSAEELAKIPEIHDETEDNSYNNEPPSLSFIIANLASPMPSCGDA YRWALKGKIKKESSKRELLSDTAHLNETHCARCLQPYQLLVNSKRQCLECGLFTCKSCGR VHPEEQGWICDPCHLARVVKIGSLEWYYEHVKARFKRFGSAKVIRSLHGRLQGGGLPSGE TPYSCLSLKASSLIEEAEVGIDTFEHSSSRRACCRHFKAS >gi568815596f:237466574_237667172|GENSCAN_predicted_CDS_1|1563_bp atggaccttgtttggatttcaccagttttacgtgtgctcatggtctgttcagaagaaatc tgtgattatgcgcatctcagacactatgggaagtcagatgcgaccacccgccagtcactc aagctgatcgagatagaaacagggagggctccagagcagagccgggtagatggacacgtt caaaacagggacagtccccctgctgcaaagctgttccctgaacaactagagaagcttgca ttaatacctggacctgctacagaattttctcacgctctaagcgtcactcaaacctgccag ccttccaaatcacttcccaaatggattgcagcagtattcccaacatgggcccatgagggg ggctctggttttgaatttctacatctgctcttcacagctttcctgcttctggctgtgcca ggccccatgaaccagtcacccctgctgtggctgttgcctgccccaggcgctgtggaaatc caggctcagagccccttagacctgggttccctatccacgtgcagagaggaggctaagccc aaggccacccctcagtgggacttgcctgaccagttatccttggaactggggtatgcccag gcagtgaaattaaagggtgcttctctgaagcccatctttggtgataggaaaactgaattg caaaagcggttcctggagaacagaatcagggctgcccgcggctgcgggaccctcctggtc ctgatgcctggcagaatcgaaggggtgagggtacgctctctgcagcagaaagccctgggc ttaaagtcctatttattatttggcgttgccacttactggtcggtggctttggtgctggac cagggactgagcgtcccccggagagggtccggtggtgtggacgctgtgcaaggccacaga ggaaacaagcatagaaaaaaggctctggttcttggcaccaaaattggttctgcagaggag ctggcgaaaataccagaaatccatgatgaaacggaagataactcttacaataatgaacct ccttccctgtctttcatcattgctaatctcgcttctccaatgcccagctgtggggatgct taccggtgggcgttgaagggcaagattaagaaggaaagctccaagagggagctgctttcc gacactgcccatctgaacgagacccactgcgcccgctgcctgcagccctaccagctgctt gtgaatagcaaaaggcagtgcctggaatgtggcctcttcacctgcaaaagctgtggccgc gtccacccggaggagcagggctggatctgtgacccctgccatctggccagagtcgtgaag atcggctcactggagtggtactatgagcatgtgaaagcccgcttcaagaggttcggaagt gccaaggtcatccggtccctccacgggcggctgcagggtggaggccttccttctggagag acaccttattcgtgcctgtctcttaaagcctcaagtttgattgaagaagcagaagttggc atagacacctttgagcattcaagctctcgccgggcctgctgcaggcatttcaaggcatct tag >gi568815596f:237466574_237667172|GENSCAN_predicted_peptide_2|98_aa MWELALHTTPLPCLLLCEGSARRQPRTRHLASIRRMYSINHRRIFQSLLRDGDLTVHLLS SAPECRTPSSVMSTPAYNSTIDWALGSSGFSGSVLSQQ >gi568815596f:237466574_237667172|GENSCAN_predicted_CDS_2|297_bp atgtgggagctggccctgcacaccacacctctgccttgcctgctcctctgtgagggctcg gctcgcagacagcccaggacccggcacctggcgagcattcggcggatgtactcaatcaat caccgacgcatcttccaaagcctcctgcgagatggagacctcactgtgcacctcctttca tccgcccccgagtgccgtacaccctcctccgtgatgtccacaccagcttacaattctacc atcgactgggccttgggaagctctgggttctcaggctctgtcctgagccagcagtga >gi568815596f:237466574_237667172|GENSCAN_predicted_peptide_3|552_aa MQVSIAAGPELISEERSGDSDQTDEDGEPGSEAQAQAQPFGSKKKRLLSVHDFDFEGDSD DSTQPQGHSLHLSSVPEARDSPQEPEWQVLRPQGLAKVTVIDESCSEKAAPHKAEGLEEA DTGASGCHSHPEEQPTSISPSRHGALAELCPPGGSHRMALGTAAALGSNVIRNEQLPLQY LADVDTSDEESIRAHVMASHHSKRRGRASSESQIFELNKHISAVECLLTYLENTVVPPLA KPSSVSGGDGSQRKGPRTALKFPRWHLCAALASHFKVDFMGPGESPRNVGWMSERADKVA SGTRPFKDFTEIIELSSYNLQGLGAGVRTEADVEEEALRRKLEELTSNVSDQETSSEEEE AKDEKAEPNRDKSVGPLPQADPEVGTAAHQTNRQEKSPQDPGDPVQYNRTTDEELSELED RVAVTASEVQQAESEVSDIESRIAALRAAGLTVKPSGKPRRKSNLPIFLPRVAGKLGKRP EDPNADPSSEAKAMAVPYLLRRKFSNSLKSQGKDDDSFDRKSVYRGSLTQRNPNARKGMA SHTFAVKFSLIL >gi568815596f:237466574_237667172|GENSCAN_predicted_CDS_3|1659_bp atgcaggtatctattgcagctgggcctgaactgatatctgaagagagaagtggagacagc gaccagacagatgaggatggagaacctggctcagaggcccaggcccaggcccagcccttt ggcagcaaaaaaaagcgcctcctctccgtccacgacttcgacttcgagggagactcagat gactccactcagcctcaaggtcactccctgcacctgtcctcagtccctgaggccagggac agcccacaggaacctgagtggcaggtgctcaggccccaggggttggctaaggtaacagta attgatgagtcctgctcagagaaggcagcccctcacaaggctgagggcctggaggaggct gatactggggcctctgggtgccactcccatccggaagagcagccgaccagcatctcacct tccagacacggcgccctggctgagctctgcccgcctggaggctcccacaggatggccctg gggactgctgctgcactcgggtcgaatgtcatcaggaatgagcagctgcccctgcagtac ttggccgatgtggacacctctgatgaggaaagcatccgggctcacgtgatggcctcccac cattccaagcggagaggccgggcgtcttctgagagtcagatctttgagctgaataagcat atttcagctgtggaatgcctgctgacctacctggagaacacagttgtgcctcccttggcc aagccctccagtgtcagtggaggtgatggctcccagaggaagggaccaaggacggcacta aagttccccaggtggcatctctgcgcggccctcgcctcacacttcaaagtcgacttcatg gggccaggggagagccctcggaatgtgggatggatgagtgaacgagcagacaaggtggcc agtggaactagaccctttaaggactttacagagattattgaactgagttcttataacttg cagggtctaggtgctggagtgcgcacggaggccgatgtagaggaggaggccctgaggagg aagctggaggagctgaccagcaacgtcagtgaccaggagacctcgtccgaggaggaggaa gccaaggacgaaaaggcagagcccaacagggacaaatcagttgggcctctcccccaggcg gacccggaggtgggcacggctgcccatcaaaccaacagacaggaaaaaagcccccaggac cctggggaccccgtccagtacaacaggaccacagatgaggagctgtcagagctggaggac agagtggcagtgacggcctcagaagtccagcaggcagagagcgaggtttcagacattgaa tccaggattgcagccctgagggccgcagggctcacggtgaagccctcgggaaagccccgg aggaagtcaaacctcccgatatttctccctcgagtggctgggaaacttggcaagagacca gaggacccaaatgcagacccttcaagtgaggccaaggcaatggctgtgccctatcttctg agaagaaagttcagtaattccctgaaaagtcaaggtaaagatgatgattcttttgatcgg aaatcagtgtaccgaggctcgctgacacagagaaaccccaacgcgaggaaaggaatggcc agccacaccttcgcggtaaagttttctctcattctctga >gi568815596f:237466574_237667172|GENSCAN_predicted_peptide_4|70_aa MGRSWLWPSLPFAALGLGLQLANCCNRKTLHSLMGEVTLGIGSHRVAAVEQPRQEQYSVT RKAVSAVDGF >gi568815596f:237466574_237667172|GENSCAN_predicted_CDS_4|213_bp atgggccgcagctggctttggcccagcctgccgtttgctgccctgggtctaggtctccag ctggcaaactgttgcaacaggaagacactacacagcctgatgggagaggtaaccttggga attggaagccatcgagttgcagctgtggaacagccacggcaagagcaatattcagtcacc agaaaagctgtttctgctgtagatggcttctga >gi568815596f:237466574_237667172|GENSCAN_predicted_peptide_5|87_aa MKVLRAWLLCLLMLGLALRGAASRTHRHSMEIRTPDINPAWYASRGIRPVGRFGRRRATL GDVPKPGLRPRLTCFPLEGGAMSSQDG >gi568815596f:237466574_237667172|GENSCAN_predicted_CDS_5|264_bp atgaaggtgctgagggcctggctcctgtgcctgctgatgctgggcctggccctgcgggga gctgcaagtcgtacccatcggcactccatggagatccgcacccctgacatcaatcctgcc tggtacgccagtcgcgggatcaggcctgtgggccgcttcggtcggaggagggcaaccctg ggggacgtccccaagcctggcctgcgaccccggctgacctgcttccccctggaaggcggt gctatgtcgtcccaggatggctga >gi568815596f:237466574_237667172|GENSCAN_predicted_peptide_6|333_aa MLSLFLAATWASRASSGRGPLDSSPASGQGGGTTVVAELISQQPRGMRPKREVRGMAQAH RTPQPRAAPSQPRVFKLVLLGSGSVGKSSLALRYVKNDFKSILPTVGCAFFTKVVDVGAT SLKLEIWDTAGQEKYHSVCHLYFRGANAALLVYDITRKDSFLKAQQWLKDLEEELHPGEV LVMLVGNKTDLSQEREVTFQRRAVLENEGSPEEGPRSAKKHNLDTKRGGVRAPRLQLSRI LLLEVNQVSLGNSLINSAGFCGCFLQEGKEFADSQKLLFMETSAKLNHQVSEVFNTVAQE LLQRSDEEGQALRGDAAVALNKGPARQAKCCAH >gi568815596f:237466574_237667172|GENSCAN_predicted_CDS_6|1002_bp atgctctccctcttcttggctgccacgtgggcttctcgtgcctcctccgggagaggtcct ctggattcttcccctgcctctgggcagggcggaggaaccactgtggtggcggagcttatt tcccagcagccccgcgggatgagacccaaacgagaagttaggggcatggcacaggcacac aggaccccccagcccagggctgcccccagccagccccgtgtgttcaagctggttctcctg ggaagtggctccgtgggtaagtccagcttggctcttcggtacgtgaagaacgacttcaag agtatcctgcctacggtgggctgtgcgttcttcacaaaggtggtggatgtgggtgccacc tctctgaagcttgagatctgggacacagctggccaggagaagtaccacagcgtctgccac ctctacttcaggggtgccaacgctgcgcttctggtgtacgacatcaccaggaaggattcc ttcctcaaggctcagcagtggctgaaggacctggaggaggagctgcacccaggagaagtc ctggtgatgctggtgggcaacaagacggacctcagccaggagcgggaggtgaccttccag aggagagctgtgttggaaaatgagggttccccagaagaagggcccaggagtgccaagaaa cacaacttggacacgaaacgtggaggagtgcgggctccacggctgcagctttctcggatc ttactgttggaagtgaaccaggtgtccttaggaaactctcttataaacagcgctggattt tgtggctgtttccttcaggaagggaaggagtttgccgacagccagaagttgctgttcatg gaaacttcggccaaactgaaccaccaggtgtcggaggtgttcaatacagtggcccaagag ctactgcagagaagcgacgaggagggccaggctctacggggggatgcagctgtggctctg aacaaggggcccgcgaggcaggccaaatgctgcgcccactag >gi568815596f:237466574_237667172|GENSCAN_predicted_peptide_7|380_aa MRSSWIQVSPKSNDWLPHEKREIWTQKRTQREEAMGRQRQKLGDGPQAKQRPGPPGAPEA RERPGMNSPSEPPEGATSADTLIQSSGLQDLSQYISVISHTACGKVERTKGLLKTHLTKL SHQLKKDWTILLPLSLLRSQTCPQNAARWNPRHLLLVEASLGLTWHSTLQIARSPLNRCP PPWRASLRDYRPAKQSPQSPPSGAPGPSGPACPALGIRSVPVLRATRSMDMGTQGSGRKR LPNRERLTAEDDALNQIAREDFERHPWLLPSRRQELLAIPSCDHQTCFQTLSNGPWGTRY PTEDLCVRLLKIQVWLQEKGQGRPSTLVSPVEVGLETQSSFTPQDHPPDEKGDWMLDNRK DPSGIHIAFCHQFTLDSSNP >gi568815596f:237466574_237667172|GENSCAN_predicted_CDS_7|1143_bp atgaggtcatcctggattcaggtgagccctaaatccaatgactggcttcctcatgagaaa agagagatttggacacagaaacgcacacagagggaagaggccatgggaaggcaaaggcag aaattgggtgatggtccacaggccaagcagcgcccaggaccaccaggagcaccagaggcc agagagaggccaggaatgaattcaccctcggagcctccagaaggagccacgtctgctgac accttgattcagtcttctggcctgcaagacttgagccaatacatttctgttataagccac acagcttgtggaaaagtagaacggactaaaggtcttttaaaaacacacctcaccaagctc agccaccaacttaaaaaggactggacaatacttttaccactttcccttctcagaagtcag acctgtcctcagaatgctgcaagatggaacccccggcatctgctcctagtagaggccagt ctgggcctgacctggcattccaccctgcagatagcgaggtccccactcaaccggtgccca ccgccctggcgcgcgtcgctccgcgattaccggcccgccaaacagtccccgcaaagtcca ccctccggggctcccggtccctcaggtcccgcgtgcccggccctgggcatccgcagcgtc cccgtgctgcgggcgacgcggtcgatggacatgggcacccagggatcggggcgcaagcgg ctccccaaccgggagcggctcacggcggaggacgacgcgctcaaccagatcgcgcgggag gattttgaacggcatccctggcttctgcccagtaggcgccaggagctcctcgctattccc agctgtgaccaccaaacatgcttccagacattgtcaaatggcccatggggaacaagatac ccaactgaggatctctgtgtcagattactcaaaatccaggtgtggctgcaggagaaggga cagggaagaccctcaactttagtctcacctgtggaggtggggctggagacccagtcgtcc ttcaccccccaggaccaccccccagatgagaagggggactggatgttggacaacagaaag gatccatctgggatccacattgcattttgtcatcagttcaccttggactcctccaatcca tga >gi568815596f:237466574_237667172|GENSCAN_predicted_peptide_8|202_aa MPAAEHTATPPRPRHLLMDLSPVRESGGATAFMMDKFMMERSAVTTCTYFSLNMGRPLRA SQCHTTGSTQHHLGAKGTPESGGDAGAAQPNAATGSARACELKLRGYPCKNTGAELAARG RARAQGCRVHSWLRSTKVIQMELNVETREWSVRIQVKQARQNQAKRLVDSYTASISGLEA TAVGSCTDYLEYRRNNEWPATG >gi568815596f:237466574_237667172|GENSCAN_predicted_CDS_8|609_bp atgccagcagctgaacacactgccacaccgccgaggccccgtcatttgctgatggacctc agtccagtcagagagagtgggggcgccactgctttcatgatggacaagtttatgatggaa agatccgccgtcaccacctgcacctacttttcactaaacatgggacgaccactgagggcc tcccagtgccacacaacaggaagcacacagcaccacctaggagcaaaaggtacgccagaa tctggtggagacgctggagctgcacagcccaatgcagccacgggcagcgcccgagcctgt gaactaaagctgagaggctacccgtgcaaaaacacgggtgctgagttggctgctagggga agggctagggctcagggctgccgggtccacagctggctgagaagcaccaaagtcattcaa atggaattgaatgtggaaacacgcgagtggagtgtcagaatccaagtaaagcaagcaagg caaaaccaagcaaaaagactcgtggactcatacactgcctctatttcaggtttggaggcc actgctgtcggaagctgtactgattatttagaatatcgaagaaacaatgagtggccggcc acggggtga