GENSCAN 1.0	Date run: 3-Nov-116	Time: 23:30:16

Sequence gi568815589f:33194418_33447114 : 252697 bp : 46.68% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   7936   8076  141  0  0   98   43    90 0.390   3.43
 1.02 PlyA +  12012  12017    6                               1.05

 2.03 PlyA -  13103  13098    6                               1.05
 2.02 Term -  17496  17396  101  1  2   62   44    96 0.531   0.89
 2.01 Init -  20817  20751   67  0  1   45   94    58 0.401   3.33
 2.00 Prom -  32489  32450   40                              -3.86

 3.00 Prom +  34470  34509   40                              -4.86
 3.01 Init +  45792  45852   61  2  1   90   84   137 0.716  13.05
 3.02 Intr +  50695  50735   41  2  2  113   91   -10 0.799  -0.26
 3.03 Intr +  52199  52311  113  1  2  131   95    57 0.927   9.88
 3.04 Term +  54009  54054   46  0  1   77   42    81 0.852  -0.92
 3.05 PlyA +  54128  54133    6                               1.05

 4.10 PlyA -  58074  58069    6                               1.05
 4.09 Term -  60891  60802   90  0  0   96   51   123 0.979   7.12
 4.08 Intr -  61510  61448   63  1  0   85  109    -1 0.583   0.61
 4.07 Intr -  62491  62384  108  1  0   95  119   131 0.998  17.38
 4.06 Intr -  64616  64503  114  2  0   64   93   117 0.999  10.44
 4.05 Intr -  66752  66670   83  0  2  105   94    51 0.992   6.76
 4.04 Intr -  68413  68285  129  2  0  105   98    74 0.999  10.77
 4.03 Intr -  68969  68820  150  0  0   53   86    44 0.613   0.83
 4.02 Intr -  70178  69807  372  0  0   44   79   354 0.257  25.13
 4.01 Init -  70723  70576  148  1  1   48   11   140 0.222   0.56
 4.00 Prom -  74160  74121   40                              -6.76

 5.00 Prom +  74710  74749   40                              -7.66
 5.01 Init +  76249  76299   51  0  0   78   30    75 0.425   1.86
 5.02 Intr +  76735  76806   72  0  0  102   78    96 0.972   9.60
 5.03 Intr +  82039  82147  109  0  1   46  111   165 0.939  14.46
 5.04 Intr +  83696  83887  192  0  0   66   61    98 0.568   4.36
 5.05 Intr +  95951  96104  154  0  1   79   21    88 0.208   0.33
 5.06 Intr + 100151 101010  860  2  2  106   97   371 0.638  30.80
 5.07 Intr + 148388 148437   50  0  2   39   81   104 0.368   3.20
 5.08 Intr + 149703 149771   69  2  0   40  109    85 0.526   5.18
 5.09 Intr + 158229 158302   74  2  2   62   96    45 0.656   0.90
 5.10 Intr + 159669 159770  102  0  0  106   91    96 0.842  10.99
 5.11 Intr + 169593 169691   99  0  0  111   91    46 0.981   6.43
 5.12 Intr + 170291 170357   67  2  1   86    9    87 0.485  -0.49
 5.13 Intr + 172212 172357  146  2  2   92   96   130 0.666  13.28
 5.14 Term + 175489 175561   73  1  1   97   48    43 0.492  -1.42
 5.15 PlyA + 177808 177813    6                              -0.45

 6.11 PlyA - 177948 177943    6                               1.05
 6.10 Term - 180550 180419  132  1  0   78   39    80 0.147   0.19
 6.09 Intr - 190873 190663  211  0  1  102   39   133 0.320   8.72
 6.08 Intr - 191449 191232  218  1  2   58   84   376 0.988  31.50
 6.07 Intr - 191611 191478  134  2  2   90   99    17 0.505   3.36
 6.06 Intr - 192124 191987  138  2  0  131   79   193 0.999  23.14
 6.05 Intr - 192675 192552  124  0  1   67  105   104 0.957  10.16
 6.04 Intr - 200778 200661  118  2  1   30  101   130 0.001   8.97
 6.03 Intr - 206870 206820   51  1  0   84  115    16 0.012   1.92
 6.02 Intr - 233102 233036   67  0  1  135    4   101 0.021   4.36
 6.01 Init - 244741 244669   73  1  1   64   86    40 0.157   2.63
 6.00 Prom - 245000 244961   40                              -8.86

 7.06 PlyA - 246761 246756    6                               1.05
 7.05 Term - 247794 247626  169  2  1  124   48   293 0.995  26.25
 7.04 Intr - 248101 247884  218  1  2   80   94   308 0.999  27.90
 7.03 Intr - 248553 248435  119  1  2   91   80   170 0.999  16.68
 7.02 Intr - 249041 248904  138  0  0   71  103   117 0.999  11.94
 7.01 Intr - 249475 249349  127  1  1  126   95   204 0.999  25.15

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 233102 233011   92  0  2  135   42    89 0.888   6.88

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589f:33194418_33447114|GENSCAN_predicted_peptide_1|46_aa
DQGRQSKGGLGYPMSFSVAAHLVAVAFIALEMLMLSQKQGDSGNLE

>gi568815589f:33194418_33447114|GENSCAN_predicted_CDS_1|141_bp
gaccagggcagacagagcaagggtggcctgggatatccaatgtcattcagtgttgcagcc
catctggtggctgtggccttcatagccctggaaatgctgatgctgtctcagaaacaaggg
gatagtgggaacctggaataa

>gi568815589f:33194418_33447114|GENSCAN_predicted_peptide_2|55_aa
MQSQVSVHPPVGNAKADERSSAGEQAPSCACSSLVTHNTTDLQDLNLIVPGSEEG

>gi568815589f:33194418_33447114|GENSCAN_predicted_CDS_2|168_bp
atgcaaagccaggtgagcgttcatcctccagtggggaatgcaaaggcagatgagcgttca
tccgcaggggaacaggctccaagctgtgcctgctcctctctggtgactcacaacacgact
gatctccaagatctaaacctgattgtgcctggaagtgaagaagggtga

>gi568815589f:33194418_33447114|GENSCAN_predicted_peptide_3|86_aa
MAVRQWVIALALAALLVVDREVPVAAGKLPFSRMPICEHMVESPTCSQMSNLVCGTDGLT
YTNECQLCLARIKTKQDIQIMKDGKC

>gi568815589f:33194418_33447114|GENSCAN_predicted_CDS_3|261_bp
atggccgtccgccagtgggtaatcgccctggccttggctgccctccttgttgtggacagg
gaagtgccagtggcagcaggaaagctccctttctcaagaatgcccatctgtgaacacatg
gtagagtctccaacctgttcccagatgtccaacctggtctgcggcactgatgggctcaca
tatacgaatgaatgccagctctgcttggcccggataaaaaccaaacaggacatccagatc
atgaaagatggcaaatgctga

>gi568815589f:33194418_33447114|GENSCAN_predicted_peptide_4|418_aa
MQSVRLGGGALGFAFPKSRFILSSREETQTLEQNQKRKQSHSRLDFRPLRREPRQSEPPA
QRGPPPSGRPPARSTASGHDRPTRGAAAGARRPRMKKKTRRRSTRSEELTRSEELTLSEE
ATWSEEATQSEEATQGEEMNRSQEVTRDEESTRSEEVTREEMAAAGLTVTVTHTQIMDAL
KCVQWERLGGKQSLAHYSKRDETNTIQGTGQSSHRKQMACDEKGNEKHDLHVTSQQGSSE
PVVQDLAQVVEEVIGVPQSFQKLIFKGKSLKEMETPLSALGIQDGCRVMLIGKKNSPQEE
VELKKLKHLEKSVEKIADQLEELNKELTGIQQGFLPKDLQAEALCKLDRRVKATIEQFMK
ILEEIDTLILPENFKDSRLKRKGLVKKVQAFLAECDTVEQNICQETERLQSTNFALAE

>gi568815589f:33194418_33447114|GENSCAN_predicted_CDS_4|1257_bp
atgcagtcagtcaggctgggcggcggagccttgggtttcgctttcccgaagagtcggttc
atcttgagcagccgcgaagaaacccaaacactagagcaaaaccagaaacggaagcagagt
cactcccgcctcgacttccggcccctccgccgggagccgcgccagtcggagcccccggcc
cagcgtggtccgcctccctctgggcgtccacctgcccggagtactgccagcgggcatgac
cgacccaccaggggcgccgccgccggcgctcgcaggccgcggatgaagaagaaaacccgg
cgccgctcgacccggagcgaggagttgacccggagcgaggagttgaccctgagtgaggaa
gcgacctggagtgaagaggcgacccagagtgaggaggcgacccagggcgaagagatgaat
cggagccaggaggtgacccgggacgaggagtcgacccggagcgaggaggtgaccagggag
gaaatggcggcagctgggctcaccgtgactgtcacccacacacaaatcatggatgcactc
aagtgtgttcagtgggagagacttggaggaaaacagtcacttgcccattacagcaagcga
gacgagaccaatacaatacagggtacagggcagtcaagccacaggaagcagatggcttgt
gatgaaaaaggcaatgagaagcacgaccttcatgttacctcccagcagggcagcagtgaa
ccagttgtccaagacctggcccaggttgttgaagaggtcataggggttccacagtctttt
cagaaactcatatttaagggaaaatctctgaaggaaatggaaacaccgttgtcagcactt
ggaatacaagatggttgccgggtcatgttaattgggaaaaagaacagtccacaggaagag
gttgaactaaagaagttgaaacatttggagaagtctgtggagaagatagctgaccagctg
gaagagttgaataaagagcttactggaatccagcagggttttctgcccaaggatttgcaa
gctgaagctctctgcaaacttgataggagagtaaaagccacaatagagcagtttatgaag
atcttggaggagattgacacactgatcctgccagaaaatttcaaagacagtagattgaaa
aggaaaggcttggtaaaaaaggttcaggcattcctagccgagtgtgacacagtggagcag
aacatctgccaggagactgagcggctgcagtctacaaactttgccctggccgagtga

>gi568815589f:33194418_33447114|GENSCAN_predicted_peptide_5|705_aa
MEQANYTIQSLKDTKTTVDAMKLGVKEMKKAYKQVKIDQIEDLQDQLEDMMEDANEIQEA
LSRSYGTPELDEDDLEAELDALGDELLADEDSSYLDEAASAPAIPEGVPTDTKNKVKAFS
VYISMQNSKGFMLDHKSGNLKATSLLLPPDTWRVYLTSRTRHTDSGCATSVAVYTARRCE
YSRGAGSPGHVTWQVPYDEISAVHQHSYHPSGSKPKSQQTSFQSSPCNKSPKSHGLQNQP
WQKLRNEKHHIRVKKAQSLAEQTSDTAGLESSTRSESGTDLREHSPSESEKEVVGADPRG
AKPKKATQFVYSYGRGPKVKGKLKCEWSNRTTPKPEDAGPESTKPVGVFHPDSSEASSRK
GVLDGYGARRNEQRRYPQKRPPWEVEGARPRPGRNPPKQEGHRHTNAGHRNNMGPIPKDD
LNERPAKSTCDSENLAVINKSSRRVDQEKCTVRRQDPQVVSPFSRGKQNHVLKNVETHTG
VKNLVIVETARHAGKPFPVVLGPLNVPKPALESMSVTIQVELQCECGRRKEMVICSEASS
TYQRIAAISMASKITDMQLGGSVEISKLITKKEVHQARRLAEAFHISEDSDPFNIRSSGS
KFSDSLKEDARKDLKFVSDVEKEMETLVEAVNKGKNSKKSHSFPPMNRDHRRIIHDLAQV
YGLESVSYDSEPKRNVVVTAIRNPGSSNLQKITKEPIIDYFDVQD

>gi568815589f:33194418_33447114|GENSCAN_predicted_CDS_5|2118_bp
atggaacaagccaattataccatccagtctttgaaggacaccaagaccacggttgatgct
atgaaactgggagtaaaggaaatgaagaaggcatacaagcaagtgaagatcgaccagatt
gaggatttacaagaccagctagaggatatgatggaagatgcaaatgaaatccaagaagca
ctgagtcgcagttatggcaccccagaactggatgaagatgatttagaagcagagttggat
gcactaggtgatgagcttctggctgatgaagacagttcttatttggatgaggcagcatct
gcacctgcaattccagaaggtgttcccactgatacaaaaaacaaggtgaaagctttttct
gtttatatttcaatgcaaaatagcaaaggctttatgcttgaccataagtctgggaatcta
aaagctactagtctcctcctcccgcccgacacctggcgcgtctatctgacgtcacgaacg
cgccacacagattcgggctgcgcaacctctgtggccgtctacacggcgcgcagatgcgaa
tattctcgcggcgccggaagtccggggcacgtgacctggcaggtcccttatgatgaaatc
tctgctgttcatcagcatagttatcatccgtcaggaagcaaacctaagagtcagcagacg
tctttccagtcctctccttgtaataaatcgcccaagagccatggccttcagaatcaacct
tggcagaaattgaggaatgagaagcaccatatcagagtcaagaaagcacagagtcttgct
gagcagacctcagatacagctggattagagagctcgaccagatcagagagtgggacagac
ctcagagagcatagtccttctgagagtgagaaggaagttgtgggtgcagatcccagggga
gcaaaacccaaaaaagcaacacagtttgtatacagctatggtagaggaccaaaagtcaag
gggaaactcaaatgtgaatggagtaaccgaacaactccaaaaccggaggatgctggaccc
gaaagtaccaaacctgtgggggttttccaccctgactcttcagaggcatcctctagaaaa
ggagtattggatgggtatggagccagacgaaatgagcagagaagatacccacagaaaagg
cctccctgggaagtggagggggccaggccacgaccaggcagaaatccaccaaaacaggag
ggccaccgacatacaaacgcaggacacagaaacaacatgggccccattccaaaggatgac
ctcaatgaaagaccagcaaaatctacctgtgacagtgagaacttggcagtcatcaacaag
tcttccaggagggttgaccaagagaaatgcactgtacggaggcaggatcctcaagtagta
tctcctttctcccgaggcaaacagaaccatgtgctaaagaatgtggaaacgcacacaggt
gtgaagaaccttgtcatcgtggaaactgccagacatgctggcaagccattccctgtggta
ctaggccccctgaatgtacccaaacctgcgctagagtccatgagtgtgaccatccaggta
gagctacagtgtgaatgtggacgaagaaaagagatggtgatttgctctgaagcatctagt
acttatcaaagaatagctgcaatctccatggcctctaagataacagacatgcagcttgga
ggttcagtggagatcagcaagttaattaccaaaaaggaagttcatcaagccaggagatta
gcagaggcatttcatatcagtgaggattctgatcctttcaatatacgttcttcagggtca
aaattcagtgatagtttgaaagaagatgccaggaaggacttaaagtttgtcagtgacgtt
gagaaggaaatggaaaccctcgtggaggccgtgaataagggaaagaatagtaagaaaagc
cacagcttccctcccatgaacagagaccaccgccggatcatccatgacttggcccaagtt
tatggcctggagagcgtgagctatgacagtgaaccgaagcgcaatgtggtggtcactgcc
atcaggaatcctgggagcagtaatttacagaaaataaccaaggagccaataattgactat
tttgacgtccaggactaa

>gi568815589f:33194418_33447114|GENSCAN_predicted_peptide_6|421_aa
MQGKEARGRNQGNVLPGGSGHVREGDSYEKELEAQKLLMGSAVEEACIYKSERQNMVQAS
GHRRSTRGSKMVSWSVIAKIQEILQRKMVREFLAEFMSTYVMMVFGLGSVAHMVLNKKYG
SYLGVNLGFGFGVTMGVHVAGRISGAHMNAAVTFANCALGRVPWRKFPVYVLGQFLGSFL
AAATIYSLFYRRSQQGVPPDRQDKNSGWRLYRDVSLLVGLGLGHCRGPVAWGGAQAWLTG
MLQLCLFAITDQENNPALPGTEALVIGILVVIIGVSLGMNTGYAINPSRDLPPRIFTFIA
GWGKQVFSNGENWWWVPVVAPLLGAYLGGIIYLVFIGSTIPREPLKLEDSVAYEDHGITV
LPKMGSHEPTISPLTPVSELQEELYKCSNSKAVLRQFQGQFTALVKDTDLRGISDHRLFA
E

>gi568815589f:33194418_33447114|GENSCAN_predicted_CDS_6|1266_bp
atgcagggcaaagaagcacgtggccgcaaccaagggaacgtgttgcctggagggtcaggc
catgtgagagagggggatagttacgagaaagaattagaggcccagaaacttctaatgggg
tcagccgtggaagaagcctgcatctacaaatctgaaagacaaaacatggttcaagcatcc
gggcacaggcggtccacccgtggctccaaaatggtctcctggtccgtgatagcaaagatc
caggaaatactgcagaggaagatggtgcgagagttcctggccgagttcatgagcacatat
gtcatgatggtattcggccttggttccgtggcccatatggttctaaataaaaaatatggg
agctaccttggtgtcaacttgggttttggcttcggagtcaccatgggagtgcacgtggca
ggccgcatctctggagcccacatgaacgcagctgtgacctttgctaactgtgcgctgggc
cgcgtgccctggaggaagtttccggtctatgtgctggggcagttcctgggctccttcctg
gcggctgccaccatctacagtctcttctacagacggagccagcagggagtccctccggat
agacaggacaagaactctggatggagactgtaccgagacgtgtctctgctggtgggcttg
ggtctggggcactgccgaggtcctgtggcttggggaggggcccaggcgtggctgaccggg
atgctccagctgtgtctcttcgccatcacggaccaggagaacaacccagcactgccagga
acagaggcgctggtgataggcatcctcgtggtcatcatcggggtgtcccttggcatgaac
acaggatatgccatcaacccgtcccgggacctgcccccccgcatcttcaccttcattgct
ggttggggcaaacaggtcttcagcaatggggagaactggtggtgggtgccagtggtggca
ccacttctgggtgcctatctaggtggcatcatctacctggtcttcattggctccaccatc
ccacgggagcccctgaaattggaggattctgtggcgtatgaagaccacgggataaccgta
ttgcccaagatgggatctcatgaacccacgatctctcccctcacccccgtctctgaactc
caagaggagctgtacaaatgctccaattccaaggcagttttaagacagtttcaggggcaa
ttcacagctcttgtcaaggacacagacctgagaggaatttcagatcaccggttgtttgct
gagtga

>gi568815589f:33194418_33447114|GENSCAN_predicted_peptide_7|256_aa
MFGCGSVAQVVLSRGTHGGFLTINLAFGFAVTLGILIAGQVSGAHLNPAVTFAMCFLARE
PWIKLPIYTLAQTLGAFLGAGIVFGLYYDAIWHFADNQLFVSGPNGTAGIFATYPSGHLD
MINGFFDQFIGTASLIVCVLAIVDPYNNPVPRGLEAFTVGLVVLVIGTSMGFNSGYAVNP
ARDFGPRLFTALAGWGSAVFTTGQHWWWVPIVSPLLGSIAGVFVYQLMIGCHLEQPPPSN
EEENVKLAHVKHKEQI

>gi568815589f:33194418_33447114|GENSCAN_predicted_CDS_7|771_bp
atgtttggctgtggctccgtggcccaggttgtgctcagccggggcacccacggtggtttc
ctcaccatcaacctggcctttggctttgctgtcactctgggcatcctcatcgctggccag
gtctctggggcccacctgaaccctgccgtgacctttgccatgtgcttcctggctcgtgag
ccctggatcaagctgcccatctacaccctggcacagacgctgggagccttcttgggtgct
ggaatagtttttgggctgtattatgatgcaatctggcacttcgccgacaaccagcttttt
gtttcgggccccaatggcacagccggcatctttgctacctacccctctggacacttggat
atgatcaatggcttctttgaccagttcataggcacagcctcccttatcgtgtgtgtgctg
gccattgttgacccctacaacaaccccgtcccccgaggcctggaggccttcaccgtgggc
ctggtggtcctggtcattggcacctccatgggcttcaactccggctatgccgtcaaccct
gcccgggactttggcccccgcctttttacagcccttgcgggctggggctctgcagtcttc
acgaccggccagcattggtggtgggtgcccatcgtgtccccactcctgggctccattgcg
ggtgtcttcgtgtaccagctgatgatcggctgccacctggagcagcccccaccctccaac
gaggaagagaatgtgaagctggcccatgtgaagcacaaggagcagatctga