GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:24:30

Sequence gi568815589f:33165079_33380856 : 215778 bp : 44.93% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -   1142   1137    6                               1.05
 1.01 Sngl -   2091   1549  543  0  0   84   35   552 0.945  43.60
 1.00 Prom -   6503   6464   40                              -8.86

 2.00 Prom +  11966  12005   40                              -3.36
 2.01 Init +  12362  12391   30  1  0   59  111    -8 0.653  -1.80
 2.02 Term +  14247  14339   93  2  0   87   55   140 0.923   8.43
 2.03 PlyA +  14885  14890    6                               1.05

 3.03 PlyA -  17760  17755    6                               1.05
 3.02 Term -  22181  22029  153  2  0  109   36    47 0.323  -0.48
 3.01 Init -  24513  24259  255  0  0   67   92    85 0.447   4.06
 3.00 Prom -  27152  27113   40                              -1.86

 4.04 PlyA -  30004  29999    6                               1.05
 4.03 Term -  37406  37317   90  2  0  103   44    63 0.307   1.12
 4.02 Intr -  53626  53534   93  1  0   23   67   106 0.188   2.16
 4.01 Init -  56564  56505   60  2  0   62  111     9 0.349   1.85
 4.00 Prom -  61828  61789   40                              -3.86

 5.00 Prom +  63809  63848   40                              -4.86
 5.01 Init +  75131  75191   61  1  1   90   84   137 0.716  13.05
 5.02 Intr +  80034  80074   41  1  2  113   91   -10 0.799  -0.26
 5.03 Intr +  81538  81650  113  0  2  131   95    57 0.927   9.88
 5.04 Term +  83348  83393   46  2  1   77   42    81 0.852  -0.92
 5.05 PlyA +  83467  83472    6                               1.05

 6.10 PlyA -  87413  87408    6                               1.05
 6.09 Term -  90230  90141   90  2  0   96   51   123 0.979   7.12
 6.08 Intr -  90849  90787   63  0  0   85  109    -1 0.583   0.61
 6.07 Intr -  91830  91723  108  0  0   95  119   131 0.998  17.38
 6.06 Intr -  93955  93842  114  1  0   64   93   117 0.999  10.44
 6.05 Intr -  96091  96009   83  2  2  105   94    51 0.992   6.76
 6.04 Intr -  97752  97624  129  1  0  105   98    74 0.999  10.77
 6.03 Intr -  98308  98159  150  2  0   53   86    44 0.613   0.83
 6.02 Intr -  99517  99146  372  2  0   44   79   354 0.257  25.13
 6.01 Init - 100062  99915  148  0  1   48   11   140 0.222   0.56
 6.00 Prom - 103499 103460   40                              -6.76

 7.00 Prom + 104049 104088   40                              -7.66
 7.01 Init + 105588 105638   51  2  0   78   30    75 0.425   1.86
 7.02 Intr + 106074 106145   72  2  0  102   78    96 0.972   9.60
 7.03 Intr + 111378 111486  109  2  1   46  111   165 0.939  14.46
 7.04 Intr + 113035 113226  192  2  0   66   61    98 0.568   4.36
 7.05 Intr + 125290 125443  154  2  1   79   21    88 0.208   0.33
 7.06 Intr + 129490 130349  860  1  2  106   97   371 0.638  30.80
 7.07 Intr + 177727 177776   50  2  2   39   81   104 0.368   3.20
 7.08 Intr + 179042 179110   69  1  0   40  109    85 0.526   5.18
 7.09 Intr + 187568 187641   74  1  2   62   96    45 0.656   0.90
 7.10 Intr + 189008 189109  102  2  0  106   91    96 0.842  10.99
 7.11 Intr + 198932 199030   99  2  0  111   91    46 0.981   6.43
 7.12 Intr + 199630 199696   67  1  1   86    9    87 0.486  -0.49
 7.13 Intr + 201551 201696  146  1  2   92   96   130 0.657  13.28
 7.14 Term + 204828 204900   73  0  1   97   48    43 0.540  -1.42
 7.15 PlyA + 214471 214476    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589f:33165079_33380856|GENSCAN_predicted_peptide_1|180_aa
MRLREPLLSGSAAMPGASLQRACRLLVAVCALHLGVTLVYYLAGRDLSRLPQLVGVSTPL
QGGSNSAAAIGQSSGELRTGGARPPPPLGASSQPRPGGDSSPVVDSGPGPASNLTSVPVP
HTTALSLPACPEESPLLGKDSGRRQSEDWDPPGFPRQGPPDIPSGWLFYDSQPPSSGSEF

>gi568815589f:33165079_33380856|GENSCAN_predicted_CDS_1|543_bp
atgaggcttcgggagccgctcctgagcggcagcgccgcgatgccaggcgcgtccctacag
cgggcctgccgcctgctcgtggccgtctgcgctctgcaccttggcgtcaccctcgtttac
tacctggctggccgcgacctgagccgcctgccccaactggtcggagtctccacaccgctg
cagggcggctcgaacagtgccgccgccatcgggcagtcctccggggagctccggaccgga
ggggcccggccgccgcctcctctaggcgcctcctcccagccgcgcccgggtggcgactcc
agcccagtcgtggattctggccctggccccgctagcaacttgacctcggtcccagtgccc
cacaccaccgcactgtcgctgcccgcctgccctgaggagtccccgctgcttggtaaggac
tcgggtcggcgccagtcggaggattgggacccccccggatttccccgacagggtccccca
gacattccctcaggctggctcttctacgacagccagcctccctcttctggatcagagttt
taa

>gi568815589f:33165079_33380856|GENSCAN_predicted_peptide_2|40_aa
MAKPSPPSVLFPVLPNLRYGLHQISASERIWKDIDPNFIQ

>gi568815589f:33165079_33380856|GENSCAN_predicted_CDS_2|123_bp
atggccaaacctagcccaccgtctgttttgtttcctgttcttcccaacctgcgctatgga
cttcatcagatttcagcatcagagagaatatggaaggacatcgaccctaacttcatccag
tga

>gi568815589f:33165079_33380856|GENSCAN_predicted_peptide_3|135_aa
MSGLGQQTAPAVKEPAGLWGRHAIKRQLQERQWDECQTDVSAKYGRSSEEASSLVAWDRR
LAGLPTSHPSPPCALLPECLPEHSYGSQGTVPPELELLADRCKNMGTRDAGSSARIRVLA
KQLKSFDSPFPARPF

>gi568815589f:33165079_33380856|GENSCAN_predicted_CDS_3|408_bp
atgtcaggcctggggcaacaaacagcccctgctgtcaaggaacccgcaggcttgtggggc
cgtcatgctatcaaaaggcagctccaagagagacagtgggatgagtgtcaaacagatgtt
tcagccaagtatggcaggagctcagaggaggcctcatcacttgttgcctgggacagacgt
ctggctggtctgcccacctcccaccccagccctccctgtgcgctgctgccagaatgtctt
cctgaacacagctatggttcacaggggacagtccctcctgagctggaacttcttgctgac
aggtgtaagaacatggggaccagagacgcagggtcctctgctcgtatcagagtgctggcc
aagcaactgaagtcctttgacagcccctttcctgctaggcccttctaa

>gi568815589f:33165079_33380856|GENSCAN_predicted_peptide_4|80_aa
MREGCFPLPSSRDTTSAVDQPGSRQKTDGLLSVVIEGFNEDIVYESASKAKVPTIPLFLR
QHQHFQGYEGHSHQMGCNTE

>gi568815589f:33165079_33380856|GENSCAN_predicted_CDS_4|243_bp
atgagggaaggatgcttcccattaccctcatctagagacaccacctctgctgtggaccag
ccagggtctcggcagaaaacagatgggttactcagtgtggtcattgaagggtttaatgaa
gacattgtttatgaaagtgccagcaaggctaaggttcccactatccccttgtttctgaga
cagcatcagcatttccagggctatgaaggccacagccaccagatgggctgcaacactgaa
tga

>gi568815589f:33165079_33380856|GENSCAN_predicted_peptide_5|86_aa
MAVRQWVIALALAALLVVDREVPVAAGKLPFSRMPICEHMVESPTCSQMSNLVCGTDGLT
YTNECQLCLARIKTKQDIQIMKDGKC

>gi568815589f:33165079_33380856|GENSCAN_predicted_CDS_5|261_bp
atggccgtccgccagtgggtaatcgccctggccttggctgccctccttgttgtggacagg
gaagtgccagtggcagcaggaaagctccctttctcaagaatgcccatctgtgaacacatg
gtagagtctccaacctgttcccagatgtccaacctggtctgcggcactgatgggctcaca
tatacgaatgaatgccagctctgcttggcccggataaaaaccaaacaggacatccagatc
atgaaagatggcaaatgctga

>gi568815589f:33165079_33380856|GENSCAN_predicted_peptide_6|418_aa
MQSVRLGGGALGFAFPKSRFILSSREETQTLEQNQKRKQSHSRLDFRPLRREPRQSEPPA
QRGPPPSGRPPARSTASGHDRPTRGAAAGARRPRMKKKTRRRSTRSEELTRSEELTLSEE
ATWSEEATQSEEATQGEEMNRSQEVTRDEESTRSEEVTREEMAAAGLTVTVTHTQIMDAL
KCVQWERLGGKQSLAHYSKRDETNTIQGTGQSSHRKQMACDEKGNEKHDLHVTSQQGSSE
PVVQDLAQVVEEVIGVPQSFQKLIFKGKSLKEMETPLSALGIQDGCRVMLIGKKNSPQEE
VELKKLKHLEKSVEKIADQLEELNKELTGIQQGFLPKDLQAEALCKLDRRVKATIEQFMK
ILEEIDTLILPENFKDSRLKRKGLVKKVQAFLAECDTVEQNICQETERLQSTNFALAE

>gi568815589f:33165079_33380856|GENSCAN_predicted_CDS_6|1257_bp
atgcagtcagtcaggctgggcggcggagccttgggtttcgctttcccgaagagtcggttc
atcttgagcagccgcgaagaaacccaaacactagagcaaaaccagaaacggaagcagagt
cactcccgcctcgacttccggcccctccgccgggagccgcgccagtcggagcccccggcc
cagcgtggtccgcctccctctgggcgtccacctgcccggagtactgccagcgggcatgac
cgacccaccaggggcgccgccgccggcgctcgcaggccgcggatgaagaagaaaacccgg
cgccgctcgacccggagcgaggagttgacccggagcgaggagttgaccctgagtgaggaa
gcgacctggagtgaagaggcgacccagagtgaggaggcgacccagggcgaagagatgaat
cggagccaggaggtgacccgggacgaggagtcgacccggagcgaggaggtgaccagggag
gaaatggcggcagctgggctcaccgtgactgtcacccacacacaaatcatggatgcactc
aagtgtgttcagtgggagagacttggaggaaaacagtcacttgcccattacagcaagcga
gacgagaccaatacaatacagggtacagggcagtcaagccacaggaagcagatggcttgt
gatgaaaaaggcaatgagaagcacgaccttcatgttacctcccagcagggcagcagtgaa
ccagttgtccaagacctggcccaggttgttgaagaggtcataggggttccacagtctttt
cagaaactcatatttaagggaaaatctctgaaggaaatggaaacaccgttgtcagcactt
ggaatacaagatggttgccgggtcatgttaattgggaaaaagaacagtccacaggaagag
gttgaactaaagaagttgaaacatttggagaagtctgtggagaagatagctgaccagctg
gaagagttgaataaagagcttactggaatccagcagggttttctgcccaaggatttgcaa
gctgaagctctctgcaaacttgataggagagtaaaagccacaatagagcagtttatgaag
atcttggaggagattgacacactgatcctgccagaaaatttcaaagacagtagattgaaa
aggaaaggcttggtaaaaaaggttcaggcattcctagccgagtgtgacacagtggagcag
aacatctgccaggagactgagcggctgcagtctacaaactttgccctggccgagtga

>gi568815589f:33165079_33380856|GENSCAN_predicted_peptide_7|705_aa
MEQANYTIQSLKDTKTTVDAMKLGVKEMKKAYKQVKIDQIEDLQDQLEDMMEDANEIQEA
LSRSYGTPELDEDDLEAELDALGDELLADEDSSYLDEAASAPAIPEGVPTDTKNKVKAFS
VYISMQNSKGFMLDHKSGNLKATSLLLPPDTWRVYLTSRTRHTDSGCATSVAVYTARRCE
YSRGAGSPGHVTWQVPYDEISAVHQHSYHPSGSKPKSQQTSFQSSPCNKSPKSHGLQNQP
WQKLRNEKHHIRVKKAQSLAEQTSDTAGLESSTRSESGTDLREHSPSESEKEVVGADPRG
AKPKKATQFVYSYGRGPKVKGKLKCEWSNRTTPKPEDAGPESTKPVGVFHPDSSEASSRK
GVLDGYGARRNEQRRYPQKRPPWEVEGARPRPGRNPPKQEGHRHTNAGHRNNMGPIPKDD
LNERPAKSTCDSENLAVINKSSRRVDQEKCTVRRQDPQVVSPFSRGKQNHVLKNVETHTG
VKNLVIVETARHAGKPFPVVLGPLNVPKPALESMSVTIQVELQCECGRRKEMVICSEASS
TYQRIAAISMASKITDMQLGGSVEISKLITKKEVHQARRLAEAFHISEDSDPFNIRSSGS
KFSDSLKEDARKDLKFVSDVEKEMETLVEAVNKGKNSKKSHSFPPMNRDHRRIIHDLAQV
YGLESVSYDSEPKRNVVVTAIRNPGSSNLQKITKEPIIDYFDVQD

>gi568815589f:33165079_33380856|GENSCAN_predicted_CDS_7|2118_bp
atggaacaagccaattataccatccagtctttgaaggacaccaagaccacggttgatgct
atgaaactgggagtaaaggaaatgaagaaggcatacaagcaagtgaagatcgaccagatt
gaggatttacaagaccagctagaggatatgatggaagatgcaaatgaaatccaagaagca
ctgagtcgcagttatggcaccccagaactggatgaagatgatttagaagcagagttggat
gcactaggtgatgagcttctggctgatgaagacagttcttatttggatgaggcagcatct
gcacctgcaattccagaaggtgttcccactgatacaaaaaacaaggtgaaagctttttct
gtttatatttcaatgcaaaatagcaaaggctttatgcttgaccataagtctgggaatcta
aaagctactagtctcctcctcccgcccgacacctggcgcgtctatctgacgtcacgaacg
cgccacacagattcgggctgcgcaacctctgtggccgtctacacggcgcgcagatgcgaa
tattctcgcggcgccggaagtccggggcacgtgacctggcaggtcccttatgatgaaatc
tctgctgttcatcagcatagttatcatccgtcaggaagcaaacctaagagtcagcagacg
tctttccagtcctctccttgtaataaatcgcccaagagccatggccttcagaatcaacct
tggcagaaattgaggaatgagaagcaccatatcagagtcaagaaagcacagagtcttgct
gagcagacctcagatacagctggattagagagctcgaccagatcagagagtgggacagac
ctcagagagcatagtccttctgagagtgagaaggaagttgtgggtgcagatcccagggga
gcaaaacccaaaaaagcaacacagtttgtatacagctatggtagaggaccaaaagtcaag
gggaaactcaaatgtgaatggagtaaccgaacaactccaaaaccggaggatgctggaccc
gaaagtaccaaacctgtgggggttttccaccctgactcttcagaggcatcctctagaaaa
ggagtattggatgggtatggagccagacgaaatgagcagagaagatacccacagaaaagg
cctccctgggaagtggagggggccaggccacgaccaggcagaaatccaccaaaacaggag
ggccaccgacatacaaacgcaggacacagaaacaacatgggccccattccaaaggatgac
ctcaatgaaagaccagcaaaatctacctgtgacagtgagaacttggcagtcatcaacaag
tcttccaggagggttgaccaagagaaatgcactgtacggaggcaggatcctcaagtagta
tctcctttctcccgaggcaaacagaaccatgtgctaaagaatgtggaaacgcacacaggt
gtgaagaaccttgtcatcgtggaaactgccagacatgctggcaagccattccctgtggta
ctaggccccctgaatgtacccaaacctgcgctagagtccatgagtgtgaccatccaggta
gagctacagtgtgaatgtggacgaagaaaagagatggtgatttgctctgaagcatctagt
acttatcaaagaatagctgcaatctccatggcctctaagataacagacatgcagcttgga
ggttcagtggagatcagcaagttaattaccaaaaaggaagttcatcaagccaggagatta
gcagaggcatttcatatcagtgaggattctgatcctttcaatatacgttcttcagggtca
aaattcagtgatagtttgaaagaagatgccaggaaggacttaaagtttgtcagtgacgtt
gagaaggaaatggaaaccctcgtggaggccgtgaataagggaaagaatagtaagaaaagc
cacagcttccctcccatgaacagagaccaccgccggatcatccatgacttggcccaagtt
tatggcctggagagcgtgagctatgacagtgaaccgaagcgcaatgtggtggtcactgcc
atcaggaatcctgggagcagtaatttacagaaaataaccaaggagccaataattgactat
tttgacgtccaggactaa