GENSCAN 1.0	Date run: 7-Nov-116	Time: 18:44:04

Sequence gi568815594r:184595291_184834062 : 238772 bp : 43.18% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +  12189  12228   40                              -2.56
 1.01 Init +  12316  12343   28  0  1   78   96    18 0.390   1.36
 1.02 Term +  29285  29484  200  0  2  116   36   167 0.513  11.86
 1.03 PlyA +  30043  30048    6                               1.05

 2.04 PlyA -  30500  30495    6                               1.05
 2.03 Term -  34211  33982  230  0  2   86   43   123 0.938   4.39
 2.02 Intr -  35851  35747  105  2  0   44   85    99 0.719   5.39
 2.01 Init -  37104  36978  127  0  1   40   95    84 0.715   4.62
 2.00 Prom -  55131  55092   40                              -2.26

 3.00 Prom +  61507  61546   40                              -3.46
 3.01 Init +  65423  65442   20  1  2   59   61    21 0.075  -3.94
 3.02 Intr +  66484  66613  130  1  1   36   84    84 0.770   3.50
 3.03 Intr +  70627  70774  148  0  1   65  111    46 0.784   4.41
 3.04 Intr +  76988  77170  183  0  0   63  123     8 0.248   1.56
 3.05 Intr +  84630  84654   25  1  1   67   53    52 0.470  -3.32
 3.06 Intr +  86958  87046   89  0  2  106   99    45 0.884   7.01
 3.07 Intr +  90119  90208   90  0  0   87   88    28 0.777   2.67
 3.08 Intr +  90286  90394  109  2  1   41  101    63 0.641   2.24
 3.09 Intr +  96209  96291   83  2  2   34   93    43 0.494  -1.32
 3.10 Term +  99323  99489  167  0  2    6   36   181 0.226   2.78
 3.11 PlyA + 100957 100962    6                               1.05

 4.11 PlyA - 103674 103669    6                               1.05
 4.10 Term - 103715 103697   19  1  1  116   54   -16 0.257  -4.41
 4.09 Intr - 105591 105530   62  0  2   50  103    87 0.512   3.93
 4.08 Intr - 107151 107073   79  2  1   81  103    65 0.913   6.85
 4.07 Intr - 114890 114782  109  0  1   55   89    85 0.825   4.64
 4.06 Intr - 117723 117654   70  0  1   96   80    15 0.917   0.25
 4.05 Intr - 121343 121107  237  2  0   74  103   184 0.959  16.31
 4.04 Intr - 122892 122749  144  0  0   62   31   166 0.889   8.78
 4.03 Intr - 123189 123131   59  1  2   81   84     2 0.874  -2.30
 4.02 Intr - 124763 124659  105  0  0   68   68    68 0.400   2.99
 4.01 Init - 125937 125925   13  0  1   81   84     2 0.577  -0.41
 4.00 Prom - 127227 127188   40                              -5.56

 5.00 Prom + 127674 127713   40                               0.14
 5.01 Init + 138771 139125  355  2  1  100  110   236 0.913  22.30
 5.02 Intr + 143006 143367  362  0  2   38   34   175 0.190   2.04
 5.03 Term + 150640 150765  126  0  0   76   48   128 0.480   5.88
 5.04 PlyA + 151145 151150    6                               1.05

 6.23 PlyA - 157059 157054    6                               1.05
 6.22 Term - 161975 161835  141  2  0  104   36   117 0.804   6.03
 6.21 Intr - 162416 162345   72  2  0   80   80    27 0.493   0.70
 6.20 Intr - 162630 162529  102  0  0   61  116    43 0.963   4.77
 6.19 Intr - 165210 165067  144  0  0   86   91   161 0.998  16.68
 6.18 Intr - 167233 167117  117  1  0  120  100    65 0.997  11.56
 6.17 Intr - 167965 167877   89  2  2   79  105   100 0.995  10.49
 6.16 Intr - 169635 169563   73  0  1   80   80    18 0.629  -0.82
 6.15 Intr - 170696 170601   96  2  0   62   94    52 0.688   3.41
 6.14 Intr - 171466 171332  135  1  0   43   92   188 0.989  15.46
 6.13 Intr - 173160 173026  135  0  0   73   91   181 0.999  17.66
 6.12 Intr - 177864 177791   74  1  2   92   78    48 0.839   3.33
 6.11 Intr - 178424 178373   52  2  1   83  109    -4 0.859  -0.32
 6.10 Intr - 181372 181194  179  2  2   94  105   190 0.996  20.94
 6.09 Intr - 181693 181594  100  1  1   58  102   113 0.994   9.38
 6.08 Intr - 185143 185042  102  1  0  137   64    84 0.988  11.27
 6.07 Intr - 188701 188637   65  2  2  116  109    19 0.960   5.24
 6.06 Intr - 193441 193327  115  1  1  113   82   106 0.987  12.52
 6.05 Intr - 195943 195849   95  2  2   88   65    28 0.766   0.18
 6.04 Intr - 202530 202364  167  2  2   23  110    68 0.321   2.10
 6.03 Intr - 203629 203271  359  1  2  -11   -8   279 0.101   2.45
 6.02 Intr - 208256 208030  227  0  2  103  101   297 0.567  30.20
 6.01 Init - 216425 216302  124  2  1   40   50    44 0.036  -3.87
 6.00 Prom - 219424 219385   40                              -4.56

 7.00 Prom + 228642 228681   40                              -1.46
 7.01 Init + 230774 230998  225  1  0   82   50   149 0.938   6.97
 7.02 Term + 231257 231466  210  1  0   81   42   123 0.830   4.29
 7.03 PlyA + 232680 232685    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:184595291_184834062|GENSCAN_predicted_peptide_1|75_aa
MEYYAAIKKASITDLFLTTASITDLFLTTASITDLFLTTASIIDLFLTTASIIDLFLTTA
SITYLFLTTASIIQC

>gi568815594r:184595291_184834062|GENSCAN_predicted_CDS_1|228_bp
atggaatactacgcagccataaaaaaggcctccatcactgacctgtttctgaccacagcc
tccatcactgacctgtttctgaccacagcctccatcactgacctgtttctgaccacagcc
tccatcattgacctgtttctgaccacagcctccatcattgacctgtttctgaccacagcc
tccatcacttacctgtttctgaccacagcctccatcattcagtgttag

>gi568815594r:184595291_184834062|GENSCAN_predicted_peptide_2|153_aa
MTSRSGTDVDAANLRETFRNLKYEVRNKNDLTREEIVELMRDELDCGIETDSGVDDDMAC
HKIPVEADFLYAYSTAPGYYSWRNSKDGSWFIQSLCAMLKQYADKLEFMHILTRVNRKVA
TEFESFSFDATFHAKKQIPCIVSMLTKELYFYH

>gi568815594r:184595291_184834062|GENSCAN_predicted_CDS_2|462_bp
atgacatctcggtctggtacagatgtcgatgcagcaaacctcagggaaacattcagaaac
ttgaaatatgaagtcaggaataaaaatgatcttacacgtgaagaaattgtggaattgatg
cgtgatgaactggactgtggcattgagacagacagtggtgttgatgatgacatggcgtgt
cataaaataccagtggaggccgacttcttgtatgcatactccacagcacctggttattat
tcttggcgaaattcaaaggatggctcctggttcatccagtcgctttgtgccatgctgaaa
cagtatgccgacaagcttgaatttatgcacattcttacccgggttaaccgaaaggtggca
acagaatttgagtccttttcctttgacgctacttttcatgcaaagaaacagattccatgt
attgtttccatgctcacaaaagaactctatttttatcactaa

>gi568815594r:184595291_184834062|GENSCAN_predicted_peptide_3|347_aa
MELTATGKNLLHCYEVIPENAVCKLYFDLEFNKPANPGADGKKMVALLIEYVCKALQELY
GVNCSAEDVLNLDSSTDEKFSRHLIFQLHDVAFKDNIHVEAPARQGFSFNKMFTEKATEE
SWTSNSKKLERLGSAEQSSPDLSFLVVKNNMGEKHLFVDLGNRFLVPERFSDTLRILTCE
PSQNKQKGVGYFNSIGTSVETIEGFQCSPYPEVDHFVLSLVNKDGIKGGIRRWNYFFPEE
LLVYDICKYRWCENIGRAHKSNNIMILVDLKNEVWYQKCHDPVCKAENFKSDCASADAVW
DNGIDDAYFLEATEDAELAEAAENSLLSYNSEVDEIPDELIIEVLQE

>gi568815594r:184595291_184834062|GENSCAN_predicted_CDS_3|1044_bp
atggaactgactgctacaggaaaaaatctcttacactgctatgaagttattcctgaaaat
gctgtgtgcaagctttattttgatttggaatttaacaaacctgccaacccaggagctgat
gggaaaaagatggttgcattactcattgagtatgtgtgtaaagcacttcaagagttatac
ggtgttaattgctcagctgaagatgttttgaacttggattctagcactgatgaaaaattc
agccggcatttaatatttcagctccatgatgtggcatttaaagataatattcatgttgaa
gcacctgcaagacaaggattttctttcaataaaatgttcacagaaaaggctacagaggaa
agctggacatcgaattcaaagaaactggagaggctggggtcagctgagcaaagcagtcct
gacctttcatttctagttgtgaagaataacatgggagagaagcatctttttgtagatctc
ggaaaccggttcctggtgccagaaaggttctcagatactttacgaattcttacatgtgag
ccatctcagaataaacaaaaaggagttggatattttaacagtatcggcacttcagtagaa
accattgaaggttttcagtgttctccctatcctgaagttgatcattttgttctttctttg
gtgaataaagatggcattaaaggaggaattcggcgttggaactactttttcccagaagaa
ttactggtttatgatatttgtaaatatcggtggtgtgaaaacattggaagagcccataag
agtaataatataatgattctggttgatctgaaaaatgaagtttggtatcaaaaatgtcat
gaccctgtatgtaaagcagaaaacttcaaatctgactgtgcatctgctgatgctgtctgg
gataatggcattgatgatgcttattttttagaagctactgaagatgctgaattagctgaa
gctgcagagaacagtcttctcagttataacagtgaagtggatgaaattcctgatgaacta
attatagaagtattacaagagtaa

>gi568815594r:184595291_184834062|GENSCAN_predicted_peptide_4|298_aa
MRLAGLVPGALFSSFDEVMFHWMVLMLMDVYQCLDIEESGLPFACMHGIEGPCASLDPGR
LPTAVASAQLLQWIQKALQWVQKVLQLEAGTDHRRSMQHAYVGALQPPGRKLRPISDDSE
SIEESDTRRKVKSAEKISTQRHEVIRTTASSELSEKPAESVTSKKTGPLSAQPSVEKENL
AIESQSKTQKKGKISHDKRKKSRSKAIGSDTSDIVHIWCPEGMKTSDIKELNIVLPEFEK
THLEHQQRIESKVCKAAIATFYVNVKEQFIKMMISDIEKKRQRMIEVQDELLRYKFRN

>gi568815594r:184595291_184834062|GENSCAN_predicted_CDS_4|897_bp
atgaggctagcaggattggtccctggtgccttatttagttcatttgatgaggtcatgttt
cactggatggtcttgatgcttatggatgtttatcagtgtctagacattgaagagtcaggg
ctcccctttgcctgtatgcatggcatagagggcccctgtgccagtttggatcctgggaga
ctgcccacagctgtggcctctgcccagctgctgcagtggatccagaaggcgctgcagtgg
gtgcagaaggtgctgcagctggaggctggcactgaccataggagatccatgcagcatgcc
tatgtgggtgccctgcagcccccaggaagaaagctcaggcccattagtgatgactctgaa
agcattgaagaaagtgatacaaggagaaaagttaaatcagcagagaaaataagtacacaa
cgtcatgaggttattcgaaccacagcgtcttcagaactttcagagaaaccagctgagtct
gtcacttctaaaaagacaggaccccttagtgcccagccctctgttgaaaaagagaacttg
gcaatagaaagtcaatcgaaaactcagaaaaaagggaagatatctcatgacaaaaggaag
aaatcaagaagtaaagccataggctcagatacttctgacattgtgcacatttggtgtcca
gaaggaatgaaaaccagtgacatcaaggagttgaatattgttttgcctgaatttgagaaa
acccacctagagcatcaacaaagaatagaatctaaagtttgtaaggcagccatcgccaca
ttttatgttaatgttaaagaacaattcatcaaaatgatgatttcagatatcgaaaagaaa
aggcagcgtatgattgaagtccaggatgaactgcttcgatataaatttagaaattga

>gi568815594r:184595291_184834062|GENSCAN_predicted_peptide_5|280_aa
MVPLSALERLEAPAKPLAAPNSQSGPSTRACAGLPERPPGSSLAWSVRENWVWLLQRNGA
GVHAAQPVTYGPKHAKVFRAFAPTRIASFPAPESPGATWDGPLWGGERGGASVPESWAVV
PNPYTVLSQIPEEAEWFTVLELKDACFCIPLHSDSQFLFAFEDPTDHTSRLTWTVLPQGF
GDRPHLFGQALAQDLCHFSSPGTLVLQYVDDLLLATSSEASCQQATLELLKFLANQGYKH
WVHQGADEERAWRFDDVIGAQSASLSDVVNTASKSVQEDE

>gi568815594r:184595291_184834062|GENSCAN_predicted_CDS_5|843_bp
atggtgccgctctccgctctcgagcgactggaagctcccgccaagcccctcgccgctccg
aacagccaatccgggccgagcacgcgcgcgtgcgcaggacttcctgagcggcccccgggc
tcctccctggcctggagcgtgagggaaaactgggtctggttattacagcgcaacggagct
ggagtccatgccgcacaaccagttacctacggtccaaaacatgccaaagttttccgggct
ttcgcgcccacccgcatcgccagttttccggcgccagaaagcccgggagccacgtgggac
ggccccttgtggggtggggaaaggggaggagctagtgtccctgagtcgtgggcagttgta
cccaacccctatactgtgctctctcaaataccagaggaagcagaatggttcacggttctg
gagctcaaggatgcctgcttctgtattcccctgcactctgactcccagtttctctttgcc
tttgaggatcccacagaccacacgtcccgacttacatggacagtcttgccccaagggttt
ggggatagacctcatctgtttggtcaggcactggcccaagatctatgccacttctcaagt
ccaggcactctggtccttcagtatgtggatgatttacttttggctaccagttcggaagcc
tcatgccagcaggctactctagagctcttgaaatttctagctaatcaagggtacaagcac
tgggtccaccagggggcagatgaagaaagggcctggagattcgacgatgtcattggagca
cagagcgcctcactttcagacgttgtaaatactgcttcaaagtcagttcaagaagatgaa
taa

>gi568815594r:184595291_184834062|GENSCAN_predicted_peptide_6|920_aa
MFRHPELKLCARSAITLIPPPPAQPPVTSLLLSVSVNLLILEFSLENYQHRTMQAHELFR
YFRMPELVDFRQYVRTLPTNTLMGFGAFAALTTFWYATRPKPLKPPCDLSMQSVEVAMRT
FVIAEGGSSAGKSCVQDLLLPVSTDPCDIHRSSECLRCSAGYLISVCSYTSSDHNQCYAG
TASLALLWIGGILKGCLLWKQFRWTERSHWNFGYWALGSPGNGNGCCVALFARGGHCTRA
EKAAALRELQASRKYRSDPLSLAEGESHLETRAVNASDLFSCHRTTAVGSFSARGTPLQS
AESGVAFEERALKKTMNAFLLCQEGSGGARRSALLDSDEPLVYFYDDVTTLYEGFQRGIQ
VSNNGPCLGSRKPDQPYEWLSYKQVAELSECIGSALIQKGFKTAPDQFIGIFAQNRPEWV
IIEQGCFAYSMVIVPLYDTLGNEAITYIVNKAELSLVFVDKPEKAKLLLEGVENKLIPGL
KIIVVMDAYGSELVERGQRCGVEVTSMKAMEPPAPEDLAVICFTSGTTGNPKGAMVTHRN
IVSDCSAFVKATECVMLCHGAKIGFFQGDIRLLMDDLKVLQPTVFPVVPRLLNRMFDRIF
GQANTTLKRWLLDFASKRKEAELRSGIIRNNSLWDRLIFHKVQSSLGGRVRLMVTGAAPV
SATVLTFLRAALGCQFYEGYGQTECTAGCCLTMPGDWTAGHVGAPMPCNLIKLVDVEEMN
YMAAEGEGEVCVKGPNVFQGYLKDPAKTAEALDKDGWLHTGDIGKWLPNGTLKIIDRKKH
IFKLAQGEYIAPEKIENIYMRSEPVAQVFVHGESLQAFLIAIVVPDVETLCSWAQKRGFE
GSFEELCRNKDVKKAILEDMVRLGKDSGLKPFEQVKGITLHPELFSIDNGLLTPTMKAKR
PELRNYFRSQIDDLYSTIKV

>gi568815594r:184595291_184834062|GENSCAN_predicted_CDS_6|2763_bp
atgtttcgtcatcccgaattgaaactctgtgcccgttcagcaataacgctcattcctcct
ccccctgcccagcccccagtaacctctctgctgctttctgtctctgtgaatttgctgatt
ctagaattcagcttagagaactatcaacacaggacaatgcaagcccatgagctgttccgg
tattttcgaatgccagagctggttgacttccgacagtacgtgcgtactcttccgaccaac
acgcttatgggcttcggagcttttgcagcactcaccaccttctggtacgccacgagaccc
aaacccctgaagccgccatgcgacctctccatgcagtcagtggaagtggcgatgagaacc
tttgtaattgctgaaggaggtagtagtgcaggcaagtcctgtgtgcaagacctgctgctc
ccagttagtacggacccctgtgacattcacagaagttcagaatgtctgagatgctctgca
ggctaccttatctccgtctgcagctacacctccagtgatcacaatcagtgctacgctggc
acagccagcctggccctgctctggattggaggcatcctcaagggctgcttgctgtggaag
cagtttcgctggaccgagaggagccactggaattttgggtactgggccttagggtcaccc
gggaatgggaatggctgctgtgtggccctgtttgccagagggggccattgtaccagggca
gagaaggctgctgccctcagggagctccaggccagcaggaagtaccgctcggaccccttg
tccttggcagagggtgagagccatcttgagactagagcagtcaatgcaagtgacttgttt
tcatgccatagaactacagctgtgggttcattctcagctaggggaactcctctgcagagt
gcagaaagtggtgttgcttttgaagagagagcactgaaaaagaccatgaatgcatttctg
ttgtgtcaagagggtagtggtggtgcacgaagatccgcactacttgacagcgacgagccc
ttggtgtatttctatgatgatgtcacaacattatacgaaggtttccagaggggaatacag
gtgtcaaataatggcccttgtttaggctctcggaaaccagaccaaccctatgaatggctt
tcatataaacaggttgcagaattgtcggagtgcataggctcagcactgatccagaagggc
ttcaagactgccccagatcagttcattggcatctttgctcaaaatagacctgagtgggtg
attattgaacaaggatgctttgcttattcgatggtgatcgttccactttatgataccctt
ggaaatgaagccatcacgtacatagtcaacaaagctgaactctctctggtttttgttgac
aagccagagaaggccaaactcttattagagggtgtagaaaataagttaataccaggcctt
aaaatcatagttgtcatggatgcctacggcagtgaactggtggaacgaggccagaggtgt
ggggtggaagtcaccagcatgaaggcgatggagcctccagcacctgaagatcttgcagta
atttgtttcacaagtggaactacaggcaaccccaaaggagcaatggtcactcaccgaaac
atagtgagcgattgttcagcttttgtgaaagcaacagagtgtgtaatgctgtgtcatgga
gctaaaatcggatttttccaaggagatatcaggctgctcatggatgacctcaaggtgctt
caacccactgtcttccccgtggttccaagactgctgaaccggatgtttgaccgaattttc
ggacaagcaaacaccacgctgaagcgatggctcttggactttgcctccaagaggaaagaa
gcagagcttcgcagcggcatcatcagaaacaacagcctgtgggaccggctgatcttccac
aaagtacagtcgagcctgggcggaagagtccggctgatggtgacaggagccgccccggtg
tctgccactgtgctgacgttcctcagagcagccctgggctgtcagttttatgaaggatac
ggacagacagagtgcactgccgggtgctgcctgaccatgcctggagactggaccgcaggc
catgttggggccccgatgccgtgcaatttgataaaacttgttgatgtggaagaaatgaat
tacatggctgccgagggcgagggcgaggtgtgtgtgaaagggccaaatgtatttcagggc
tacttgaaggacccagcgaaaacagcagaagctttggacaaagacggctggttacacaca
ggggacattggaaaatggttaccaaatggcaccttgaaaattatcgaccggaaaaagcac
atatttaagctggcacaaggagaatacatagcccctgaaaagattgaaaatatctacatg
cgaagtgagcctgttgctcaggtgtttgtccacggagaaagcctgcaggcatttctcatt
gcaattgtggtaccagatgttgagacattatgttcctgggcccaaaagagaggatttgaa
gggtcgtttgaggaactgtgcagaaataaggatgtcaaaaaagctatcctcgaagatatg
gtgagacttgggaaggattctggtctgaaaccatttgaacaggtcaaaggcatcacattg
caccctgaattattttctatcgacaatggccttctgactccaacaatgaaggcgaaaagg
ccagagctgcggaactatttcaggtcgcagatagatgacctctattccactatcaaggtt
tag

>gi568815594r:184595291_184834062|GENSCAN_predicted_peptide_7|144_aa
MLLVRSPRRPRPPAAPPRPQAPPPAGPAPPSARPPSCLGARGWRVAGGREAAAGERRWSR
PDCLGFHNESEEGQRSLTWAAPAARAGAGELRADCALHHGGHLLCGSEGFRGSEGFRVPA
MPILGLQKPTRWFSGSHPKGCSSF

>gi568815594r:184595291_184834062|GENSCAN_predicted_CDS_7|435_bp
atgctgctggtgcgctcgccgcgccggccccgccctcccgcggccccgccccgcccgcag
gccccgcccccggcaggccccgccccgccgagcgcccggccgccctcctgtctgggcgcg
cgtggctggcgggtggccggcggtcgagaggctgcggcgggcgaacgccgctggagtcgc
ccagactgcctcggatttcataatgaatctgaggaaggacaaaggagcctgacgtgggca
gcgcccgcagctcgcgccggtgcaggggagctgagggccgactgcgcgctgcaccacggc
ggccacctgctgtgcggttctgagggcttccgcggttctgagggcttccgcgtccctgcc
atgcccatcttggggcttcagaagccaacacgatggttttcgggctcacatcccaaaggc
tgctcttccttttag