GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:01:15 Sequence gi568815583f:24875357_25078320 : 202964 bp : 41.45% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2845 3136 292 2 1 45 21 232 0.523 7.47 1.02 Intr + 11160 11277 118 0 1 87 78 63 0.455 4.75 1.03 Intr + 14831 14963 133 1 1 69 91 59 0.387 3.60 1.04 Intr + 18296 18314 19 0 1 60 94 29 0.097 -4.55 1.05 Intr + 19793 19910 118 2 1 75 100 66 0.140 6.05 1.06 Intr + 27153 27201 49 2 1 62 73 62 0.101 -0.57 1.07 Intr + 27378 27542 165 1 0 106 31 160 0.076 11.11 1.08 Term + 27984 28324 341 1 2 60 47 128 0.866 -0.39 1.09 PlyA + 30336 30341 6 1.05 2.02 PlyA - 30576 30571 6 1.05 2.01 Sngl - 34316 33573 744 2 0 39 40 835 0.920 69.74 2.00 Prom - 39771 39732 40 -2.55 3.07 PlyA - 39778 39773 6 1.05 3.06 Term - 45721 45481 241 0 1 58 44 195 0.896 6.61 3.05 Intr - 58487 58349 139 0 1 75 95 76 0.101 5.60 3.04 Intr - 66382 66280 103 1 1 62 72 91 0.166 3.73 3.03 Intr - 67581 67377 205 2 1 0 70 112 0.191 -1.12 3.02 Intr - 68074 67913 162 0 0 101 17 130 0.175 5.37 3.01 Init - 74157 74150 8 0 2 103 91 0 0.664 2.35 3.00 Prom - 76691 76652 40 -3.75 4.02 PlyA - 76856 76851 6 1.05 4.01 Sngl - 79928 79362 567 2 0 54 43 334 0.934 21.40 4.00 Prom - 82137 82098 40 -5.05 5.00 Prom + 87319 87358 40 -4.65 5.01 Init + 89322 89333 12 2 0 76 75 5 0.235 -1.68 5.02 Intr + 99002 99100 99 1 0 69 103 91 0.937 8.09 5.03 Intr + 100002 100153 152 2 2 99 80 91 0.995 7.44 5.04 Intr + 100949 101060 112 2 1 64 67 129 0.999 7.86 5.05 Intr + 101521 101673 153 0 0 78 94 169 0.999 15.85 5.06 Intr + 102422 102560 139 1 1 101 110 133 0.983 15.92 5.07 Intr + 102837 102962 126 1 0 82 105 96 0.998 10.43 5.08 Intr + 145234 145405 172 2 1 89 30 102 0.285 2.58 5.09 Intr + 150452 150572 121 2 1 40 75 85 0.021 1.98 5.10 Intr + 165447 165491 45 2 0 69 83 65 0.017 1.79 5.11 Term + 174889 175269 381 0 0 81 38 198 0.803 7.95 5.12 PlyA + 175462 175467 6 1.05 6.02 PlyA - 175718 175713 6 -3.74 6.01 Sngl - 177224 176862 363 2 0 69 42 330 0.662 22.33 6.00 Prom - 179613 179574 40 -8.35 7.13 PlyA - 181911 181906 6 -0.45 7.12 Term - 182592 182284 309 0 0 35 49 319 0.132 16.78 7.11 Intr - 183086 182866 221 0 2 77 24 109 0.048 0.50 7.10 Intr - 185394 185118 277 0 1 27 75 133 0.129 2.07 7.09 Intr - 186788 186547 242 0 2 66 25 232 0.952 11.05 7.08 Intr - 188165 187905 261 0 0 31 75 123 0.353 1.84 7.07 Intr - 189632 189304 329 1 2 -8 1 233 0.243 -0.58 7.06 Intr - 190503 190392 112 1 1 84 68 64 0.710 2.52 7.05 Intr - 190911 190630 282 1 0 -8 75 271 0.785 12.67 7.04 Intr - 192651 192501 151 0 1 62 72 94 0.715 4.01 7.03 Intr - 195712 195550 163 0 1 80 91 61 0.695 4.66 7.02 Intr - 196743 196456 288 2 0 38 49 161 0.140 2.64 7.01 Init - 199112 199018 95 2 2 91 111 71 0.352 9.60 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 174006 174065 60 2 0 83 84 42 0.813 4.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:24875357_25078320|GENSCAN_predicted_peptide_1|411_aa XSSGVFAQVSVAGLGLLKPHGSHCYPHGYYRNHRCHAARKKCATVLSGRQTRSALREGHL TVGENGSPQGPIFGAPAQRRRNWGAFSPSGFRFRAPRGYFDEGNWDSHQVSTQLAATWGT SPGKDCPCSTYVPVSKMVIKELSALPVLKVGHNGTLISEEFCPIPGYQQRSEQTGLMNFS PGFGQEERDKDLFLWVKRGSSHMRVLLLASAESQKALPRFYDLFQRKWVLGLADFKNEAV HPGSGFMVSPASGVKLQTFAVLQALQARKGSVDPKSEQQQDLLQRAKEQCFHSVERDPMS EKWLTQSFGFRERVYTGKSVEGLAEKKQKQKQNILRAKGNGCKENGQKLASFSSSTHTTL CRIPNQSLNVSAQVPFMLSNGNFIWFIGFRIVEPSGIKKIKGHEKVYRMLV >gi568815583f:24875357_25078320|GENSCAN_predicted_CDS_1|1236_bp nagagctctggcgtattcgctcaagtttccgtggcgggtctagggctcctgaagccccac gggtctcactgctacccacatggctattaccgcaaccaccgctgccacgcagcccgcaag aaatgtgccaccgttctgagcggccgacaaacccgcagtgcgctacgggaaggtcacctg acagttggggagaacggcagcccgcaggggccgatcttcggcgcgcctgcacagcgcagg cggaattggggcgccttttctccttccggtttccgtttccgtgcgccccgaggatacttt gacgaaggtaattgggactcccatcaagtctccacacagctagcagccacgtggggcact tctccaggtaaggattgtccctgttccacttatgttcctgtcagtaaaatggtcataaaa gaattatctgccctacctgtcctgaaagtaggtcataatgggaccctcatttcagaggag ttctgccctatccctggataccaacaaagatctgaacaaacaggtcttatgaatttctcc ccaggttttggtcaagaggaaagagataaagatcttttcctctgggtaaagcgtggcagc tctcacatgagggttttattacttgcttcagcagaaagtcaaaaagcccttcccaggttt tatgacctctttcagcgaaagtgggttcttggtcttgctgacttcaagaatgaagccgtg caccctggcagtgggttcatggtctcgccggcttcaggagtgaagctgcagacctttgcg gtgttacaagcgttacaagctcgtaaaggtagtgtggacccaaagagtgagcagcagcaa gatttattgcaaagagcaaaagaacaatgcttccacagcgtggaaagggaccccatgtct gaaaagtggttaacccaaagttttggctttagagaaagagtctatacaggcaaaagcgta gagggtttggcagagaaaaaacaaaaacaaaaacaaaacatcctaagggccaaaggaaat ggttgcaaggaaaacggtcaaaaacttgcatcattctcttcatccactcacaccactcta tgcaggatcccaaatcagtccttgaatgtttcggcacaggttccattcatgctttccaat ggaaacttcatttggtttataggattcaggattgtggaaccatccggtatcaagaaaata aaaggccatgaaaaagtttacagaatgttggtataa >gi568815583f:24875357_25078320|GENSCAN_predicted_peptide_2|247_aa MIVRVTNRDIICQIAYAHIEGDMIVCTAYAHELPKYGVKVGPTNYAAAYCTGLPLARRLL SRFGMDKIYEGQVEVTGDEYIVESIDGQPSAFTCYLDAGLARTTTGNKVFGALKGAVDGG LSIPHSTKQFPGYGSESKEFNAEVHRKHIMGQNVADYMCYLMEEDENAYKKQFSQYIKNR VTPDMMEEMYKEAHAAIRENPVYENKPKKEVKKKRWNHPKMSLAQKKDWVAQKNASFLRA QKQAAES >gi568815583f:24875357_25078320|GENSCAN_predicted_CDS_2|744_bp atgatagttcgtgtaacaaacagagatatcatttgtcagattgcttatgcccatatagag ggggatatgatagtctgcacagcatatgcacacgaactgccaaaatatggtgtgaaggtt ggcccgacaaattatgctgcagcatattgtactggcctgccgctggcccgcaggcttctc agtaggtttggcatggacaagatctatgaaggccaagtggaggtgactggcgatgaatac attgtggaaagcattgatggtcagccaagtgcctttacctgctatttggatgcaggcctt gccagaactaccactggcaataaagtttttggtgccctgaagggagctgtggatggaggc ttgtctatccctcacagtaccaaacaattccctggttatggttctgaaagcaaggaattt aatgcagaagtacaccggaagcacatcatgggccagaatgttgccgattacatgtgctac ttaatggaagaagatgaaaatgcttacaagaaacagttctctcaatacataaagaacagg gtaactccagacatgatggaggaaatgtataaggaagctcatgctgctatacgagagaat ccagtctatgaaaataagcccaagaaagaagttaaaaagaagaggtggaaccatcccaaa atgtcccttgctcagaagaaggattgggtagctcaaaagaatgcaagcttcctcagagct cagaagcaggctgctgagagctaa >gi568815583f:24875357_25078320|GENSCAN_predicted_peptide_3|285_aa MPSCTYGSWGVPFNQMMEKEKTQAWFTDGSTKHAHTTQKWTTAALQSLSEDLPEEECQAH FSVIAQWDYKRSGHDGRDEGYAWVWPHELPFTKTNLTMATTECPIYPQQRPTLSLHYGTI PHSDQAPHAFESTKKGIVVLASATHPDYQGETGLLLHKGTCGLPRILPEDCLDENPSMNA MYPLLNGPAHLVDPTPCLLFCFSLYLICITPDKSWTVTDPSEKWAEVDGKYHTPPAHTLS IIIYLEGYYTWVTLRNKPLTAADLKHRHKAIAMQMMSELRFTAGT >gi568815583f:24875357_25078320|GENSCAN_predicted_CDS_3|858_bp atgcccagctgcacctatggctcatggggggttcccttcaatcagatgatggaaaaagag aagactcaggcctggtttacagatggttctacaaaacatgcacataccacccagaagtgg acaactgctgcactacagagcctttctgaagacctccctgaagaagagtgtcaggctcat ttttctgtcattgcccaatgggattataaacgaagtggccatgatggcagggatgaaggt tacgcatgggtttggccacatgaactcccattcaccaagaccaacctaactatggccacc actgagtgcccaatctaccctcagcagagaccaacactgagtctccactatggtaccatt ccccacagtgatcaggctcctcatgcctttgaatcaacaaagaagggaattgttgtgctg gctagtgccactcatcctgattaccaaggagaaactggacttctgcttcacaaaggaacc tgtggacttccccgtatcttaccagaagactgccttgatgaaaatccttcaatgaatgcc atgtaccctctactcaatgggcctgcccacctggtagaccccacaccttgtctcctcttc tgtttctccttatacctaatttgtattacaccagacaaatcatggactgtcactgaccca tcagaaaaatgggctgaagtggatggaaaatatcatacacctccagcacatactctttca atcattatctacttggaagggtactacacctgggtaactctccgaaacaaacctttaaca gcagctgacctaaaacacagacacaaagcaatagctatgcaaatgatgtctgagcttcgc ttcactgctgggacttaa >gi568815583f:24875357_25078320|GENSCAN_predicted_peptide_4|188_aa MGDAPQCERTGYHRGRTAQQQASEHSGSGSPEQRTLGPKFRLFSTPSPKNLEYLMNKSGR SPGCLLREATGTADLARSIASLTAPQTDASGISGGRSTLRQTRCSSGRLRTHPRLSMRAS LPLRPRRRACLPQCRGPSSLPHRNDLGGGGYWTPRAPQHCCNERGPLETISNLGSMDMST CFLKCVKY >gi568815583f:24875357_25078320|GENSCAN_predicted_CDS_4|567_bp atgggggacgcgccccaatgcgagcggacaggataccatcggggcagaacggcacaacag caagcctctgaacattccggatctggttctccagaacaaaggactttagggcccaaattc cgtttattcagtactccaagtcctaaaaacttggaatatctgatgaataaaagtggccgc tccccaggctgtctcttgagagaagccaccggcacagctgaccttgcccgctccatcgcg tcactgaccgctcctcagacagatgcgtcaggcatctccggcggccgctccactctgcgc cagactcgctgcagcagcggcaggcttcgcacacatccccgcctgagcatgcgcgccagc ctgcctctgcggccgcgcaggcgtgcttgtttgccgcagtgcaggggtcccagctccctc cctcaccggaatgacctggggggagggggctactggacccctagggccccacagcactgt tgcaatgagagggggcctctagaaaccataagcaacctgggatcaatggacatgtctacc tgttttttaaaatgtgtaaaatactaa >gi568815583f:24875357_25078320|GENSCAN_predicted_peptide_5|503_aa MVVEENLRHTFIYSLPLGLQKHQVLTVDIGFGGTAIMTVGKSSKMLQHIDYRMRCILQDG RIFIGTFKAFDKHMNLILCDCDEFRKIKPKNAKQPEREEKRVLGLVLLRGENLVSMTVEG PPPKDTGIARVPLAGAAGGPGVGRAAGRGVPAGVPIPQAPAGLAGPVRGVGGPSQQVMTP QGRGTVAAAAVAATASIAGAPTQYPPGRGTPPPPVGRATPPPGIMAPPPGMRPPMGPPIG LPPARGTPIGMPPPGMRPPPPGIRGGDHVDNRIIDKWWFHLQHHRNAVVTASFWDLLASL LLKGHTFYRSTSRNRSDLLFRKNCAAGHVLIQEEMWSNSTEEFILSKFKSERNPYAHTLK MHFAAGLEDVTEFQGREDQAGHPVIKGDITFGCPEGMVQAPGLSGGEEAHVLRSFGYMGK TVSGLDAEFQTDVLAVVMVLSSAGCGWFLARYTSPGTVAKDSVSSRHQKPHEEEHVLAPG VVAVRGGTSDLCLHGLRSEFTSY >gi568815583f:24875357_25078320|GENSCAN_predicted_CDS_5|1512_bp atggtggtggaggagaacctgcgtcatacctttatctatagccttcccctaggtcttcag aagcatcaagttttaactgtggacattggatttggtggaacagcaatcatgactgttggc aagagtagcaagatgctgcagcacattgactatagaatgagatgtatcctgcaagatggc cgaatcttcattggcacctttaaggcttttgacaagcatatgaatttgatcctctgtgat tgtgatgagttcagaaagatcaagccaaagaatgcgaagcaaccagagcgtgaagaaaag cgggttttgggtctggtgttgctgcgtggggagaacttggtatccatgactgtggagggg ccaccccccaaagatactggcattgctcgggtaccacttgctggagctgctggaggccct ggggttggtagggcagctggtagaggagtaccagctggtgtgccaattccccaggcccct gctggattggcaggccctgtccgaggagttgggggaccatcccagcaggtaatgactcca cagggaagaggcactgtagcagctgctgctgttgctgcgactgccagtattgctggagcc ccaacacagtacccaccaggacggggcactccgcccccacccgtcggcagagcaacccca cctccaggcattatggctcctccacctggtatgagaccacccatgggcccaccaattggg cttccccctgctcgagggacgccaataggcatgccgcctccgggaatgagaccccctcca ccaggcattagaggtggtgaccatgtggataataggatcatagacaaatggtggttccat ctacagcatcaccgaaatgctgttgttactgcttctttctgggatcttcttgcatctttg cttctaaaggggcacaccttttatagaagtacgtccaggaatagaagtgacctgcttttt cgcaaaaactgtgctgcaggacacgttttgattcaggaagaaatgtggagtaactccaca gaggaattcatcctttctaagtttaaatctgaacggaacccatatgcccataccctgaag atgcattttgctgcaggtctagaagatgtcacagaatttcaaggaagagaggatcaggca ggacatcctgtaataaaaggagacattaccttcggatgtcctgagggcatggtgcaggca ccaggactgagtggaggggaggaagctcatgtcttgaggtcctttggctacatggggaag actgtctctgggcttgatgctgagttccagactgacgtgctggctgttgtgatggtgctc tccagtgctggttgtggttggtttttggctaggtacacctcacctgggacagtggccaaa gacagtgtcagttccagacatcagaagccccatgaggaagagcatgtgctcgcgccagga gtggtggctgttaggggagggacctcagacctatgtctgcatggcctgaggtcagagttc acatcctactga >gi568815583f:24875357_25078320|GENSCAN_predicted_peptide_6|120_aa MDSSKCDEYLRHRLGPEPGSGHIHSGPSGLKLLSKQERGSRAVFAHTSCSRWSRPIMAGQ ETIYRKICTMEEPITLLGRLEVTQGHGHQLDPRAPQDLASKTPAGQPVFHQKNPWEAHHP >gi568815583f:24875357_25078320|GENSCAN_predicted_CDS_6|363_bp atggatagctccaagtgtgacgagtacctacggcacagacttgggccagagccagggtca gggcacattcactctggaccttcaggactgaagctgctttccaagcaggaaagaggatcc agagcagtatttgctcatacaagctgcagcaggtggagccgcccaataatggcgggccaa gagaccatttataggaagatttgcacaatggaggagcccattactctcctgggaagactg gaagtgacccagggccacgggcaccaactcgatcctcgtgctccccaggacctggcctcg aagacccctgctggccaacctgtcttccaccagaaaaatccatgggaagcccatcacccc tga >gi568815583f:24875357_25078320|GENSCAN_predicted_peptide_7|909_aa MQRWATGYRLPVAPAGLPSPAHWQDATPPSHSQTTSRNSFLHSTQSTYAHVPSGCLKRMA ASPTGRPRLMPQKPSGQDTPPNIVLALVTSQGFPALTSEAMSHRAVSLTAFPPTTMPASQ HHLNTSTRPLPLHYEQLTAQEAKKSLPGRPCDQPWKESQRAFGYLLGHPLLFPINLALPC PQRKCSSHACDRRFSSVNALGGGTHGNQWKLSLAPTVPRGGNVLDLSSDENDGCSRIQYW NIANTQRRGGIPNQETLHGSAPGIAHVLPGMHKDLHEYAKCDWTPSESPRPGPGPGHIPS GRLWDGSCFQSGKWDPEQYLLTRAAAGPALEDPQWPTCHPPEKSLESPSPPNDSPRPLMA CMSTLGSVRADDLKGLDNALNGQLKNFLVGLRNSKDTLTHHAGQRNAYCPEPSTSTGCTV DARPHRLHDSMEIPLLGRTDHLSCQQAVGLSRQLPLKSVKKSNARLRNRPTSTGTFGVQH WNIASTQSHGDMPSQDSLWQCPWDCKHTAQNAHRSPRVASCMTGAQRRVPGQGQGTFALD TFGMESGNDTEQYLLTRAAAGLRNSKGTITQHTGQKNAYRPGPPASTGCTVDARPHQIQE PAGPVSQKHRPSSLTAGSGPPTATVPEKCKEIKCMLEKQAQGVAESNTGTLPTCRGVEAS QVKRLSVATPLGLHTHCPEWTRTSTDSPMHDWNPREGPGPGPGHIPSGCLWDGCCFQSKK KDQEQYLLTQAAAGHPKGMAAFPMGHPCLTPQTPGRQDTVPSIILASITLKGLPCTDLRR NIPPRSQLGCRTILPNTCSTTSDNHHQTPMDSSKCDRIPQESSEPGPGPGHIPSESLQDG SHFQSRTEDPKQLFAHKSCSMWGCSAMEGQETISRKMCTTVEPTILPGRPEVTEGHGLQP NTHAPWICP >gi568815583f:24875357_25078320|GENSCAN_predicted_CDS_7|2730_bp atgcagaggtgggcaactggataccgccttcctgtggcacctgcaggtttaccttctcct gcacactggcaggatgctaccccaccttctcacagccaaacgacctcaagaaacagcttc ctccactccactcagtccacgtacgcacacgtgccctcaggatgcctgaagagaatggct gcttctccaacaggacgtccacgcctgatgccccaaaagccgagtggacaagacacacca cctaacattgtccttgcactggtcacctcacagggcttccctgcactgacctcagaggca atgtcccatcgtgcagtcagcttgaccgcattcccacccacaacaatgcctgcaagtcaa caccacctcaacaccagcaccaggcctctgccattgcactacgagcagctcacagctcag gaggccaagaaatccctgccagggaggccctgtgaccaaccctggaaggagagccaaaga gcctttggatatctgctgggacacccattgctcttccccatcaacttggcactcccttgt cctcagaggaaatgttcctcacatgcatgtgataggagattttcatctgtaaatgccctg ggtggtggaacccatgggaaccagtggaagttgtccctcgctcctacagtccccagagga ggcaacgtgctggacctcagttctgatgagaacgacggatgtagcagaatccaatactgg aacatcgccaacacacaaaggcgaggaggcatcccaaatcaagagactctccacggcagt gcccctggtattgcacatgtgttgcctggaatgcacaaggacctccatgaatacgccaag tgtgactggaccccatcggagagtcccaggccagggccagggccagggcacattccctct ggacgcctttgggatggaagctgctttcaaagcgggaaatgggatccagagcagtatttg ctcacacgagctgcagcaggacctgcccttgaagacccccaatggccaacctgtcatcca ccagagaaatccctggaaagcccatcaccaccaaatgacagcccaagaccactgatggca tgcatgtccacgctgggcagtgtgcgagctgatgaccttaaaggcctcgacaatgccctt aatggccagctgaaaaacttccttgtgggactcagaaattctaaagacaccctcactcat catgcggggcagaggaatgcttactgtcctgagccatccaccagcacagggtgcactgtg gatgcgaggccacaccgactccacgactccatggagattccactgctcggaagaactgac catctgtcatgccagcaggcggtgggcctctcaaggcaactgcccctgaaaagtgtaaag aaatcaaatgcacgcttgagaaataggcccacgtccactgggacttttggagtccaacac tggaacatcgccagcacacagtcacatggagacatgccaagtcaagactctctgtggcaa tgcccctgggattgcaaacacactgctcagaatgcacatagatctccacgggtagcctca tgcatgactggagcccaaaggagagtcccaggccagggacagggcacattcgctctggac acttttgggatggaaagcgggaatgatacagagcagtatttgctcacgagagcagcagca ggtctcagaaattctaaaggcaccatcactcagcacacggggcagaaaaatgcttaccgt cctgggccacccgccagcacggggtgcaccgtggatgccaggccacaccaaatccaggaa cccgcagggcctgtttctcagaagcatcgaccatcttccttgacagcaggcagcggacct cccacggcaactgtccctgagaagtgtaaagaaatcaaatgcatgcttgagaaacaggcc cagggagtagcggaatccaacactggaacattgccaacatgcagaggcgtggaggcatcc caagtcaagagactttccgtggcaacacccctgggattgcacacgcattgcccggaatgg acacggacctccacagacagccccatgcatgattggaacccacgggagggtcctgggcct gggccagggcacattccctctggatgcctttgggacggatgctgctttcaaagcaagaaa aaggatcaagagcagtatttgctcacacaagctgcagcaggacacccaaagggaatggct gcttttcccatgggacatccctgcctaacgccccaaacaccaggcagacaggacacagtg ccttccatcatccttgcatcgatcacactcaaaggccttccttgcaccgacctccgacgc aacatcccaccacgcagccaacttggatgcaggaccatcctccccaacacctgctcaacg acatctgacaaccaccaccagacccccatggacagctccaagtgtgaccgaatcccacag gagagttccgagccagggccagggccagggcacattccctctgaaagtcttcaggatgga agccactttcaaagcaggacggaagatccaaagcagttatttgctcacaagagctgcagc atgtggggctgctcagcaatggagggccaagagaccatttccaggaagatgtgcacaacg gtggagcccactattctgccgggaagaccagaagtgaccgagggccacgggctacagccc aacactcatgctccttggatctgcccttga