GENSCAN 1.0	Date run:  4-Nov-116	Time: 17:42:36

Sequence gi568815583r:34881362_35083434 : 202073 bp : 40.62% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.17 Intr -   1278   1141  138  0  0  115   98   159 0.997  19.34
 1.16 Intr -   3373   3164  210  1  0  103   94   116 0.999  11.89
 1.15 Intr -   5300   5165  136  1  1   37  110    96 0.911   6.35
 1.14 Intr -   8963   8854  110  2  2   87   99   190 0.996  18.16
 1.13 Intr -  12412  12302  111  1  0   97   99    78 0.993   9.46
 1.12 Intr -  15605  15536   70  1  1   67   95    87 0.999   5.57
 1.11 Intr -  16344  16198  147  2  0   29   43   185 0.988   6.53
 1.10 Intr -  19502  19261  242  2  2   85  103   187 0.980  15.33
 1.09 Intr -  23144  22975  170  0  2   66   86    37 0.952   0.04
 1.08 Intr -  23421  23293  129  1  0   34   92   101 0.950   4.85
 1.07 Intr -  25351  25184  168  2  0   98   56   200 0.988  16.80
 1.06 Intr -  28952  28774  179  1  2   87  115   168 0.997  18.04
 1.05 Intr -  33818  33677  142  0  1  106   49   112 0.981   7.59
 1.04 Intr -  37017  36897  121  0  1  105   86   154 0.954  16.05
 1.03 Intr -  49010  48897  114  2  0   61   91    87 0.983   6.02
 1.02 Intr -  51073  50957  117  1  0   82   68    73 0.952   4.44
 1.01 Init -  53263  53210   54  1  0   82   94    39 0.780   5.23
 1.00 Prom -  58092  58053   40                              -6.55

 2.00 Prom +  59440  59479   40                              -4.25
 2.01 Sngl +  61719  62036  318  2  0   87   42   200 0.981  11.02
 2.02 PlyA +  62060  62065    6                               1.05

 3.10 PlyA -  64294  64289    6                               1.05
 3.09 Term -  65754  65660   95  1  2   82   49    76 0.635   0.11
 3.08 Intr -  65940  65850   91  0  1   45   74    36 0.003  -3.45
 3.07 Intr -  82929  82873   57  0  0   84  115    19 0.385   2.36
 3.06 Intr -  88433  88178  256  1  1   29   99   300 0.878  21.72
 3.05 Intr - 101822 100002 1821  1  0   39  -52   645 0.000  31.97
 3.04 Intr - 112033 111906  128  1  2   32   93   125 0.233   5.96
 3.03 Intr - 119418 119313  106  2  1   33  113    91 0.184   5.40
 3.02 Intr - 134241 134127  115  1  1  130   78    76 0.446   9.39
 3.01 Init - 135369 135324   46  0  1   64   98   -21 0.440  -2.59
 3.00 Prom - 136279 136240   40                              -9.45

 4.04 PlyA - 136528 136523    6                               1.05
 4.03 Term - 138273 138112  162  0  0    5   37   253 0.545   8.85
 4.02 Intr - 157932 157848   85  2  1   37  100    99 0.043   4.90
 4.01 Init - 163006 162951   56  1  2  113  100    13 0.023   3.82
 4.00 Prom - 171156 171117   40                              -4.95

 5.00 Prom + 176249 176288   40                              -7.65
 5.01 Init + 177846 177929   84  2  0   71   59   101 0.920   6.37
 5.02 Term + 180299 180490  192  1  0   80   38   155 0.992   6.14
 5.03 PlyA + 181349 181354    6                               1.05

 6.02 PlyA - 181889 181884    6                               1.05
 6.01 Sngl - 194777 194604  174  2  0   75   55   204 0.984  10.44

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  90520  90492   29  1  2   79   55    42 0.843  -0.88
S.002 Sngl - 163671 163507  165  0  0   75   48   185 0.910   7.63

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:34881362_35083434|GENSCAN_predicted_peptide_1|786_aa
MDKVHYCERFIELMIDLEALLPTRRWFNTILDDSHLLVHCYLSNLVRREEDGHLFSQLLD
MLKFYTGFEINDQTGNALTENEMTTIHYDRITSLQVSRHERRISQIQQLNQMPLYPTEKI
IWDENIVPTEYYSGEGCLALPKLNLQFLTLHDYLLRNFNLFRLESTYEIRQDIEDSVSRM
KPWQSEYGGVVFGGWARMAQPIVAFTVVEVAKPNIGENWPTRVRADVTINLNVRDHIKDE
WEGLRKHDVCFLITVRPTKPYGTKFDRRRPFIEQVGLVYVRGCEIQGMLDDKGRVIEDVE
CVYKLLQVMVLHRKHYMTAMEGNSKNLEKEENTFSILEFEVGPEPRPNLRGESRTFRVFL
DPNQYQQDMTNTIQNGAEDVYETFNIIMRRKPKENNFKAVLETIRNLMNTDCVVPDWLHD
IILGYGDPSSAHYSKMPNQIATLDFNDTFLSIEHLKASFPGHNVKVTVEDPALQIPPFRI
TFPVRSGKGKKRKDADVEDEDTEEAKTLIVEPHVIPNRGPYPYNQPKRNTIQFTHTQIEA
IRAGMQPGLTMVVGPPGTGKTDVAVQIISNIYHNFPEQRTLIVTHSNQALNQLFEKIMAL
DIDERHLLRLGHGEEELETEKDFSRYGRVNYVLARRIELLEEVKRLQKSLGVPGDASYTC
ETAGYFFLYQVMSRWEEYISKVKNKGSTLPDVTEVSTFFPFHEYFANAPQPIFKGRSYEE
DMEIAEGCFRHIKKIFTQLEEFRASELLRSGLDRSKYLLVKEAKIIAMTCTHAALKRHDL
VKLGFK

>gi568815583r:34881362_35083434|GENSCAN_predicted_CDS_1|2358_bp
atggacaaagttcattactgtgaaagattcattgaacttatgattgatctagaggccctg
ctacccacaaggcgctggtttaataccatcctggatgattcccaccttctggttcactgt
tacctttccaatcttgttcgtagagaagaggatggccatcttttttcccagcttttggac
atgcttaaattctatactggttttgaaattaatgaccaaactggaaatgctctgacagag
aatgagatgaccacaattcactatgatagaattacttctctacaggtatctcgtcatgaa
cgtcgaatttctcagattcagcagttgaaccagatgcctttgtatccaactgagaaaatt
atatgggatgaaaatattgtcccaactgagtactattctggagaaggttgtcttgctctt
cccaaattgaatttgcagtttttgactcttcatgactacctgctaaggaactttaacctc
ttccgcttagaatcaacttatgaaattcgtcaggacattgaagatagtgtcagcagaatg
aagccatggcaatctgaatatggcggtgtagtgtttggtggttgggcgcgaatggcccag
cccattgtggctttcactgtcgttgaagtggccaaacccaacataggtgaaaactggcca
acccgagttcgtgcagatgttaccataaatctcaatgtcagagatcacatcaaagatgaa
tgggaaggtcttcgtaagcatgatgtatgctttttaattaccgtacgtcccacaaaacct
tatggcactaagtttgaccggaggagaccttttattgagcaggttggcctggtttatgtc
agaggctgtgaaattcagggcatgctggatgataaaggacgtgtcattgaagatgttgag
tgtgtgtacaaattattgcaagtcatggtgttgcatcgtaaacattacatgactgctatg
gaaggaaatagtaagaatttagagaaagaagaaaatactttcagcatcttagaatttgaa
gtgggacctgaacccagacccaatcttagaggagaatcaaggacatttagagtgtttttg
gatccaaaccagtatcaacaagatatgaccaatactatacaaaatggagcagaggatgtg
tatgaaacttttaatataataatgaggagaaaaccaaaggaaaataactttaaggctgtg
ctggagactattcggaacctgatgaatactgattgtgtggtacctgactggctgcacgat
atcattttaggttatggggacccaagtagtgcacattattcgaaaatgcccaatcagatt
gccacccttgatttcaatgatacatttctctccattgagcatttaaaagccagcttccct
ggtcataatgttaaagtaactgtagaagaccctgctctacaaataccccctttcaggata
acttttccagtaagaagtggaaaagggaagaaaaggaaagatgcggatgtggaagatgaa
gacaccgaggaagcaaaaaccttaattgttgagccccatgttattcctaataggggtcct
tatccttataatcaacccaaacgtaatacgattcagttcactcatacacagatagaagcc
atccgtgctggaatgcagcctgggctgactatggttgtgggcccacctggtacaggcaaa
acagatgtggcagttcagatcatatccaacatctaccacaacttcccagaacagaggact
ctaattgttactcattccaatcaggccctaaaccagttgtttgagaaaatcatggcatta
gacattgatgagcgccacctactgcgtcttggtcatggagaagaagagctggagacagag
aaagatttcagcaggtatggaagagttaattatgttctggctcgaagaatagaactttta
gaagaagtcaaacgattgcaaaagagtctaggggttccaggagatgcctcatatacctgt
gaaactgcaggctatttcttcttataccaggtaatgtctcgctgggaagagtatatcagc
aaagtgaaaaataaaggtagtacattgccagatgttacggaagtctccactttcttccct
ttccatgaatactttgcaaatgctcctcaacccatttttaaaggaagatcttatgaagaa
gacatggaaattgctgaaggatgtttcaggcatattaagaaaatctttacgcagcttgag
gaattcagagcctctgaattgcttcgaagtggactggacagatctaaataccttttagtg
aaagaagccaaaattattgctatgacctgtactcatgctgccttaaaacgacatgacttg
gtcaagctaggtttcaag

>gi568815583r:34881362_35083434|GENSCAN_predicted_peptide_2|105_aa
MVNVPKTRRTFCKKCGKHQPHKVTQYKKGKDSLYAQGKRRYDRKQSGYGGQTKPIFWKKA
KTTKIVLRLECIEPNCRSKRMLAIKRCKHFELGGDKKRKGQVIQF

>gi568815583r:34881362_35083434|GENSCAN_predicted_CDS_2|318_bp
atggttaacgtccctaaaacccgccggactttctgtaagaagtgtggcaagcaccaaccc
cataaagtgacacagtacaagaagggcaaggattctctgtatgcccagggaaagcggcgt
tatgacaggaagcagagtggctatggtgggcaaactaagccaattttctggaaaaaggct
aaaactacaaagattgtgctaaggcttgagtgcattgagcccaactgcagatctaagaga
atgctggctattaaaagatgcaagcattttgaactgggaggagataagaagagaaagggc
caagtgattcagttctaa

>gi568815583r:34881362_35083434|GENSCAN_predicted_peptide_3|904_aa
MWENMSGVCMWSQSWGVDYTAQNVVKGQLCMVFGVVDRAWICGVESEHLQVSVSHGAKHV
SGPGLGEAKGEHLAGVEGTVGWMLPKASKAVGKVDPLSGYKGCIQWCREVGLDSKDEQVR
SKPYGGFLPASSICQRHFKNLKTFVKHQQLHNETYQNNVKQVRRLLEAKQEKSMYGVYNT
FTTEERWALHPCSKSDPMYSMKRRKNIHACTICGKMFPSQSKLDRHVLIHTGQRPFKCVL
CTKSFRQSTHLKIHQLTHSEERPFQCCFCQKGFKIQSKLLKHKQIHTRNKAFRALLLKKR
RTESRPLPNKLNANQGGFENGEIGESEENNPLDVHSIYIVPFQCPKCEKCFESEQILNEH
SCFAARSGKIPSRFKRSYNYKTIVKKILAKLKRARSKKLDNFQSEKKVFKKSFLRNCDLI
SGEQSSEQTQRTFVGSLGKHGTYKTIGNRKKKTLTLPFSWQNMGKNLKGILTTENILSID
NSVNKKDLSICGSSGEEFFNNCEVLQCGFSVPRENIRTRHKICPCDKCEKVFPSISKLKR
HYLIHTGQRPFGCNICGKSFRQSAHLKRHEQTHNEKSPYASLCQVEFGNFNNLSNHSGNN
VNYNASQQCQAPGVQKYEVSESDQMSGVKAESQDFIPGSTGQPCLPNVLLESEQSNPFCS
YSEHQEKNDVFLYRCSVCAKSFRSPSKLERHYLIHAGQKPFECSVCGKTFRQAPHWKRHQ
LTHFKERPQGKVVALDSVMNAWAVVRVPYGTLKAEDIRPGAFVRRGVKRPGSNLREVDQG
CRCPERSFSFHWSGGKSAAMAAPAQPKKIVAPTVSQINAEFVTQLACKYWAPHIKKKSPF
DIKHLFNKAHLAPPLIHSTLSGYSTCFTEHRVGDTATIRFLNLFPTFPLFLFYKTAIVIM
ARSQ

>gi568815583r:34881362_35083434|GENSCAN_predicted_CDS_3|2715_bp
atgtgggagaatatgagtggagtatgtatgtggtcccaaagttggggggtggattatact
gcccaaaatgtggtcaaggggcagctgtgtatggtgtttggagttgtggatagagcttgg
atatgtggggtagagtctgagcacttacaagtctctgtgagtcatggagctaaacatgtt
tctggcccgggacttggagaagctaaaggagaacatctggcaggagttgaaggaactgtg
ggctggatgttgcccaaggccagcaaggcagtgggcaaggtggaccctttgagtggttac
aaggggtgcattcagtggtgtcgagaggttggtcttgactctaaggatgaacaagtacgg
tcaaaaccctatgggggatttctgcctgcttccagtatttgtcagcgtcactttaaaaat
ctgaagacatttgtgaagcaccaacaacttcacaatgaaacctatcagaataatgttaaa
caggtcagaagattgctggaggccaagcaagaaaagtcaatgtatggagtgtataatact
tttaccacagaggaaagatgggcattacacccgtgctctaagtctgatcccatgtatagc
atgaaaagaagaaagaatattcatgcatgtacaatctgtggcaagatgtttccatcacag
tcaaaacttgataggcatgtacttattcatactggtcagaggccttttaaatgtgtcttg
tgtactaaatcttttcgacagtcaactcacttaaaaatccaccaacttacacattcagaa
gaaagaccttttcaatgttgtttttgtcaaaaaggatttaagattcaaagcaaacttctg
aagcataaacaaatccatactaggaataaggcttttcgggctcttttattaaagaagagg
cgtacagaatctcgccccctgcctaataagttaaatgcaaatcagggtggttttgaaaat
ggtgagattggtgaatctgaggagaataatccacttgatgtccactcaatttatattgtc
ccttttcaatgtccaaagtgtgaaaagtgttttgaatcagagcagattctcaatgaacac
agctgttttgctgctagaagtggcaaaattccaagcaggttcaaaagaagctacaactat
aaaaccattgttaaaaaaatcttggccaagcttaagcgtgctaggagtaaaaaattagat
aactttcaatctgagaaaaaagtatttaaaaagagtttcttgagaaattgtgatcttatt
tctggtgagcagagctctgaacaaacccagagaacatttgtgggttctcttggcaaacat
ggaacatataaaacaattggcaatagaaagaagaaaacattgactttgccattttcttgg
caaaatatgggaaaaaatttgaaaggcatccttacgacagaaaacatattaagcattgat
aattcagtgaataagaaagacttgtcaatctgtggttcatcaggtgaggaattctttaat
aactgtgaggtacttcagtgtggtttttcagttccaagggaaaacatacgtactagacat
aagatatgtccttgtgacaaatgtgagaaggtatttccttctatatccaaactaaaaaga
cactatttaattcatactggacagaggccctttggctgtaatatttgtgggaaatctttt
agacagtcagctcacttaaaaagacatgaacagactcataatgaaaagagtccttatgca
tctctttgccaagtagaatttggaaacttcaacaatctttctaatcattcaggtaataat
gttaactataatgcttcccaacaatgtcaggctcctggtgttcaaaaatacgaggtctca
gagtcagatcaaatgtcaggagttaaggcagagtcacaggattttattcctggtagcacc
gggcaaccctgtcttcctaatgtacttttggaatcagagcaaagcaatcctttttgcagt
tattcagagcatcaggagaaaaatgatgtcttcctgtaccgatgcagtgtttgtgctaaa
agtttccgatctccatctaaactggaaagacactacctaattcatgcagggcagaaacca
tttgaatgctcagtttgtggcaaaacattcagacaggctcctcactggaagagacatcag
cttactcactttaaagaacgaccacaagggaaagtggttgccttagattcggttatgaac
gcttgggctgttgtacgggttccctacgggactctgaaggctgaggatatccggcccgga
gcgtttgtgcggcgcggagttaagcgccccggcagtaacttaagggaagtggaccagggt
tgccgctgcccagagcggtcctttagtttccactggagtggagggaagagtgctgccatg
gcagcccctgcgcagcccaagaagatcgtggcccctacggtgtcccaaatcaatgcggag
ttcgtgacccagttagcatgtaaatactgggctccccacatcaagaagaaatcacctttt
gatataaagcatctgtttaacaaagcacatcttgcaccgcccttaatccattcaaccctg
agtggatacagcacatgtttcacagagcacagggttggggacacggcaaccatccgattt
ctcaatcttttccccacctttcccctctttctattctacaaaaccgccattgtcatcatg
gcccgttctcaatga

>gi568815583r:34881362_35083434|GENSCAN_predicted_peptide_4|100_aa
MAHASNLSTLGGQGGRITRPKTTEVADGIEITTITFLFGREFCVTDKLLQLEEEEEEQEE
EQEEEQEKKEEEKEEEEKKEEEEGGGGGGGRGRRRKGRKK

>gi568815583r:34881362_35083434|GENSCAN_predicted_CDS_4|303_bp
atggctcacgcctctaatctcagcactttgggaggccagggtgggcggatcacaagacca
aaaacaacggaagtagcagatggaattgagattaccactattacatttctgtttggccgt
gaattttgtgtaacagataagttactccagttggaggaggaggaggaggagcaggaggag
gagcaggaggaggagcaggagaagaaggaggaggagaaggaggaggaggagaagaaggag
gaggaggagggagggggaggaggaggagggagggggaggaggaggaaggggaggaagaaa
taa

>gi568815583r:34881362_35083434|GENSCAN_predicted_peptide_5|91_aa
MALGKYEEKEKQQALFDCTEAVCADFSEELTQRRRTVSTRCDFISDLTNQQLPTLQPPTH
QIILKNPDSRVFRETDLSTNKIPVSHTAGSA

>gi568815583r:34881362_35083434|GENSCAN_predicted_CDS_5|276_bp
atggctcttggcaaatatgaggaaaaggaaaagcaacaagccttatttgactgcactgag
gctgtctgtgcagacttttcagaggaactgactcagcgcaggaggacagtttcaactcgc
tgtgatttcatctctgacctgaccaatcagcagctccccactctccaaccccctacccac
caaattatccttaaaaaccctgattcccgagttttcagggagactgatttgagtactaat
aaaattccagtctcccatacagccggctctgcgtga

>gi568815583r:34881362_35083434|GENSCAN_predicted_peptide_6|57_aa
MAARLINEINVENVVVIEMRDDCGLNQKMAIEMEKMDGFKRYVKAGISKTSLIEYGA

>gi568815583r:34881362_35083434|GENSCAN_predicted_CDS_6|174_bp
atggcagcaagactgatcaatgagatcaatgtggagaatgttgtagtcatcgagatgaga
gacgattgtggtctgaatcagaaaatggccattgagatggagaagatggatggattcaag
agatatgtaaaagctggaatcagcaagacttccttgattgaatatggagcatga