GENSCAN 1.0    Date run: 8-Nov-116    Time: 06:02:30

Sequence gi568815583f:66195098_66433309 : 238212 bp : 45.00% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7512   7698  187  1  1   95  111    74 0.596   9.15
 1.02 Intr +   8441   8597  157  2  1   75   77     9 0.041  -1.49
 1.03 Intr +  15735  15863  129  2  0   76   23   166 0.887   9.79
 1.04 Intr +  17876  18011  136  1  1   49   93   102 0.911   6.94
 1.05 Intr +  36683  36711   29  0  2   93   71    36 0.044   0.23
 1.06 Term +  40714  40782   69  0  0  126   48    60 0.171   3.74
 1.07 PlyA +  42715  42720    6                               1.05

 2.00 Prom +  43966  44005   40                              -5.76
 2.01 Init +  57369  57466   98  2  2   89   57   141 0.941  11.04
 2.02 Intr +  57955  58078  124  1  1   40   73   104 0.971   4.69
 2.03 Intr +  60002  60187  186  1  0   78   33   130 0.803   6.39
 2.04 Term +  75975  76103  129  2  0  -29   29   379 0.999  18.48
 2.05 PlyA +  76323  76328    6                               1.05

 3.03 PlyA -  76733  76728    6                               1.05
 3.02 Term -  83946  83822  125  1  2   63   55   103 0.858   3.05
 3.01 Init -  85036  84979   58  1  1   78   60    34 0.509   1.07
 3.00 Prom -  87999  87960   40                              -2.36

 4.00 Prom +  97835  97874   40                              -5.96
 4.01 Init +  98500  98638  139  0  1  103   49   245 0.974  22.40
 4.02 Intr +  99891 100044  154  1  1   85   59    44 0.888   0.33
 4.03 Intr + 111727 111855  129  1  0   79   80    74 0.962   5.51
 4.04 Intr + 113612 113747  136  2  1   90   63   116 0.961   9.87
 4.05 Intr + 116627 116803  177  1  0   59   94   104 0.917   8.22
 4.06 Intr + 118942 119020   79  0  1   39   31    71 0.569  -4.38
 4.07 Intr + 119939 120118  180  0  0   52   99   213 0.835  18.64
 4.08 Intr + 123352 123521  170  2  2   77   68    70 0.954   3.57
 4.09 Intr + 125474 125635  162  1  0  109   71    93 0.995   9.87
 4.10 Intr + 127590 127837  248  2  2   75   90   228 0.984  17.96
 4.11 Intr + 130734 131267  534  0  0   64   18   573 0.967  39.94
 4.12 Intr + 133873 134027  155  1  2  100   20   186 0.766  12.72
 4.13 Intr + 134124 134302  179  1  2   71   71    89 0.915   5.14
 4.14 Intr + 136778 136923  146  1  2   78   95    65 0.497   5.28
 4.15 Term + 137954 138215  262  2  1   29   54   136 0.450  -0.70
 4.16 PlyA + 138684 138689    6                               1.05

 5.06 PlyA - 138836 138831    6                               1.05
 5.05 Term - 142084 141861  224  2  2   62   38   181 0.975   7.58
 5.04 Intr - 146259 146053  207  1  0   85   54    60 0.642   1.35
 5.03 Intr - 154026 153963   64  0  1   84   95    29 0.964   1.49
 5.02 Intr - 154340 154218  123  2  0   82  111    41 0.821   6.58
 5.01 Init - 157744 157718   27  1  0   89   95    34 0.562   3.66
 5.00 Prom - 158552 158513   40                              -4.16

 6.08 PlyA - 159803 159798    6                               1.05
 6.07 Term - 184778 184315  464  0  2   58   41   436 0.821  31.12
 6.06 Intr - 190259 190161   99  0  0   -2  110    73 0.519   0.58
 6.05 Intr - 191715 191553  163  0  1  104   77    91 0.570   9.15
 6.04 Intr - 192323 192121  203  0  2   83   61    48 0.196   0.60
 6.03 Intr - 195454 195404   51  2  0   71   99    20 0.138   0.28
 6.02 Intr - 199252 199189   64  1  1   40   97    54 0.146  -0.21
 6.01 Init - 237010 236831  180  1  0  100   94   105 0.500  11.52

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  90606  90704   99  2  0   28   38   156 0.973   2.73

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:66195098_66433309|GENSCAN_predicted_peptide_1|235_aa
XSLSNSSLIPNCPTNTECRGSPSLSGNFLRASGWISFRTGQFLSGLPVRTLLPSPEALAP
TGSLPAYKKDRIPRRTLTRGPQRETESESPSFDPLSTFAPFYPLHQNQPKQNSSKVIPSN
EYRQGHWFQEELDLSPQDKGKKQVTDGKKKLVVKDEFRASSPVDLRGRAKKLFGDIGKLW
TKDGQKGEAAGDEACKSLHWGSAVVTTWVMEEQVKESHRQASSLHIKDKDYPGSV

>gi568815583f:66195098_66433309|GENSCAN_predicted_CDS_1|708_bp
ntatctctgagcaacagctccctgatccccaactgtccaaccaatacagaatgtcgagga
agtccttccctttctggaaacttccttcgggcctctggctggatttccttcaggactggt
cagtttctttcaggactgccagtcaggactctgctcccctctccagaagctctggcgccc
acaggcagcttacctgcttacaagaaagacaggatcccaaggagaactctgaccagaggt
ccccagagagaaactgagtcagagagtcccagctttgaccccctaagcacctttgctcct
ttctaccctctccaccaaaatcagccaaaacaaaactcttcaaaggtaattcccagcaat
gaatatcgccagggccactggttccaggaagagctggacttgagccctcaggacaagggg
aagaagcaagtcacagatggcaagaaaaaactagtagtcaaggatgaatttagagcttcc
agtcctgtagatctgcgagggagggccaagaagctgtttggtgatatcgggaaattgtgg
acaaaggacgggcagaaaggagaggccgcaggagatgaggcctgcaaatccctgcactgg
ggaagtgctgtagtgacgacttgggtcatggaagagcaagtcaaggagagccacaggcaa
gcatcaagtctccatatcaaggacaaggactatcccggttcagtgtga

>gi568815583f:66195098_66433309|GENSCAN_predicted_peptide_2|178_aa
MTTRETVTLASQRGLRGTDPETRLPLDGARESSKSVHGTCWGLAAAGRIAGRRRKPAASP
QALPPDPGGEGAGAPSGVGREERRGVSVPMESTLRPDTTRILIVPPLISGARNFCGTQPE
AETEVQPEAEPLLSPWEEEEEEEEDEKEEEEEEEEEEEEEEEEEEEEEEEEEENSALA

>gi568815583f:66195098_66433309|GENSCAN_predicted_CDS_2|537_bp
atgaccactcgggagacagtcacactggcctcgcagcgcgggctgcggggcaccgatccc
gagacgcggctgccgctggatggagcccgcgagtccagcaaaagcgtgcatggcacttgc
tggggcttagcggccgcggggcgcatcgccggccgccgccgaaaacctgctgcgtccccg
caggctctgcctcccgaccccggcggggaaggcgccggtgcacccagtggagtcgggcgg
gaggagagacgtggggtctccgttcccatggagtccacgctgcggcccgacaccaccagg
attctcatagttccacccctaatttctggggccagaaatttctgtggtacacaaccagaa
gcagaaacagaagtacaaccagaagcagaacctctgctcagcccctgggaagaagaagaa
gaagaggaggaggacgagaaggaggaagaggaagaggaagaagaagaagaagaggaagag
gaagaagaagaagaagaggaagaagaagaggaagaagaaaattcagcattagcttag

>gi568815583f:66195098_66433309|GENSCAN_predicted_peptide_3|60_aa
MESVASMGAKAFWKSVKNVQTAVIQFQSHFLNCRLVVSTCMCSSLYSCPKDSTPVPDRIY

>gi568815583f:66195098_66433309|GENSCAN_predicted_CDS_3|183_bp
atggagagtgttgcctcaatgggggctaaagccttttggaagtcagtgaagaatgttcag
acagcagtgatccagttccagtcccacttcctcaactgccggctggtggtttccacctgc
atgtgctcatctctttactcatgccctaaggactcaacaccagtgcctgataggatctac
tga

>gi568815583f:66195098_66433309|GENSCAN_predicted_peptide_4|949_aa
MLQKREKVLLLRTFQGRTLRIVREHYLRPCVPCHSPLCPQPAACSHDGKLLSSDVTHYVI
PDWKVVQDYLEILEFPELKGIIFMQTACQAVQHQRGRRQYNKLRNLLKDARHDCILFANE
FQQCCYLPRERGESMEKWQTRSIYNAAVWYYHHCQDRMPIVMVTEDEEAIQQYGSETEGV
FVITFKNYLDNFWPDLKAAHELCDSILQSRRERENESQESHGKEYPEHLPLEVLEAGIKS
GRYIQGILNVNKHRAQIEAFVRLQGASSKDSDLVSDILIHGMKARNRSIHGDVVVVELLP
KNEWKGRTVALCENDCDDKASGESPSEPMPTGRVVGILQKNWRDYVVTFPSKEEVQSQGK
NAQKILVTPWDYRIPKIRISTQQAETLQDFRVVVRIDSWESTSVYPNGHFVRVLGRIGDL
EGEIATILVENSISVIPFSEAQMCEMPVNTPESPWKVSPEEEQKRKDLRKSHLVFSIDPK
GCEDVDDTLSVRTLNNGNLELGVHIADVTHFVAPNSYIDIEARTRYAVSIMWELDKASYE
IKKVWYGRTIIRSAYKLFYEAAQELLDGNLSVVDDIPEFKDLDEKSRQAKLEELVWAIGK
LTDIARHVRAKRDGCGALELEGVEVCVQLDDKKNIHDLIPKQPLEVHETVAECMILANHW
VAKKIWESFPHQALLRQHPPPHQEFFSELRECAKAKGFFIDTRSNKTLADSLDNANDPHD
PIVNRLLRSMATQAMSNALYFSTGSCAEEEFHHYGLALDKYTHFTSPIRRYSDIVVHRLL
MAAISKDKKMEIKGNLFSNKDLEELCRHINNRNQAAQHSQKQSTELFQCMYFKDKDPATE
ERCISDGVIYSIRTNGVLLFIPRLEIISNKPYKIPNTELIHQSSPLLKSELVKEVTKSVE
EAQLAQEVKVNIIQEEYQEYRQTKGRSLYTLLEEIRDLALLDVSNNYGI

>gi568815583f:66195098_66433309|GENSCAN_predicted_CDS_4|2850_bp
atgctgcagaagcgggagaaggtgctgctgctgaggaccttccagggccgcacgctgcgg
atcgtgcgcgagcactacctgcggccctgcgtgccctgccacagcccgctctgcccgcag
cccgccgcctgcagccacgatgggaaactcttgtctagtgatgtgactcattacgtgatc
ccagactggaaagttgttcaagattatcttgagatccttgagtttcctgagttgaaggga
attattttcatgcagacagcttgtcaagctgtgcagcatcaaagaggcaggagacagtat
aacaaactgcgaaacctgctgaaggatgcgcgtcatgattgcattctctttgctaatgaa
ttccagcaatgctgctatctgccacgggaaagaggagagtccatggagaagtggcagacc
aggagcatatacaacgcagctgtttggtactatcatcactgccaggacaggatgccaatt
gttatggtgacagaagatgaagaggcaattcagcagtatggaagtgaaacagaaggagta
ttcgtgattactttcaagaattacctggacaatttctggcctgatttaaaagctgcccac
gagctttgtgattctatccttcagtctcgacgggagagagagaatgagagtcaggagagc
catgggaaggagtacccagaacatcttcccctggaagtgttagaagctgggattaaatct
ggacgctatatccagggaattctgaatgtcaacaaacacagagcccaaatagaagctttt
gttcgacttcaaggagccagcagtaaagattcagatttagtcagtgacatcctaatccac
gggatgaaggctcgaaaccgctcaattcatggagatgtggtagttgtggagctgcttcct
aaaaatgaatggaaaggaagaaccgtagccctgtgtgagaatgactgtgacgacaaggct
tcgggcgagtccccaagtgagcccatgcctacaggtcgagtggtgggcatacttcagaag
aactggcgggattatgtggtgacatttccgtccaaagaagaggtccaatctcagggcaaa
aatgctcagaaaatcctggttacaccttgggattacagaattcccaaaattcgaattagc
actcagcaagcagaaaccctccaggacttcagggtggtcgtgcgcatcgattcctgggag
tcaacatctgtgtatccaaatggacattttgtgcgtgttttaggaagaatcggagatctg
gaaggggaaattgcaaccatcctggtggaaaacagtatttcagttattcctttctcagaa
gctcagatgtgtgagatgccagtaaacacaccagaaagtccctggaaggtgagtcctgaa
gaggaacaaaaacgtaaagacttgaggaaaagccatctcgtattcagcattgaccccaaa
ggttgtgaagatgtggatgacacactctcagtcagaaccttaaataatggcaacctggaa
cttggggtccacatcgcagatgtaacacactttgtggcaccaaattcttacattgatatt
gaagctagaacaaggtatgctgtaagcatcatgtgggaactggataaagcctcttatgaa
attaagaaagtgtggtatggcagaaccattattcgatcagcatacaaactgttctatgaa
gcagcccaagaactactggatggaaacttaagcgttgttgatgatattccagaattcaaa
gacttggatgagaagagcagacaagccaagctggaggagttggtgtgggcaattggaaag
ctgaccgacatagctcgccatgtcagagctaaacgagacggatgtggtgccctggaactg
gaaggggtagaggtttgcgtacagctagatgacaaaaagaacattcacgacctcatcccc
aagcagcccctggaagtccacgagacagtggctgaatgcatgatcctggccaaccactgg
gtcgccaaaaagatctgggagagcttccctcatcaggccttgctgcgccagcaccctcct
ccacaccaggagttcttttcagaactccgggaatgtgctaaagccaaaggcttcttcata
gatacacggtccaataaaacactggctgattctctggataatgcgaacgacccccacgat
cccattgtgaacaggctactgcgctccatggccacgcaggccatgtcgaatgctctgtac
ttctccaccggatcctgtgcggaggaggagttccatcattacggtcttgcattagataaa
tatacccactttacttctccaataagaagatattcagatattgtagtacaccgcttgtta
atggcagccatttcaaaagataagaaaatggaaattaagggaaatctgttcagcaacaaa
gatcttgaggaattatgcagacatatcaacaacagaaaccaagcagcacagcattctcag
aagcagtctactgagctcttccagtgcatgtacttcaaagacaaagaccctgccaccgag
gagcgttgcatatctgacggagttatttattcaattagaacaaatggtgtgcttctattt
ataccaagacttgaaataattagtaacaaaccatacaagataccaaatacagaacttatt
catcagagttcccccttgctgaagagtgagttagtgaaagaagtaactaaatctgtggaa
gaagctcagcttgcccaagaagtcaaagtaaacatcattcaggaggaatatcaagaatat
cgccaaacaaagggaaggagcctatacacacttctagaggagatacgggacctagctctc
ctggatgtttcaaacaattatggaatatga

>gi568815583f:66195098_66433309|GENSCAN_predicted_peptide_5|214_aa
MVKELSLMKAEDLKMLIRHMEHWAHRLFPKLQFEDFIDRVEYLGSKKEVQTCLKRIRLDL
PILHEDFVSNNDEVAENNEHDVTSTELDPFLTNLSESEMFASELSRSLTEEQQQRIERNK
QLALERRQAKLLSNSQTLGNDMLMNTPRAHTVEEVNTDEDQKEESNGLNEDILDNPCNDA
IANTLNEEETLLDQSFKNVQQQLDATSRNITEAR

>gi568815583f:66195098_66433309|GENSCAN_predicted_CDS_5|645_bp
atggtgaaggaactgagcctgatgaaggctgaagacttgaagatgctaatcagacacatg
gagcactgggcacataggctattccctaaactgcagtttgaggattttattgacagagtt
gaatacctgggaagtaaaaaggaagttcagacctgtttaaaacgaattcgacttgatctc
cctattttacatgaagattttgttagcaataatgatgaagttgcggagaataatgaacat
gatgtcacttctactgaattagatccctttctgacaaacttatctgaaagtgagatgttt
gcttctgagttaagtagaagcctaacagaagagcaacaacaaagaattgagagaaataaa
caactggccttggaaagaaggcaggcaaagctgctgagtaatagtcagaccctaggaaat
gatatgttaatgaatacacccagggcacacacggttgaagaggttaatactgatgaggat
caaaaggaggagtcaaatggattaaacgaagacattctggacaatccatgtaatgatgct
attgccaatactttaaatgaagaggaaacactgctggaccagtcttttaaaaatgtgcaa
cagcaacttgatgctacatccagaaatattactgaagctagataa

>gi568815583f:66195098_66433309|GENSCAN_predicted_peptide_6|407_aa
MDKSLNLATFQISPLYIMEYYADIKRNEITSFAGTWVELEVIILSKLTQVYKTKYRMFSL
MCVPYKPGRTADAMNYTLNAISPVLPNSISELGKGYVRELVPLTAEPSGAGFSWMGVGFF
LGILDPGNALPTPGGGQHLSLPPAALPRARCAGPSPGRGPRGCLQRTTNGGKPRRQMTRK
YVTGARRTAWGRGPSRTPWRPLSPRGKQVDPAEKKSVSLCVSKTLGVQNKGNVYRTKETS
TPGYPQPGVGDLAENVNITLKGHTVIVNGPRGTLRRDFNHINVELSLLGKKKKEALVDKW
WGNRKELATVRTICSHVQNMIKGVTLGFRYKMRSVYAYFPIIVVIPENGSVVEIRNFLGE
KYIRRVRMRPGVACSVSQAQKDELIFEGNDIELVSNSAALIQQATTV

>gi568815583f:66195098_66433309|GENSCAN_predicted_CDS_6|1224_bp
atggacaagtcacttaaccttgccacatttcagatttctcctctatatatcatggaatac
tatgcagacataaaaaggaacgagatcacgtcctttgcgggaacatgggtggagctggag
gtcattattctcagcaaactaacgcaggtatacaaaaccaaataccgcatgttctcacta
atgtgtgttccttacaaacctggcagaactgctgatgctatgaactacaccctgaatgca
ataagtcccgtattacccaactccatctctgaactgggtaagggctatgttagagagctg
gtcccgttaactgcagagccgtcgggggccgggttcagctggatgggcgtcggcttcttc
ttgggcattttggacccgggtaacgcgcttccaactccggggggagggcagcacctctcg
cttcctcccgctgcgctgccccgcgcccgctgcgcaggaccaagtccgggccgcgggccc
cggggctgccttcagcggacaaccaatgggggcaagcctcggcggcagatgacgcggaaa
tacgtcacgggagcgcggcgcactgcctgggggcggggtccgtcgcggacgccgtggcgc
cctctgtcgccccgaggcaagcaggtggacccagctgagaagaagtctgtgtcgctgtgt
gtgtcaaagacattaggtgtacagaacaaaggaaacgtgtacagaacaaaggaaacatct
acacctggttatccacagccaggtgttggtgacctagcagaaaatgtcaacattactctg
aagggacacacagttatcgtgaatggccccagaggaaccctgcggagggacttcaatcac
atcaatgtagaactcagtcttcttggaaagaaaaaaaaagaggctctggttgacaaatgg
tggggtaacagaaaagaactggctaccgttcggactatttgtagtcatgtacagaacatg
atcaagggtgttacactgggcttccgttacaagatgaggtctgtgtatgcttacttcccc
atcatcgtcgttatcccggagaatgggtctgttgttgaaatccgaaatttcttgggtgaa
aaatacatccgcagggtgcggatgagaccaggtgttgcttgttcagtatctcaagcccag
aaagatgaattaatctttgaaggaaatgacattgagcttgtttcaaattcagcggctttg
attcagcaagccacaacagtttaa