GENSCAN 1.0    Date run: 3-Nov-116    Time: 05:27:46

Sequence gi568815593f:65892719_66178527 : 285809 bp : 37.75% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -    498    493    6                               1.05
 1.01 Sngl -   6193   6026  168  1  0  -11   34   357 0.716  14.91
 1.00 Prom -  14400  14361   40                              -4.35

 2.02 PlyA -  14832  14827    6                               1.05
 2.01 Sngl -  33114  32578  537  0  0   75   34   205 0.788   7.66
 2.00 Prom -  67001  66962   40                              -2.25

 3.00 Prom +  73543  73582   40                              -4.35
 3.01 Init +  83994  84114  121  2  1   68   74    79 0.606   4.90
 3.02 Term +  91313  91422  110  0  2   91   36   142 0.423   6.99
 3.03 PlyA +  94077  94082    6                               1.05

 4.00 Prom +  97499  97538   40                              -6.05
 4.01 Init + 100001 100189  189  1  0   64   91   166 0.674  13.56
 4.02 Intr + 102029 102146  118  1  1   87   91    71 0.883   6.42
 4.03 Intr + 119331 119409   79  1  1  112  108    14 0.202   3.49
 4.04 Intr + 120831 120920   90  0  0   62   99    44 0.145   1.09
 4.05 Intr + 133130 133259  130  2  1   66   80   107 0.923   7.48
 4.06 Intr + 145665 145764  100  2  1   79   58    51 0.833  -0.04
 4.07 Intr + 148062 148115   54  1  0   80   81    49 0.701   1.43
 4.08 Intr + 150359 150480  122  0  2   63   96    62 0.860   3.79
 4.09 Intr + 151419 151592  174  2  0   84   72   134 0.990  10.61
 4.10 Intr + 153635 153820  186  1  0   72   64   124 0.993   7.36
 4.11 Intr + 155949 156063  115  2  1   85   72   128 0.999  10.00
 4.12 Intr + 158065 158248  184  2  1   92   89    57 0.999   4.12
 4.13 Intr + 160688 162233 1546  2  1   57   86   988 0.995  83.94
 4.14 Intr + 170042 170239  198  1  0   51    3   148 0.005   1.23
 4.15 Intr + 182306 182512  207  1  0   73   94    82 0.706   5.65
 4.16 Intr + 183598 183690   93  0  0   42   95   112 0.989   6.54
 4.17 Intr + 184157 184231   75  1  0   67   91    32 0.533   0.19
 4.18 Term + 185705 185812  108  1  0   88   47    96 0.990   3.03
 4.19 PlyA + 186454 186459    6                               1.05

 5.00 Prom + 199141 199180   40                              -5.95
 5.01 Init + 209112 209260  149  2  2   88   -4   125 0.395   2.91
 5.02 Intr + 209826 210064  239  0  2  -35   89   213 0.575   5.44
 5.03 Intr + 211803 211945  143  1  2  -10   46   136 0.437  -1.25
 5.04 Term + 214315 214494  180  0  0   69   39   201 0.681   9.93
 5.05 PlyA + 215135 215140    6                               1.05

 6.00 Prom + 215671 215710   40                             -10.35
 6.01 Init + 217128 217783  656  2  2   42   86   273 0.210  17.30
 6.02 Intr + 247220 247293   74  2  2   53   94    41 0.213  -0.67
 6.03 Intr + 251621 251819  199  0  1   30   83   323 0.673  23.49
 6.04 Intr + 256934 257094  161  2  2   46   83    56 0.432  -0.39
 6.05 Intr + 260774 260878  105  0  0   68   81    36 0.292   0.17
 6.06 Intr + 266501 266616  116  0  2   85   93    41 0.726   3.55
 6.07 Intr + 269391 269555  165  2  0   81   93   137 0.998  12.74
 6.08 Intr + 269696 269874  179  1  2   57   63   138 0.998   6.00
 6.09 Intr + 271074 271204  131  0  2   48  111   186 0.998  16.32
 6.10 Intr + 272065 272179  115  2  1   62   94    64 0.989   2.99
 6.11 Intr + 277333 277452  120  1  0   72  113    23 0.782   1.89
 6.12 Intr + 277867 278229  363  1  0   53   36   535 0.215  38.17
 6.13 Term + 284799 284973  175  0  1   43   44   171 0.116   4.45
 6.14 PlyA + 285093 285098    6                              -0.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 223856 223704  153  2  0   78   55    98 0.848   2.44
S.002 Init - 229911 229834   78  0  0   85   59    71 0.851   5.01

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:65892719_66178527|GENSCAN_predicted_peptide_1|55_aa
MCQDPKLVEEEEEEEEEEEEEEEEEEEEEEGKRREEKRKEEKPTILRLSAISNTL

>gi568815593f:65892719_66178527|GENSCAN_predicted_CDS_1|168_bp
atgtgtcaagaccccaaattagttgaagaagaagaagaagaagaagaagaagaagaagaa
gaagaagaagaagaagaagaagaggaggaggggaagagaagagaagagaagagaaaagaa
gagaagccaacaattttaaggctcagtgcaatttccaatacgttataa

>gi568815593f:65892719_66178527|GENSCAN_predicted_peptide_2|178_aa
MGQGRRGVTASQAAAATAPPPRPRRVKRLRAATLRRGRGRGGLPPASPQPPTAGQEACAG
QSARGGGAGRRRRLGAAGRGPGEGQPAAVGSRARRAVRPTLAQAPRGLPAAPCPLPDPAL
PDSRAPQGGERWQRRPCCLLAAQRPAREIESPQASAAPADGVSLYDSNRCTGVVGVQP

>gi568815593f:65892719_66178527|GENSCAN_predicted_CDS_2|537_bp
atggggcaggggcggaggggagttactgcctcacaggccgcggcggcaacagcgccaccg
ccccggccgcgacgggtgaagaggctgcgcgccgcgactctgcggcgggggcgggggcgg
ggcggcctcccgccggcttcgccccaaccgccaacagccgggcaggaggcctgcgcgggc
cagtcagcgcgcgggggcggtgcgggccgacgtcggcggctgggggcagcgggccggggg
ccgggggaggggcagccggcggccgtgggttcgcgtgcaaggcgcgccgtgaggcccact
ctggcccaggcccctcgaggcctgcccgctgctccctgcccgctccccgaccccgcactc
ccggactcccgagccccccagggtggggaacgctggcaacgccgaccctgctgtctgctc
gcggcccagcgcccagccagagagatagagagcccacaggcctccgccgcgccggccgat
ggagtttctttgtacgacagtaatcggtgcacaggggttgttggagtccagccctga

>gi568815593f:65892719_66178527|GENSCAN_predicted_peptide_3|76_aa
MTLTEHAAFKHLFNKAHLAPPLIHSTLSGYSTCFREHRVGALPQEEVSSSKFQFLGNSGP
DRLHGAAGPNILAGKE

>gi568815593f:65892719_66178527|GENSCAN_predicted_CDS_3|231_bp
atgactcttacggagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg
cccttaatccattcaaccctgagtggatacagcacatgtttcagagagcacagggttggg
gctctaccccaagaggaggtatcaagcagcaaattccagtttctgggaaatagtggacca
gatcgtctccatggagcagctggtcctaacatattggccggcaaggaataa

>gi568815593f:65892719_66178527|GENSCAN_predicted_peptide_4|1255_aa
MTTKRSLFVRLVPCRCLRGEEETVTTLDYSHCSLEQVPKEIFTFEKTLEELYLDANQIEE
LPKQLFNCQSLHKLSLPDNDLTTLPASIANLINLRELDVSKNGIQEFPENIKNCKVLTIV
EASVNPISKLPDGFSQLLNLTQLYLNDAFLEFLPANFGRLISVEELDCSFNEVEALPSSI
GQLTNLRTFAADHNYLQQLPPESKPLIPLQKETDSETQKMVLTNYMFPQQPRTEDGIVLE
LSKGKNSCLQDSEVMFISDNESFNPSLWEEQRKQRAQVAFECDEDKDEREAPPREGNLKR
YPTPYPDELKNMVKTVQTIVHRLKDEETNEDSGRDLKPHEDQQDINKDVGVKTSESTTTV
KSKVDEREKYMIGNSVQKISEPEAEISPGSLPVTANMKASENLKHIVNHDDVFEESEELS
SDEEMKMAEMRPPLIETSINQPKVVALSNNKKDDTKETDSLSDEVTHNSNQNNSNCSSPS
RMSDSVSLNTDSSQDTSLCSPVKQTHIDINSKIRQEDENFNSLLQNGDILNSSTEEKFKA
HDKKDFNLPEYDLNVEERLVLIEKSVDSTATADDTHKLDHINMNLNKLITNDTFQPEIME
RSKTQDIVLGTSFLSINSKEETEHLENGNKYPNLESVNKVNGHSEETSQSPNRTEPHDSD
CSVDLGISKSTEDLSPQKSGPVGSVVKSHSITNMEIGGLKIYDILSDNGPQQPSTTVKIT
SAVDGKNIVRSKSATLLYDQPLQVFTGSSSSSDLISGTKAIFKFDSNHNPEEPNIIRGPT
SGPQSAPQIYGPPQYNIQYSSSAAVKDTLWHSKQNPQIDHASFPPQLLPRSESTENQSYA
KHSANMNFSNHNNVRANTAYHLHQRLGPARHGEMWAISPNDRLIPAVTRSTIQRQSSVSS
TASVNLGDPGSTRRAQIPEGDYLSYREFHSAGRTPPMMPGSQRPLSARTYSIDGPNASRP
QSARPSINEIPERTMSVSDFNYSRTSPSKRPNARVGSEHSLLDPPGKSKVPRDWREQVLR
HIEAKKLEKSQPSAAGLLELTGGPLQTLFAWVSAALAAEQRILVNRKCCCLIIPLEVLSQ
RSTWPCEVSVCPYWGMPLSNGQMGQPLRPQANYSQIHHPPQASVARHPSREQLIDYLMLK
VAHQPPYTQPHCSPRQGHELAKQEIRVRVEKDPELGFSISGGVGGRGNPFRPDDDGIFVT
RVQPEGPASKLLQPGDKIIQANGYSFINIEHGQAVSLLKTFQNTVELIIVREVSS

>gi568815593f:65892719_66178527|GENSCAN_predicted_CDS_4|3768_bp
atgactacaaaacgaagtttgtttgtgcggttggtaccatgtcgctgtctacgaggggaa
gaggagactgtcactactcttgattattctcattgcagcttagaacaagttccgaaagag
atttttacttttgaaaaaaccttggaggaactctatttagatgctaatcagattgaagag
cttccaaagcaactttttaactgtcagtctttacacaaactgagtttgccagacaatgat
ttaacaacgttaccagcatccattgcaaaccttattaatctcagggaactggatgtcagc
aagaatggaatacaggagtttccagaaaatataaaaaattgtaaagttttgacaattgtg
gaggccagtgtaaaccctatttccaagctccctgatggattttctcagctgttaaaccta
acccagttgtatctgaatgatgcttttcttgagttcttgccagcaaattttggcaggtta
atatcagtagaagaactggattgtagtttcaatgaagttgaagctttgccttcatctatt
gggcagcttactaacttaagaacttttgctgctgatcataattacttacagcagttgccc
ccagagtccaaacccctgatacctcttcaaaaagaaactgattcagagacccagaaaatg
gtgcttaccaactacatgttccctcaacagccaaggactgaggatggtatcgtgctagag
ctaagtaaaggcaaaaattcttgccttcaagattctgaagttatgtttatatcagataat
gaaagttttaacccttcattgtgggaggaacagaggaaacagcgggctcaagttgcattt
gaatgtgatgaagacaaagatgaaagggaggcacctcccagggagggaaatttaaaaaga
tatccaacaccatacccagatgagcttaagaatatggtcaaaactgttcaaaccattgta
catagattaaaagatgaagagaccaatgaagactcaggaagagatttgaaaccacatgaa
gatcaacaagatataaataaagatgtgggtgtgaagacctcagaaagtactactacagta
aaaagcaaagttgatgaaagagaaaaatatatgataggaaactctgtacagaagatcagt
gaacctgaagctgagattagtcctgggagtttaccagtgactgcaaatatgaaagcctct
gagaacttgaagcatattgttaaccatgatgatgtttttgaggaatctgaagaactttct
tctgatgaagagatgaaaatggcggagatgcgaccaccattaattgaaacctctattaac
cagccaaaagtcgtagcacttagtaataacaaaaaagatgatacaaaggaaacagattct
ttatcagatgaagttacacacaatagcaatcagaataacagcaattgttcttctccatct
cggatgtctgattcagtttctcttaatactgatagtagtcaagacacctcactctgctct
ccagtgaaacaaactcatattgatattaattccaaaatcaggcaagaagatgaaaatttt
aacagccttttacaaaatggagatattttaaacagttcaacagaggaaaagttcaaagct
catgataaaaaagattttaacttacctgaatatgatttgaatgttgaagagcgattagtt
ctaattgagaaaagtgttgactcaacagccacagctgatgacactcacaaattagatcat
atcaatatgaatcttaataaacttataactaatgatacatttcaaccagagatcatggaa
agatcaaaaacacaggatattgtgcttggaacaagctttttaagcattaattctaaagag
gaaactgagcacttggaaaatggaaacaagtatcctaatttggaatccgtaaataaggta
aatggacattctgaggaaacttcccagtctcctaataggactgaaccacatgacagtgat
tgttctgttgacttaggtatttccaaaagcactgaagatctctcccctcagaaaagtggt
ccagttggatctgttgtgaaatctcatagcataactaatatggagattggagggctaaaa
atctatgatattcttagtgataatggacctcagcagccaagtacaaccgttaaaatcaca
tctgctgttgatggaaaaaatatagtcaggagcaagtctgccacactgttgtatgatcaa
ccattgcaggtatttactggttcttcctcatcttctgatttaatatcaggaacaaaggca
attttcaagtttgattcaaatcataatcccgaagagccaaatataataagaggccccaca
agtggcccacaatctgcacctcaaatatatggtcctccacagtataatatccaatacagt
agcagtgctgcagtcaaagacactttgtggcactccaaacaaaatccccaaatagaccat
gccagttttcctcctcagctccttcctagatcagagagcacagaaaatcaaagttatgct
aaacattctgccaatatgaatttctctaatcataacaatgttcgagctaatactgcatac
catttacatcagagacttggcccagcaagacatggggaaatgtgggccatctcaccaaac
gaccgacttattcctgcagtaactcgaagtacaatccagcgacaaagtagtgtgtcctcc
acagcctctgtaaatcttggtgatccaggctctacaaggcgggctcagattcctgaagga
gattatttatcatacagagagttccactcagcgggaagaactcctccaatgatgccagga
tcacagagacccctttctgcacgaacatacagcatagatggtccaaatgcatcaagacct
cagagtgctcgaccctctattaatgaaataccagagagaactatgtcagttagtgatttc
aattattcacggactagtccttcaaaaagaccaaatgcaagggttggttctgagcattct
ttattagatcctccaggaaaaagtaaagttcctcgtgactggagagaacaagtacttcga
catattgaagccaaaaagttagaaaagtcacaaccctcagctgcaggtctgttggagttg
actggaggtccactccagaccctgtttgcctgggtatcagcagcactggctgcagaacag
cggatattggtgaaccgcaaatgctgctgcctgatcattcctctggaagttttgtctcag
aggagtacctggccgtgtgaggtgtcagtctgcccctactgggggatgcctttgagtaat
ggacagatgggccagcctctcaggcctcaggcaaattatagtcaaatacatcacccccct
caggcatctgtggcaaggcatccctctagagaacaactaattgattacttgatgctgaaa
gtggcccaccagcctccatatacacagccccattgttctcctagacaaggccatgaactg
gcaaaacaagagattcgagtgagggttgaaaaggatccagaacttggatttagcatatca
ggtggtgtcgggggtagaggaaacccattcagacctgatgatgatggtatatttgtaaca
agggtacaacctgaaggaccagcatcaaaattactgcagccaggtgataaaattattcag
gctaatggctacagttttataaatattgaacatggacaagcagtgtccttgctaaaaact
ttccagaatacagttgaactcatcattgtacgagaagtttcctcataa

>gi568815593f:65892719_66178527|GENSCAN_predicted_peptide_5|236_aa
MDPNQEEIPDLPEKELRRLIIKLFREGPENGKAQCKEIQKMMQEVKGEAGIIGVTEEEDN
SKSLENIFGGIIEENFPGLVRDLDIQIQEAQRIPGKFIAKRSSPKHIVIRLSKVKTKEKS
LKSCETEAPGYYEHLYAHKLENLEVMDKFLEKYNPPTLNQEELDTLNRPITSSEIEMDCS
SMLAMEQSKMENDFDELTEEGFRKSVIPNFSELKEDVRTHSKEAKNIEKKIRRMAN

>gi568815593f:65892719_66178527|GENSCAN_predicted_CDS_5|711_bp
atggatccaaaccaagaagaaatccctgatttacctgaaaaagaactcaggaggttaatt
attaaactattcagggaggggccagagaatggcaaagcccaatgcaaggaaatccaaaaa
atgatgcaagaagtgaagggagaggctggaataatcggtgttactgaggaagaagacaat
tctaaaagcttggaaaacatctttggtggaataatcgaggaaaacttccctggccttgtt
agagacctagacatacaaatacaagaagcacaaagaatacctgggaaattcatcgcaaaa
agatcttcacctaaacacattgtcatcaggttatccaaagttaagacaaaggaaaaaagt
cttaagagctgtgaaacagaagcaccaggctactatgaacacctttatgcacataaacta
gaaaacctagaagtgatggataaattcctggaaaaatacaaccctcctacattaaatcag
gaagaattagataccctgaacagaccaataacaagcagtgagattgaaatggattgcagc
tccatgctagcaatggaacaaagcaagatggagaatgactttgatgagttaacagaagaa
ggcttcagaaagtcagtaataccaaacttctctgagctaaaggaggatgttcgaacccat
agcaaggaagctaaaaacattgaaaaaaagattagacgaatggctaactag

>gi568815593f:65892719_66178527|GENSCAN_predicted_peptide_6|852_aa
MIISIDGGKAFDKIQQPFMLKTLNKLGIDGTYLKRIRAIFDKPTANIILNGQKLEAFPLK
TGTRQGCPLSPLLFNIVLEVLARVIRQEKEIKGIQFGKEEVKLSLFADDMTVYLENPIFS
AQNLLKLISNFSNVSGYKINVQKSQAFLYTNNSQIMSELPFTIATKRITYLGIQLARDVK
DLFKENYKPLLNEIKEDTNKWKNIPCSQIGRILWPYCPSSSILFLSLYTPELLSPKSPPG
SQSNVGEREGNGSGIGMNSGGGFGLGLGFGLTPTSVIQVTNLSSAVTSEQMRTLFSFLGE
IEELRLYPPEIRQFRKCCFRDDCQALEDFRGKRVIPVGIARALVGYYNHRLFLVKKDRNS
VTQVCYVKFRDPSSVGVAQHLTNTVFIDRALIVVPCAEGKIPEESKALSLLAPAPTMTSL
MPGAGLLPIPTPNPLTTLGVSLSSLGAIPAAALDPNIATLGEIPQPPLMGNVDPSKIDEI
RRTVYVGNLNSQTTTADQLLEFFKQVGEVKFVRMAGDETQPTRFAFVEFADQNSVPRALA
FNGVMFGDRPLKINHSNNAIVKPPEMTPQAAAKELEEVMKRVREAQSFISAAIEPESGKS
NERKGGRSRSHTRSKSRSSSKSHSRRKRSQSKHRSRSHNRSRSRQKDRRRSKSPHKKRSK
SRERRKSRSRSHSRDKRKDTREKIKEKERVKEKDREKEREREKEREKEKERGKNKDRDKE
REKDREKDKEKDREREREKEHEKDRDKEKEKEQDKEKEREKDRSKEIDEKRKKDKKSRTP
PRSYNASRRSRSSSRNKKDKKREKERDHISERRERERSTSMRKSSNDRDGKEKLEKNSTS
LKVSSSHSVSGT

>gi568815593f:65892719_66178527|GENSCAN_predicted_CDS_6|2559_bp
atgattatctcaatagatggaggaaaggccttcgacaaaattcaacagcccttcatgcta
aaaactctcaataaactgggtattgatggaacatatctcaaaagaataagagctattttt
gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa
actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt
ctggccagggtaatcaggcaagagaaagaaataaagggtattcaattcggaaaagaggaa
gtaaaattgtccctatttgcagatgacatgactgtatatttagaaaaccccatcttctca
gcccaaaatctccttaagctgataagcaacttcagcaacgtctcaggatacaaaatcaat
gtgcaaaaatcacaagcatttttatacaccaataacagccaaatcatgagtgaactccca
ttcacaattgctacaaagagaataacatatctaggaatccaacttgcaagggatgtgaag
gacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaa
tggaagaacattccatgctcacagataggaagaatcctatggccatactgcccaagctct
agcattctcttcctgtccctgtacaccccagaactgctctctccaaagtcacctcctggt
tctcagtctaacgttggggagcgggaaggcaacggcagcgggatcgggatgaacagcggc
ggcggcttcggtttgggcttaggcttcggcctcacccccacgtcggtgattcaggtgacg
aatctgtcgtcggcggtgaccagcgagcagatgcggacgcttttttccttcctaggagaa
atcgaggagctgcggctctaccccccggagatccgtcagtttaggaaatgttgcttcaga
gatgactgtcaggctcttgaagatttcagaggcaaaagagttattcctgttggaattgct
agggctttagtaggttattataatcacagattgtttttggttaaaaaggacagaaacagt
gttacccaagtatgttatgttaagtttcgtgatccatcaagtgttggcgtggcccagcat
ctaactaacacggtttttattgacagagctctgatagttgttccttgtgcagaaggtaaa
atcccagaggaatccaaagccctctctttattggctcctgctccaaccatgacaagtctg
atgcctggtgcaggattgcttccaataccgaccccaaatcctttgactactcttggtgtt
tcacttagcagtttgggagctataccagcagcagcactagaccccaacattgcaacactt
ggagagataccacagccaccacttatgggaaacgtggatccttccaaaatagatgaaatt
aggagaacggtttatgttggaaatctgaattcccagacaacgacagctgatcaactactt
gaattttttaaacaagttggagaagtgaagtttgtgcggatggcaggtgatgagactcag
ccaactcggtttgcttttgtggaatttgcagaccaaaattctgtaccaagggcccttgct
tttaatggagttatgtttggagacaggccactgaaaataaatcactccaacaatgcaata
gtaaaaccccctgagatgacacctcaggctgcagctaaggagttagaagaagtaatgaag
cgagtacgagaagctcagtcatttatctcagcagctattgaaccagagtctggaaagagc
aatgaaagaaaaggcggtcgatctcgttcccatactcgctcaaaatccaggtctagctca
aaatcccattctagaaggaaaagatcacaatcaaaacacaggagtagatcccataataga
tcacgttcaagacagaaagacagacgtagatctaagagcccacataaaaaacgctctaaa
tcaagggagagacggaagtcaaggagtcgttcgcattcacgggacaagagaaaagacact
cgagaaaagatcaaggaaaaggaaagagtgaaagagaaagacagggaaaaggagagagag
agggaaaaggaacgtgaaaaagaaaaggaacggggtaaaaacaaagaccgggacaaggaa
cgggaaaaggaccgggaaaaagacaaggaaaaggacagagagagagaacgggaaaaagag
catgagaaggatcgagacaaagagaaggaaaaggaacaggacaaagaaaaggaacgagaa
aaagacagatccaaagagatagatgaaaaaagaaagaaggataaaaaatccagaacacca
cccaggagttacaatgcatcgcgaagatctcgtagttccagcagaaataagaaggataaa
aagagagaaaaagaaagggaccacatcagtgaaagaagagagagagaacgttcaacgtct
atgagaaagagttctaatgatagagatgggaaggagaagttggagaagaacagtacttca
cttaaagtaagcagcagtcattcggtgtctggcacttga