GENSCAN 1.0 Date run: 16-Jul-119 Time: 15:51:03 Sequence gi568815581r:65429979_65658620 : 228642 bp : 47.00% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 558 852 295 0 1 113 47 121 0.835 7.51 1.02 Term + 3748 4035 288 0 0 105 39 134 0.811 5.58 1.03 PlyA + 4384 4389 6 1.05 2.05 PlyA - 5517 5512 6 -0.45 2.04 Term - 12071 11970 102 2 0 107 40 68 0.703 2.18 2.03 Intr - 12495 12402 94 2 1 114 88 11 0.581 3.67 2.02 Intr - 22578 22467 112 1 1 127 108 -49 0.440 0.44 2.01 Init - 22694 22619 76 2 1 83 72 21 0.426 1.25 2.00 Prom - 24188 24149 40 -2.16 3.00 Prom + 28212 28251 40 -4.86 3.01 Init + 28869 28926 58 2 1 56 109 38 0.197 4.17 3.02 Intr + 49697 49726 30 0 0 110 96 16 0.292 2.70 3.03 Intr + 49818 49943 126 1 0 53 7 110 0.111 0.05 3.04 Intr + 65488 65593 106 2 1 94 103 120 0.769 13.37 3.05 Term + 74448 74511 64 0 1 49 55 63 0.134 -3.54 3.06 PlyA + 75956 75961 6 1.05 4.13 PlyA - 79735 79730 6 1.05 4.12 Term - 100124 99998 127 1 1 58 52 165 0.992 7.66 4.11 Intr - 104101 103934 168 0 0 75 82 121 0.975 9.26 4.10 Intr - 105743 105648 96 1 0 98 70 46 0.880 2.92 4.09 Intr - 106575 106342 234 2 0 93 101 85 0.975 7.10 4.08 Intr - 107085 106891 195 2 0 91 65 118 0.936 8.33 4.07 Intr - 107857 107346 512 1 2 142 106 499 0.999 48.87 4.06 Intr - 108365 108225 141 2 0 88 80 177 0.995 17.45 4.05 Intr - 111579 111477 103 2 1 92 117 45 0.987 7.98 4.04 Intr - 119456 119348 109 0 1 60 65 120 0.690 6.24 4.03 Intr - 119682 119546 137 2 2 107 20 204 0.864 15.71 4.02 Intr - 128601 127828 774 2 0 15 110 613 0.251 46.80 4.01 Init - 130977 130847 131 0 2 41 101 167 0.799 10.92 4.00 Prom - 135456 135417 40 -5.56 5.06 PlyA - 137115 137110 6 1.05 5.05 Term - 156174 156062 113 1 2 63 54 136 0.475 6.42 5.04 Intr - 164209 164100 110 0 2 122 77 -56 0.012 -3.37 5.03 Intr - 167260 167153 108 0 0 74 69 83 0.238 4.40 5.02 Intr - 198672 198607 66 2 0 46 79 87 0.198 1.62 5.01 Init - 207547 207471 77 1 2 82 103 17 0.438 3.16 5.00 Prom - 209107 209068 40 -6.06 6.05 PlyA - 209301 209296 6 1.05 6.04 Term - 213316 213179 138 1 0 -39 46 211 0.682 2.16 6.03 Intr - 213935 213644 292 1 1 52 54 341 0.715 24.24 6.02 Intr - 214126 213960 167 1 2 35 8 217 0.644 7.16 6.01 Intr - 220278 220081 198 0 0 42 62 94 0.146 1.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:65429979_65658620|GENSCAN_predicted_peptide_1|194_aa XLVKVPRLCRCCSCNISPATYLLSILWKMPRAAAAGIHGIPESMSCQAGKRGSFVLLGSQ LPALHQRPRGLVQTIVLLDGHRGPEFQQKQLGSKPGWQRMLTALRRRCTFPFYSKLPTSK PKKPQAFGGVVGQIFNYSNLEVYFPHTAKNRKFILEKEGSSLEGLNLIFDLQSSQTHLKI SKAQKPTDTIPRPQ >gi568815581r:65429979_65658620|GENSCAN_predicted_CDS_1|585_bp nngttggtgaaggtgccaaggctttgtcgatgctgttcctgcaatatttctcccgccacc tacttactgagcatcctttggaagatgcccagggctgcagcagccggcattcatggcatt cctgaaagtatgtcctgccaggcaggaaaaaggggcagctttgtgctgctgggttctcag ctcccagcactgcaccagcgaccaaggggcctggttcagaccattgtcttacttgatgga catagagggcccgagttccagcagaagcagctgggatccaagcctggatggcagaggatg ctgacagccttgagaagacgctgtacttttccgttctacagtaagctgcccacatcaaag cccaagaagccacaggcatttgggggagttgtgggtcagatatttaattattctaatcta gaagtctattttccccacactgccaagaacagaaagttcattttggaaaaagagggtagt tccttggaagggctgaatctcatctttgatctgcagagctctcagacccacctcaagatt tccaaagcccagaaacccacagacactatccctcgcccacaataa >gi568815581r:65429979_65658620|GENSCAN_predicted_peptide_2|127_aa MGPLLFQGTFNRSPFLHLRNEGDNNRPPFAVVEATVLSHQSPYVVSSFLFLPLHNPDTRV TFKTITVAGLGHKCTLACRNGVIPTETMLTQAVQLRKLGAAKLSNLREGKSKVTEMGREA RFQLWMC >gi568815581r:65429979_65658620|GENSCAN_predicted_CDS_2|384_bp atgggtccactccttttccaaggtacgttcaatagatctccatttcttcatctccgaaat gagggcgataataacaggccaccatttgcagtggtggaagccactgtcctctctcatcag tccccttatgttgtctcctcatttctatttttgccccttcacaaccctgacacaagagtg actttcaagacaatcactgtggccggcttgggtcataaatgcaccttggcgtgcaggaat ggagtcatccccactgaaaccatgttgactcaggcagtgcagctgaggaaactgggcgca gcaaagctaagcaatttacgcgagggcaaatcgaaagtgacagaaatgggacgagaagcc cgattccagctttggatgtgctga >gi568815581r:65429979_65658620|GENSCAN_predicted_peptide_3|127_aa MEQRDCPEIAAFVNLDLFAEGWKLFKEEEDLLQEQWRDPKANTGRCGWCFQLKDVASSKS LNIFKPSPVKQGRAATRWTKEPQEHSLNPSPLGRDSRQRASNCGNPRDQIQPHGNVPSWF MSTFEKH >gi568815581r:65429979_65658620|GENSCAN_predicted_CDS_3|384_bp atggagcagagagactgcccagaaattgctgcttttgtcaacctagatttgtttgcagaa ggctggaaacttttcaaagaagaagaggacctccttcaggagcaatggagggatccaaaa gccaacactggaagatgtggatggtgcttccagctcaaggacgtggcctccagcaagtcc ctgaacatctttaagccctcacctgtcaaacagggccgagcagccacaagatggaccaag gagccgcaagagcattccctgaacccttctccactgggccgtgattctcggcagcgagcc tccaactgcggcaaccctcgggaccagattcagccgcatggcaacgtccccagctggttc atgagcacatttgagaagcactga >gi568815581r:65429979_65658620|GENSCAN_predicted_peptide_4|908_aa MFPDAAAGPGGPGAAAASRAGGRGRAPARGACEVSPPSARGDQCSSFREDAPRPPVPGEE GETPPCQPGVGKGQVTKPMPVSSNTRRNEDGLGEPEGRASPDSPLTRWTKSLHSLLGDQD GAYLFRTFLEREKCVDTLDFWFACNGFRQMNLKDTKTLRVAKAIYKRYIENNSIVSKQLK PATKTYIRDGIKKQQIDSIMFDQAQTEIQSVMEENAYQMFLTSDIYLEYVRSGGENTAYM SNGGLGSLKVVCGYLPTLNEEEEWTCADFKCKLSPTVVGLSSKTLRATASVRSTETVDSG YRSFKRSDPVNPYHIGSGYVFAPATSANDSEISSDALTDDSMSMTDSTGFADRTCACMDI SLVGGEYDKGMVLIKEDDMVIWVGDGIPPYRVGSKKQLQREMHRSVKANGQVSLPHFPRT HRLPKEMTPVEPATFAAELISRLEKLKLELESRHSLEERLQQIREDEEREGSELTLNSRE GAPTQHPLSLLPSGSYEEDPQTILDDHLSRVLKTPGCQSPGVGRYSPRSRSPDHHHHHHS QYHSLLPPGGKLPPAAASPGACPLLGGKGFVTKQTTKHVHHHYIHHHAVPKTKEEIEAEA TQRVHCFCPGGSEYYCYSKCKSHSKAPETMPSEQFGGSRGSTLPKRNGKGTEPGLALPAR EGGAPGGAGALQLPREEGDRSQDVWQWMLESERQSKPKPHSAQSTKKAYPLESARSSPGE RASRHHLWGGNSGHPRTTPRAHLFTQDPAMPPLTPPNTLAQLEEACRRLAEVSKPPKQRC CVASQQRDRNHSATVQTGATPFSNPSLAPEDHKEPKKLAGVHALQASELVVTYFFCGEEI PYRRMLKAQSLTLGHFKEQLSKKGNYRYYFKKASDEFACGAVFEEIWEDETVLPMYEGRI LGKVERID >gi568815581r:65429979_65658620|GENSCAN_predicted_CDS_4|2727_bp atgttccccgacgccgcggccggcccgggggggcccggggctgccgccgcgagccgagcc gggggccggggccgggcccctgccaggggcgcctgcgaggtctcgccgccgagcgcgcgc ggggatcaatgcagcagcttccgtgaggatgccccgcggcccccagtgccaggggaagaa ggggagaccccaccgtgtcagccaggggtgggcaagggccaggtcaccaaacccatgcct gtctcttccaacaccaggcggaacgaagatgggttgggggagccggaggggcgggcatct ccggattcccctctgacccggtggaccaagtccttacactccttattgggcgatcaagac ggtgcttacctgttccgaactttcctggagagggagaaatgcgtggataccttagacttc tggtttgcctgcaatggattcaggcagatgaacctgaaggataccaaaactttacgagta gccaaagcgatctacaaaaggtacattgagaacaacagcattgtctccaagcagctgaag cctgccaccaagacctacataagagatggcatcaagaagcagcagattgattccatcatg tttgaccaggcgcagaccgagatccagtcggtgatggaggaaaatgcctaccagatgttt ttgacttctgatatatacctcgaatatgtgaggagtgggggagaaaacacagcttacatg agtaatgggggactcgggagcctaaaggtcgtgtgtggctatctccccaccttgaatgaa gaagaggagtggacttgtgccgacttcaagtgcaaactttcgccaaccgtggttggcttg tccagcaaaactctgagggccacggcgagtgtgaggtccacggaaactgttgacagtgga tacaggtccttcaagaggagcgatcctgttaatccttatcacataggttctggctatgtc tttgcaccagccaccagcgccaacgacagtgagatatccagtgatgcgctgacggatgat tccatgtccatgacggacagcactggctttgcagaccgcacctgtgcatgtatggacatc agtttggtgggaggggagtatgacaagggtatggtgctgattaaggaggatgacatggtc atctgggtaggagatggaattcctccttatcgtgtgggcagtaagaaacagctccagaga gaaatgcatcgcagtgtgaaggccaatggccaagtgtctctacctcatttcccgagaacc caccgcctgcccaaggagatgacccccgtggaacccgccacctttgcagctgagctgatc tcgaggctggaaaagctgaagctggagttggagagccgccacagcctggaggagcgcctg cagcagatccgagaggatgaagagagagagggctccgagctcacactcaattcgcgggag ggggcgcccacgcagcaccccctctccctactgccctccggcagctacgaggaagacccg cagacgatactggacgatcacctgtccagggtcctcaagacccctggctgccagtctcca ggcgtaggccgctatagcccccgctcccgctccccggaccaccaccaccaccaccattcg cagtaccactccctgctcccgcccggtggcaagctgcctcccgcggccgcctcgccgggc gcctgccccctcctcgggggcaaaggctttgtgaccaagcagacgacgaagcatgtccac caccactacatccaccaccatgccgtccccaagaccaaggaggagatcgaggcggaggcc acgcagcgggtgcactgcttctgccctgggggcagcgagtattactgctactcgaaatgc aaaagccactccaaggctccggaaaccatgcccagcgagcagtttggcggcagcagaggc agtaccttgcccaaacgcaatgggaaaggcacggagccgggcctggccctgcccgccagg gaaggaggggcccccggcggagctggggccctgcagcttccccgggaggaaggagacagg tcgcaggatgtctggcagtggatgctggagagtgagcggcagagcaagcccaagccccat agtgcccaaagcacaaaaaaggcctaccccttggagtctgcccgctcgtctccaggcgaa cgagccagccggcaccatctgtgggggggcaacagcgggcacccccgcaccaccccccgt gcccacctgttcacccaggaccctgcgatgcctcccctgaccccacccaacacgctggct cagctggaggaggcctgtcgcaggctagctgaggtgtcgaagcccccaaagcagcggtgc tgtgtggccagtcagcagagggacaggaatcattcggccactgttcagacgggagccaca cccttctccaatccaagcctggctccagaagatcacaaagagccaaagaaactggcaggt gtccacgcgctccaggccagtgagttggttgtcacttactttttctgtggggaagaaatt ccataccggaggatgctgaaggctcagagcttgaccctgggccactttaaagagcagctc agcaaaaagggaaattataggtattacttcaaaaaagcaagcgatgagtttgcctgtgga gcggtgtttgaggagatctgggaggatgagacggtgctcccgatgtatgaaggccggatt ctgggcaaagtggagcggatcgattga >gi568815581r:65429979_65658620|GENSCAN_predicted_peptide_5|157_aa MAWAKEQRSEKDTVWKQQWWAGREDSFASCEVTAAVTNLTSTLMLLVSKTTFNRSLFLTP TLITNEDFLSANNMKGKPFTWFLKERPAPSIIILKELLPLGTQKDFCEKKKRVVVFFPPD AIAGHSQKAALCKPEREPSPGTKDVSIFILDFQPPEL >gi568815581r:65429979_65658620|GENSCAN_predicted_CDS_5|474_bp atggcctgggccaaagagcaacggagcgagaaagacacggtgtggaagcagcagtggtgg gcagggcgtgaggacagttttgcttcctgtgaggtgacggcagctgtcactaacctgaca tcaaccctgatgctgctcgtcagcaaaaccacctttaataggagcctgtttctgacacct actctcatcacaaacgaagacttcctctctgcaaacaacatgaaaggaaagccttttact tggttcctaaaggagaggccagcaccaagtataataatcctgaaagagctgctgcctctg ggaacccagaaagacttttgcgaaaaaaaaaaaagagttgtggtgttttttcctcctgat gccatagcaggacatagccagaaggcagccctctgcaagccagaaagagagccctcacca ggaaccaaagacgtcagcatcttcatcttggacttccagcctccagaactgtga >gi568815581r:65429979_65658620|GENSCAN_predicted_peptide_6|264_aa ICTLPDITTEQGPPTSRCPDGHSGKTPHNKKEELEKKQEPSCCEKKHRALVNELEESKHV WYSRGQRLFMVLWLKGVTFNVTIIDTKRQTKIVQKLCPGGQLPFLLYGMEEHKDTNKIEE FLYPKLAALNPESNTAEQDIFAKFSAYIKNSNPVVNDNLEKGLLKALKVLDNHLISCHSE EVDETSAEDEGISQRKFLDGNELTLADCNLLLTLHIVQVGFTIPEAFQGVQQYLSNAYAQ EESASTCPDDEIKLTYEQGSKALQ >gi568815581r:65429979_65658620|GENSCAN_predicted_CDS_6|795_bp atctgcaccctccctgacatcaccacagagcaggggcccccaacatccagatgtccagat ggacacagtggcaaaaccccacacaacaagaaagaagaattagagaaaaaacaagaacca tcctgttgtgagaaaaagcacagggcattggtgaatgaattggaagaaagcaaacatgtc tggtattccagaggccagagactgttcatggtgctgtggctcaagggagtcaccttcaat gtcaccatcattgacaccaagaggcagaccaagatagtgcaaaagctgtgcccaggaggg cagctcccattcctgctgtatggcatggaagaacacaaggacaccaacaagattgaggaa tttctgtaccccaagctggcagctctgaaccctgaatccaacacagctgagcaggacata tttgccaaattttctgcttacatcaagaattcaaacccagtagtcaatgacaatctggag aagggactcctgaaagccctgaaggttttagacaatcacttgatatcctgccactcagaa gaagtggatgaaaccagtgctgaagatgaaggcatctctcagaggaagtttctggatggc aatgagctcaccctggctgactgcaacctgttgctaacgctccacatagtacaagtggga ttcaccatccctgaggccttccagggcgtgcagcagtacttgagcaatgcctatgctcag gaagaatctgcctccacctgtccagatgatgagatcaagctcacctatgagcaaggatcc aaggccctccaataa