GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:41:14 Sequence gi568815595r:10228817_10549543 : 320727 bp : 49.95% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5643 5843 201 2 0 88 83 245 0.891 23.48 1.02 Intr + 9932 10223 292 1 1 125 103 101 0.800 12.01 1.03 Term + 13300 13412 113 2 2 15 45 152 0.694 2.32 1.04 PlyA + 14739 14744 6 1.05 2.00 Prom + 17286 17325 40 -8.56 2.01 Init + 20385 20798 414 2 0 81 100 293 0.473 26.27 2.02 Intr + 31321 31854 534 0 0 116 113 358 0.962 34.22 2.03 Intr + 41315 42199 885 1 0 103 87 514 0.923 44.32 2.04 Intr + 47545 47672 128 0 2 99 38 213 0.997 16.88 2.05 Intr + 49463 49646 184 2 1 63 110 230 0.989 22.49 2.06 Term + 50069 50209 141 1 0 82 47 120 0.977 5.23 2.07 PlyA + 50733 50738 6 1.05 3.09 PlyA - 50868 50863 6 1.05 3.08 Term - 51086 50974 113 0 2 36 45 106 0.062 -0.18 3.07 Intr - 57996 57888 109 0 1 83 113 131 0.999 14.96 3.06 Intr - 61062 60898 165 0 0 79 20 117 0.484 4.16 3.05 Intr - 61393 61257 137 2 2 97 94 69 0.740 8.69 3.04 Intr - 61674 61536 139 0 1 51 29 49 0.299 -4.66 3.03 Intr - 62052 61900 153 0 0 70 115 58 0.391 6.97 3.02 Intr - 64249 64173 77 2 2 82 68 31 0.506 -0.27 3.01 Init - 64526 64355 172 2 1 67 82 100 0.615 6.90 3.00 Prom - 69082 69043 40 -2.46 4.13 PlyA - 69804 69799 6 1.05 4.12 Term - 72558 72445 114 0 0 79 48 125 0.954 6.17 4.11 Intr - 75356 75210 147 2 0 140 83 129 0.958 18.03 4.10 Intr - 76340 76217 124 1 1 80 76 209 0.999 19.49 4.09 Intr - 76876 76743 134 1 2 67 100 150 0.999 13.54 4.08 Intr - 83282 83149 134 0 2 137 84 188 0.970 23.66 4.07 Intr - 83914 83763 152 0 2 80 85 165 0.942 15.21 4.06 Intr - 86620 86505 116 1 2 91 98 158 0.743 16.35 4.05 Intr - 89278 89234 45 1 0 92 87 47 0.881 3.61 4.04 Intr - 89582 89409 174 2 0 61 75 45 0.535 0.64 4.03 Intr - 90939 90911 29 1 2 123 72 32 0.499 2.93 4.02 Intr - 91738 91693 46 1 1 84 91 -5 0.281 -2.62 4.01 Init - 92236 92234 3 1 0 97 101 0 0.514 2.20 4.00 Prom - 93161 93122 40 -4.36 5.32 PlyA - 93685 93680 6 1.05 5.31 Term - 100309 99998 312 1 0 123 37 501 0.953 43.50 5.30 Intr - 101825 101794 32 0 2 118 94 -23 0.893 -0.85 5.29 Intr - 107484 107313 172 0 1 105 69 126 0.990 11.92 5.28 Intr - 109542 109360 183 0 0 64 92 391 0.998 37.08 5.27 Intr - 111533 111426 108 2 0 102 111 37 0.995 7.88 5.26 Intr - 111888 111677 212 1 2 76 72 582 0.867 54.03 5.25 Intr - 113574 113389 186 1 0 81 78 63 0.938 4.26 5.24 Intr - 114149 113936 214 2 1 103 85 431 0.998 42.59 5.23 Intr - 116759 116568 192 2 0 136 94 447 0.999 49.79 5.22 Intr - 117321 117215 107 1 2 75 58 258 0.954 21.43 5.21 Intr - 121383 121296 88 0 1 101 116 105 0.999 14.14 5.20 Intr - 121761 121582 180 0 0 84 92 405 0.999 40.56 5.19 Intr - 124098 123940 159 0 0 24 83 88 0.733 2.08 5.18 Intr - 130109 129875 235 1 1 125 80 469 0.862 47.49 5.17 Intr - 131307 131066 242 0 2 107 74 563 0.996 53.15 5.16 Intr - 143235 142993 243 0 0 104 119 606 0.977 62.99 5.15 Intr - 146828 146614 215 0 2 73 100 498 0.975 47.93 5.14 Intr - 149594 149436 159 0 0 73 100 323 0.920 31.96 5.13 Intr - 150594 150427 168 1 0 86 86 140 0.988 13.52 5.12 Intr - 156511 156452 60 2 0 83 80 47 0.794 2.11 5.11 Intr - 159586 159461 126 2 0 88 121 114 0.956 15.35 5.10 Intr - 172262 172137 126 0 0 126 99 245 0.999 30.05 5.09 Intr - 173532 173275 258 1 0 114 115 330 0.990 35.73 5.08 Intr - 174962 174765 198 0 0 104 65 84 0.895 7.02 5.07 Intr - 181999 181802 198 2 0 151 116 336 0.997 42.02 5.06 Intr - 193991 193742 250 2 1 39 99 59 0.000 -0.89 5.05 Intr - 197035 196970 66 1 0 56 99 54 0.004 2.40 5.04 Intr - 209706 209535 172 2 1 58 80 62 0.216 2.35 5.03 Intr - 214594 214454 141 0 0 22 66 112 0.075 1.87 5.02 Intr - 218743 218671 73 2 1 39 64 82 0.217 -0.64 5.01 Init - 220727 220529 199 2 1 101 115 329 0.932 35.96 5.00 Prom - 231012 230973 40 0.24 6.00 Prom + 231863 231902 40 -3.26 6.01 Init + 241101 241150 50 2 2 104 94 23 0.898 4.92 6.02 Intr + 241263 241469 207 0 0 78 49 109 0.607 4.19 6.03 Intr + 255009 255174 166 0 1 17 64 112 0.024 1.66 6.04 Intr + 257414 257494 81 1 0 73 89 26 0.044 1.03 6.05 Intr + 266251 266375 125 0 2 76 80 33 0.027 0.68 6.06 Intr + 269297 269576 280 2 1 20 44 226 0.028 8.88 6.07 Intr + 270438 270513 76 2 1 82 13 74 0.275 -1.61 6.08 Intr + 271683 271799 117 1 0 28 100 80 0.258 3.64 6.09 Intr + 277281 277471 191 1 2 80 25 147 0.577 6.90 6.10 Intr + 277569 277637 69 2 0 72 76 85 0.622 5.08 6.11 Intr + 278216 278350 135 1 0 75 59 70 0.628 3.56 6.12 Term + 280806 280964 159 2 0 31 47 117 0.652 -0.16 6.13 PlyA + 282600 282605 6 1.05 7.03 PlyA - 283008 283003 6 1.05 7.02 Term - 283097 283073 25 1 1 129 48 9 0.788 -1.30 7.01 Init - 284751 284633 119 0 2 73 86 178 0.853 15.77 7.00 Prom - 288206 288167 40 -2.46 8.05 PlyA - 293301 293296 6 1.05 8.04 Term - 293721 293635 87 0 0 69 48 69 0.504 -1.34 8.03 Intr - 294116 293905 212 0 2 97 80 128 0.698 11.53 8.02 Intr - 299118 299025 94 0 1 100 43 7 0.076 -3.06 8.01 Intr - 305317 305223 95 2 2 130 84 44 0.076 7.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 189780 189837 58 2 1 87 85 44 0.840 5.47 S.002 Init + 257105 257180 76 1 1 99 103 36 0.822 7.46 S.003 Sngl + 310285 310824 540 0 0 60 39 215 0.942 9.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:10228817_10549543|GENSCAN_predicted_peptide_1|201_aa KDLLLSDIPSSTASLCSRKTGVENVMAKEICQKYLEKGAGRLPEDCAEALATAACLCLRR RNTSLQEVCGSVAAVEERLRGRETLLPWSGLSEGTGSSSNTPEETDDVDNSSLDASSSMS VAPWAGAATPLLPTENGEGRLRVIVGREADSSSEACVGLEPPQDVTETSWQIEINEAKRK LMENILLYKEEKVDSIELFGP >gi568815595r:10228817_10549543|GENSCAN_predicted_CDS_1|606_bp aaggacttactcctcagtgatattccaagcagcaccgcctcgctctgctccaggaagacg ggcgtggagaacgtgatggcaaaggagatctgccagaagtacctggagaagggcgcaggg aggcttccggaggactgcgccgaggccctggccacggctgcctgcctgtgcctgcggagg cgtaacaccagcctgcaggaggtgtgtggctctgtggctgctgtggaagagcggctccga ggtcgggagacgttgctcccttggagtgggctttctgagggtacaggctcttcttccaac accccagaggaaacagacgacgttgacaattccagccttgatgcctcctcctccatgagt gtggcaccctgggcaggggctgccaccccacttctccccacagagaatggggaaggaagg ctgcgggtcatcgtgggaagggaggctgactcctcctctgaggcctgtgttggcctggag cctccccaggatgttacagaaacttcgtggcaaattgagatcaatgaggccaaaaggaaa ctgatggagaatattctgctctacaaagaggaaaaagtggacagcattgagctctttggc ccctga >gi568815595r:10228817_10549543|GENSCAN_predicted_peptide_2|761_aa MASERGKVKHNWSSTSEGCPRKRSCLREPCDVAPSSRPAQRSASRSGGPSSPKRLKAQKE DDVACSRRLSWGSSRRRNNSSSSFSPHFLGPGVGGAASKGCLIRNTRGFLSSGGSPLRPA NASLEEMASLEEEACSLKVDSKDSSHNSTNSEFAAEAEGQNDTIEEPNKVQKRKRDRLRD QGSTMIYLKAIQGILGKSMPKRKGEAATRAKPSAAEHPSHGEGPARSEGPAKTAEGAARS VTVTAAQKEKDATPEVSMEEDKTVPERSSFYDRRVVIDPQEKPSEEPLGDRRTVIDKCSP PLEFLDDSDSHLEIQKHKDREVVMEHPSSGSDWSDVEEISTVRFSQEEPVSLKPSAVPEP SSFTTDYVMYPPHLYSSPWCDYASYWTSSPKPSSYPSTGSSSNDAAQVGKSSRSRMSDYS PNSTGSVQNTSRDMEASEEGWSQNSRSFRFSRSSEEREVKEKRTFQEEMPPRPCGGHASS SLPKSHLEPSLEEGFIDTHCHLDMLYSKLSFQGTFTKFRKIYSSSFPKEFQGCISDFCDP RTLTDCLWEELLKEDLVWGAFGCHPHFARYYSESQERNLLQALRHPKAVAFGEMGLDYSY KCTTPVPEQHKVFERQLQLAVSLKKPLVIHCREADEDLLEIMKKFVPPDYKIHRHCFTGS YPVIEPLLKYFPNMSVGFTAVLTYSSAWEAREALRQIPLERIIVETDAPYFLPRQVPKSL CQYAHPGLALHTVREIARVKDQPLSLTLAALRENTSRLYSL >gi568815595r:10228817_10549543|GENSCAN_predicted_CDS_2|2286_bp atggcgtccgagcggggcaaggtcaagcacaactggagcagcacgtcggaagggtgtccc cgcaagcgcagctgcctccgggagccctgtgatgtggccccctccagccggccagctcag aggtctgcgtcgcgttctggagggcccagcagccccaagcgcctgaaagcccagaaggag gacgatgtggcttgctcgcggaggttatcctggggctcatcccgccgcagaaataactcc tcctcctccttctccccacatttcttgggccctggtgtgggcggggccgcctccaaaggc tgcctgattcggaacactcgggggttcctgtcttcagggggatcccctctgcgtcctgcc aacgcctctttggaagaaatggcttctctagaggaggaagcctgcagccttaaggttgat tccaaagatagttctcataactccacaaactctgaatttgcagctgaagctgagggtcag aatgatacaattgaggaacccaacaaggtccagaaaaggaagagggatagacttcgagac cagggctccacaatgatctacctgaaggctatccagggcatcctggggaaatcgatgcca aaaaggaagggagaggctgccactcgggcaaaaccaagcgcagcagagcatcccagccat ggagaaggaccagccaggagtgaaggaccagccaagactgcagaaggagcagccaggagt gtcacagtcactgctgctcagaaggagaaagacgcaaccccagaggtcagcatggaggag gataagacagtgccagagaggagcagcttctatgacaggagagtagttatagaccctcaa gagaaacccagtgaggagccccttggggaccgaaggactgtcattgacaaatgctctcca cccctagagttcttggatgactctgactctcatttagaaatccaaaagcataaagatagg gaggtggtgatggagcacccctcttctggaagtgactggtctgatgttgaggagatctcc acagtcagattctctcaggaggaacctgtctccctgaaaccttcagccgttccggagcct tcttccttcaccaccgactatgtcatgtaccctcctcatttgtacagtagtccttggtgt gactacgccagctattggaccagcagccccaagccttctagctacccctccacaggcagc agcagcaacgatgcagcccaggttgggaagagcagccggagccgcatgagtgattattcc cccaactctacagggagtgtccaaaacacctccagagacatggaggcctcagaggaaggc tggtcccagaattctcgttcatttcgcttctccagaagctcagaagaaagagaggtgaag gagaaaagaacattccaagaggagatgcctccgcgtccttgtggaggacacgcatccagc tccctgccaaagagccacctggagccaagcctagaggagggcttcattgacactcattgt cacctggacatgctctattccaagctatctttccaagggacctttacaaagttcagaaaa atttacagcagctccttccctaaggaatttcagggctgcatctctgacttctgtgatccc cgcaccctgacagattgcctatgggaggagctgttgaaagaggatctggtctggggggcc tttggctgtcaccctcattttgcacgttactacagtgagagtcaagaaagaaatcttttg caagccttaaggcaccctaaggctgtggcatttggagaaatgggcttggattactcttac aagtgcaccacgcctgtcccagaacagcacaaggtatttgagagacagctgcagctggct gtgtctctaaagaagcccttggtgatccactgccgagaagctgatgaagatctgctagaa atcatgaaaaagtttgtgccccctgactacaagatccataggcattgcttcaccggcagc tacccggtcattgagcccctgctgaagtactttcccaacatgtctgtgggcttcacggca gtgctgacatactcctctgcctgggaggcccgggaagccttgaggcagatcccactggag agaatcatcgtggaaacggatgctccctatttcctccctcgccaggttcccaaaagcctt tgccagtatgcccacccgggcctggccttgcatacggtccgagagattgccagagtcaaa gatcagccactctccctcaccttggctgccttgcgtgagaacaccagtcgcctctacagt ctttaa >gi568815595r:10228817_10549543|GENSCAN_predicted_peptide_3|354_aa MKAGPQETRNRKESHRLERTCFLSPTKFQKGEWGFVLTLKELPNWAWLLFTLEKNEEGQR GSFRMTPPAMVLPSISSSFLNSVRTRTFLVPKECTSAPRKLGHLWDGVAGLEQTPVILYK DLTATRHHLRQELQTRHELSEPGTKGTGEDPQPLSSPAARCSALNGLGEENTHPSSSTLG GPPVCNPAEAMPSPGTVCSLLLLGMLWLDLAMAGSSFLSPEHQRVQQRKESKKPPAKLQP RALAGWLRPEDGGQAEGAEDELEVRVGTSAVLCFCGSEEGGFNAPFDVGIKLSGVQYQQH SQALGKFLQDILWEEAKESLFPESPSLSWISIMDNECSSAPPHINGSGTNETSS >gi568815595r:10228817_10549543|GENSCAN_predicted_CDS_3|1065_bp atgaaggcaggccctcaagagacccgcaacagaaaggaaagccaccgactggagagaacc tgcttcctgagccccaccaagttccaaaaaggagaatggggcttcgtcctcaccctcaag gagttgccaaactgggcttggttattattcacacttgaaaagaatgaggaagggcagcga ggtagcttccggatgactcctcccgccatggttcttccgtccatcagctcttccttcctc aactctgtgaggacaagaacatttttagttcccaaggaatgtacatcagccccacggaag ctaggccacctctgggatggggttgctggtttagaacaaacgccagtcatcctatataag gacctgacagccaccaggcaccacctccgccaggaactgcagactcgccatgagctctca gagcctgggacaaaaggcaccggggaggacccccagcccctatcaagtcctgctgccagg tgttctgccctgaacggccttggagaagagaacacacatccatcatcttctaccttggga ggcccacctgtctgcaacccagctgaggccatgccctccccagggaccgtctgcagcctc ctgctcctcggcatgctctggctggacttggccatggcaggctccagcttcctgagccct gaacaccagagagtccagcagagaaaggagtcgaagaagccaccagccaagctgcagccc cgagctctagcaggctggctccgcccggaagatggaggtcaagcagaaggggcagaggat gaactggaagtccgggtcggtacctctgcagttttatgcttctgtggcagcgaggagggt gggttcaacgccccctttgatgttggaatcaagctgtcaggggttcagtaccagcagcac agccaggccctggggaagtttcttcaggacatcctctgggaagaggccaaagaaagcctc ttcccggagtcccccagtctgtcctggatctccatcatggacaatgagtgctcctctgca cccccccatatcaatggcagtggaaccaatgagaccagcagctga >gi568815595r:10228817_10549543|GENSCAN_predicted_peptide_4|405_aa MLPAEMIELLLGVGPPEVWFINPDWWGSIRVYHPSPSGSYRTHKPPLNNLSCDTSIIREL LKGKKFLLFVSVYPVPSTVLGRKKVSVINTVDTSHEDMIHDAQMDYYGTRLATCSSDRSV KIFDVRNGGQILIADLRGHEGPVWQVAWAHPMYGNILASCSYDRKVIIWREENGTWEKSH EHAGHDSSVNSVCWAPHDYGLILACGSSDGAISLLTYTGEGQWEVKKINNAHTIGCNAVS WAPAVVPGSLIDHPSGQKPNYIKRFASGGCDNLIKLWKEEEDGQWKEEQKLEAHSDWVRD VAWAPSIGLPTSTIASCSQDGRVFIWTCDDASSNTWSPKLLHKFNDVVWHVSWSITANIL AVSGGDNKVTLWKESVDGQWVCISDVNKGQGSVSASVTEGQQNEQ >gi568815595r:10228817_10549543|GENSCAN_predicted_CDS_4|1218_bp atgctcccagcagagatgattgaactgttgctcggggtagggcccccagaagtgtggttc atcaaccctgactggtggggctcaattcgtgtctaccaccccagcccctcaggctcctac aggacacacaaaccaccattaaataatttatcttgcgacactagcattattcgcgagctt ctcaagggcaaaaagttcctcttatttgtgtctgtatatccagttcctagcacagtgctt ggcagaaagaaggtgtcagtaattaacactgtggatacctcccatgaggacatgattcac gacgcccagatggactactatggcacccgcctggcaacctgctcatcagacaggtccgtc aaaatctttgatgtgcgcaatggagggcagatccttatcgccgacctcaggggtcatgag ggtcctgtgtggcaagtggcctgggctcaccccatgtacggcaacatcctggcatcgtgc tcctatgaccggaaagtcattatctggagagaggaaaacggcacctgggagaagagccac gagcatgcgggacacgactcctcagtgaactcggtgtgctgggccccccatgactacggc ctgatcctggcctgtgggagctcggatggggccatctccctgctgacttacaccggggaa ggccaatgggaagtaaagaagatcaacaacgctcacaccattggctgcaatgccgtcagc tgggcccctgctgttgtacctggaagcctcatagaccacccatcggggcagaaacccaat tacatcaagaggtttgcatcaggtggctgtgacaacctcatcaagctgtggaaggaggag gaggacggccagtggaaggaggagcagaagctagaagcgcacagtgactgggttcgagat gtggcctgggccccctccatcggcctgcccaccagcaccatcgccagctgctcccaggat ggtcgtgtgttcatttggacctgtgatgatgcctcaagcaatacgtggtcccctaaattg ttgcacaagttcaacgatgtggtgtggcatgtgagctggtccatcacagccaacatcctg gctgtctctggtggagacaataaggtgaccctgtggaaggagtcagttgatgggcagtgg gtgtgcatcagtgatgtcaacaagggccagggctccgtatcagcatcagtgacagagggc cagcagaacgagcagtga >gi568815595r:10228817_10549543|GENSCAN_predicted_peptide_5|1757_aa MGDMTNSDFYSKNQRNESSHGGEFGCTMEELRSLMELRGTEAVVKIKETYGDTEAICRRL KTSPVEAPVPGEKLMGTVAPLSENQPTTRNSLKMEWADRSDLVTCSTRGHGSVHQDPKEI GRGKENGGEAVDLLQNGRERAGSRQRRCQRSEESVTCRYSVMRVSDTGLAGSWTEQSPGS RTKEKRPTLRRAGCWGRVRIEGGGNRETGKVAAEAIQEHSLCVPVLCHVHMGAHKRMDIA STCLQRNHPALHYAHSNPQSSATSTVVCHICVCMCTSRCQVCYEHITVNELGDKCYQGGS GLPGTAPDLEKRKQIFGQNFIPPKKPKTFLQLVWEALQDVTLIILEIAAIISLGLSFYHP PGEGNEGTLLSASLRVKVDAGCTAGLAGYPVPHLAPVEGNCSEFCRKDDPMLLEIPFHGN GRSPGPRGNEGLGCATAQGGAEDEGEAEAGWIEGAAILLSVICVVLVTAFNDWSKEKQFR GLQSRIEQEQKFTVVRAGQVVQIPVAEIVVGDIAQVKYGDLLPADGLFIQGNDLKIDESS LTGESDQVRKSVDKDPMLLSGTHVMEGSGRMLVTAVGVNSQTGIIFTLLGAGGEEEEKKD KKAADGAAASNAADSANASLVNEGCSQHGPPFICDINVCGLLRLRDDRPCVVCSLLSVKI LCRVGKMQDGNVDASQSKAKQQDGAAAMEMQPLKSAEGGDADDRKKASMHKKEKSVLQGK LTKLAVQIGKAGLVMSAITVIILVLYFTVDTFVVNKKPWLPECTPVYVQYFVKFFIIGVT VLVVAVPEGLPLAVTISLAYSVKKMMKDNNLVRHLDACETMGNATAICSDKTGTLTTNRM TVVQAYVGDVHYKEIPDPSSINTKTMELLINAIAINSAYTTKILPPEKEGALPRQVGNKT ECGLLGFVLDLKQDYEPVRSQMPEEKLYKVYTFNSVRKSMSTVIKLPDESFRMYSKGASE IVLKKCCKILNGAGEPRVFRPRDRDEMVKKVIEPMACDGLRTICVAYRDFPSSPEPDWDN ENDILNELTCICVVGIEDPVRPESSSVFETKNLNSSFWRQEPDKVCVPRKQQLFAVLRGR RQQGATAQFTAIMRMLVPEAIRKCQRAGITVRMVTGDNINTARAIAIKCGIIHPGEDFLC LEGKEFNRRIRNEKGEIEQERIDKIWPKLRVLARSSPTDKHTLVKGIIDSTHTEQRQVVA VTGDGTNDGPALKKADVGFAMGIAGTDVAKEASDIILTDDNFSSIVKAVMWGRNVYDSIS KFLQFQLTVNVVAVIVAFTGACITQDSPLKAVQMLWVNLIMDTFASLALATEPPTETLLL RKPYGRNKPLISRTMMKNILGHAVYQLALIFTLLFVAPYPGPVEGTLSKASWGRAYTLTP PPTLTMLPLGLPGWVFHSVRQAHLPPPHITNEGTKAQRGEKMFQIDSGRNAPLHSPPSEH YTIIFNTFVMMQLFNEINARKIHGERNVFDGIFRNPIFCTIVLGTFAIQIVIVQFGGKPF SCSPLQLDQWMWCIFIGLGELVWGQVIATIPTSRLKFLKEAGRLTQKEEIPEEELNEDVE EIDHAERELRRGQILWFRGLNRIQTQIEVVNTFKSGASFQGALRRQSSVTSQSQDVANLS SPSRVSLSNALSSPTSLPPAAAGRLWCRAATALQIRVVKAFRSSLYEGLEKPESRTSIHN FMAHPEFRIEDSQPHIPLIDDTDLEEDAALKQNSSPPSSLNKNNSAIDSGINLTTDTSKS ATSSSPGSPIHSLETSL >gi568815595r:10228817_10549543|GENSCAN_predicted_CDS_5|5274_bp atgggtgacatgaccaacagcgacttttactccaaaaaccaaagaaatgagtcgagccat gggggcgagttcgggtgcacaatggaggagctccgctccctcatggagctgcggggcact gaggctgtggtcaagatcaaggagacttatggggacaccgaagccatctgccggcgcctc aaaacctcacctgttgaagccccagtccctggggagaagctgatgggcactgtggcccct ctctcagaaaatcagccaaccacacgcaacagcttaaagatggagtgggctgatagatca gacctggtcacatgctcaaccagagggcacgggtcagtccaccaagaccccaaggagatt ggaagagggaaggaaaatggaggagaggctgtggacctgctgcagaatggaagggagaga gcaggcagcaggcaaagaagatgtcagaggtcggaggagagtgtcacatgcaggtacagt gtgatgagagtatctgacactggattggcaggcagttggacagagcagtcgccaggaagt aggacgaaggagaaacggcccacactgaggagggcaggctgttggggtagagtacgaatt gagggtggtgggaacagggagacaggcaaggtggctgctgaggccatccaggaacacagc ctctgtgtgcccgttctgtgccatgtccacatgggtgcacataagcgaatggacattgcg tccacatgcttgcaaagaaaccacccagcgctgcactatgcacattcaaatccacaatct agtgccacatccacagttgtgtgccacatctgtgtgtgcatgtgcacgtctagatgtcag gtctgctatgagcacataactgtgaatgagctgggtgacaaatgctatcagggaggatca ggtttgccgggcaccgctccagacctggaaaagagaaagcaaatttttgggcaaaacttt atacctccaaagaagccaaaaaccttcctgcagctcgtgtgggaggcgctgcaggacgtg acgctcatcatcctggagattgccgccatcatctccctggggctgtccttctaccacccg cccggcgagggcaacgaaggtacccttctctcagccagcttgagggttaaagtggatgct gggtgcacagcaggccttgcagggtatccagtgccccacctggcccccgtggaggggaac tgctctgagttttgcaggaaggatgatccgatgctgctggagattcctttccatggaaat ggccgctccccaggtcccagaggaaatgaaggcctgggatgtgcgacggcccagggtggg gcagaggatgaaggagaggcagaggcaggttggatcgagggggccgccattctcctctca gttatctgtgtggtcctggtcacggccttcaatgactggagcaaagagaaacagttccgg ggcctgcagagccgcatcgagcaggaacagaaatttaccgtggtccgggctggccaggtg gtccagatccctgtggctgagatcgtggttggggacatagcccaggtcaaatatggtgac ctcctccctgccgacggcctcttcatccagggcaatgacctcaagattgatgaaagctcc ctaactggagagtctgaccaggtgcgcaagtccgtggacaaggaccccatgctgctgtca ggaacccacgtgatggagggctcaggacggatgttggtgactgctgtgggtgtgaactct cagactggcatcatctttaccctcctgggggctggtggtgaagaggaagagaagaaagac aaaaaagcagcagacggtgcggcagcttcaaatgctgcagatagtgcgaatgccagccta gtcaatgagggatgcagtcagcacggcccgcccttcatctgtgacattaatgtctgtggc ctgttgcgtcttcgtgatgaccgaccatgtgttgtctgttctctgttgtctgttaaaatt ctttgtcgtgtaggtaaaatgcaggatggcaatgtggacgccagccagagcaaagccaaa caacaggacggggcagccgccatggagatgcagcccctcaagagtgccgagggcggcgac gctgacgacaggaagaaggccagcatgcacaagaaggagaagtccgtgctgcagggcaag ctcaccaagctggctgtgcagatcgggaaggcgggcttggtgatgtcagccatcacggtg atcatcctggtgctctacttcactgtggacaccttcgtggtcaacaagaagccgtggctg cctgagtgcacgcccgtctacgtgcagtactttgtcaagttcttcatcattggcgtgacg gtgctggtggtcgccgtgcccgaggggctccctctggccgtcaccatctcgttggcctat tcggtgaagaaaatgatgaaggacaacaacctggtacgccacctggatgcctgtgagacc atgggcaatgccacagccatctgctcagacaagacaggcacgctgaccaccaatcgcatg acagtggtacaggcctatgtcggcgacgtccactataaagagatccccgaccccagctcc atcaacaccaagaccatggagctgctgatcaatgccatcgccatcaacagcgcctacacc accaagattctgcccccagagaaggagggcgccctgcctcggcaggtgggcaacaagacg gagtgcggcctgctgggcttcgtgctggacctgaagcaggactacgagcccgtgcgcagc cagatgccagaggagaagttgtacaaagtgtacaccttcaactccgtgcgcaagtccatg agcactgtcatcaagctgcccgacgagagcttccgcatgtacagcaagggggcttctgag atcgtgctcaagaagtgctgcaaaatcctcaatggggcgggagagcctcgtgtcttccgg ccccgcgaccgggacgagatggtaaagaaggtgattgagcccatggcttgcgatgggctc cgcactatctgcgtggcctaccgcgacttccccagcagcccggagccggactgggacaat gagaatgacatcctcaacgaactcacctgcatctgcgtggtgggcatcgaggacccggtg cggccagagtcctccagcgtctttgaaaccaagaacctaaattcctccttttggaggcag gagccagataaggtgtgcgttcccaggaaacagcagctctttgctgttctcaggggcagg agacagcagggggccacagcccagtttacagccatcatgaggatgttagtcccagaagcc atccgcaagtgccagcgggcaggcatcacggtccgcatggtcactggcgacaatatcaac acggctcgggccatcgccatcaagtgtggcatcatccatcctggggaggactttctgtgc ctcgagggcaaggagttcaacaggaggatccgcaacgagaagggggagattgagcaggag cgaattgacaagatctggccaaagctgcgggtgctggctcgctcctccccaacggacaag cataccctggttaaaggcatcatcgacagcacacacactgagcagcggcaggtggtggcc gtgacgggggacgggaccaacgacgggcctgcactcaagaaggccgacgtgggcttcgcc atgggcatcgcaggcactgacgtggccaaggaggcctcagacatcatcctgacagacgac aatttcagcagcatcgtcaaggcagtgatgtggggccgcaacgtctatgacagcatctcc aaattcttgcagttccagctcaccgtcaacgtggtggccgtgattgtggccttcacaggc gcctgcatcacgcaggactcccctctgaaggccgtgcagatgctctgggtgaacctcatc atggacacgtttgcctcgctggcactggccactgagccgcccacggagaccctgctgctg aggaagccgtacggccgcaacaagccgctcatctccaggaccatgatgaagaacatcctg ggccatgctgtctaccagcttgccctcatcttcaccctgctctttgttgcaccttatcct ggccccgtggaagggaccctcagcaaggcttcctggggcagggcatacacgctcacacca ccaccgactctcacaatgcttccacttgggctgcctggttgggtcttccactcagtgagg caagcacatttgcctccaccccacattacaaacgagggcaccaaggctcagagaggcgag aagatgttccagatcgacagcgggaggaacgcgcccctgcattcgccaccctcagaacat tacaccatcatcttcaacaccttcgtcatgatgcagctcttcaacgagatcaacgcccgc aagatccacggcgagcgcaatgtctttgacggcatcttccggaaccccatcttctgcacc atcgtgctgggcacctttgccatccagatagtgatcgtgcagtttggagggaagccattc agctgctctccactgcagctggaccagtggatgtggtgcatattcattgggttaggagag ctcgtttggggccaggtcatcgccaccatcccgaccagcagactcaagttcctcaaggag gcaggcaggctcacacagaaggaggagatcccggaggaggagctcaacgaggacgtggag gagatcgaccacgcggagcgggagctgcggcggggccagatcctgtggttccgaggcctg aatcggatccagacacagattgaagtagtcaatactttcaagagcggggcctcctttcag ggggccctgcggagacagtcctcggtcaccagccagagccaggatgtagccaatctctct agccctagtcgcgtgtcgttgtccaatgctctttcctctccgaccagccttcctcctgct gcagccgggcgcctgtggtgcagggcagccactgctctgcagatccgcgtcgtgaaggcg ttccgtagctctctctatgaaggtttagaaaagcctgaatctcgaacctccatccataac ttcatggctcatcctgaattccggatcgaagattcccagccccacatccccctcattgat gacaccgacctggaagaagatgccgcgctcaagcagaactcgagcccgccgtcatccctc aacaagaacaacagcgccatcgacagtgggatcaacctgacgaccgacacaagcaaatca gctacctcttcaagtccagggagccccatccacagcctggagacgtcgctttag >gi568815595r:10228817_10549543|GENSCAN_predicted_peptide_6|551_aa MQDVTHTKLRFQEPRPRWENRPGREKESLEVTQGVIKEGRAVSDEEVEIEIAVRKVTTMT SIAATFRTRGYAPSATVAHLTLKQHRQMLEPPTEGLGGRCIGVSPRHRAGHPEAQGSRRS RDKGVSPRGIHGRTKKDAPWGNGDFTIKCLELLQGFNEMTICFSNSSQGGRIKTKDMVNS WGPGLLLLKSQSQGHPELVPVGSCIGTWARNVKMYILESAQFSLRHNGLMRQAVLSSPFD SGGNGGLEIKDMQLVSGQALDVTPSSLGPDVVLNYCATSTLSQAHVGWRALPMGTDQKTS EGSARKHPLSLYLDHTRAKPCTWILFASRWALNPMTNVFRSRGEDTERHTGRSEVKMKAE MGVTQPQVKAPPDLGGKGAKVANGEKSTQHILNTSVFPGPVELSPQQLSAALKRYPCDPP CLCATPPALGGDIYDELQADCGLTDTSTGTQGPRPPAPWKGAHLPLSPQGGLNAGRMLRS YDVKGRVVIQGLGLHWGKGLPKFEAHNSGSLAPDLDGPWTVDPGDHTGWLARNLLSSHME ASHPLPTPTKA >gi568815595r:10228817_10549543|GENSCAN_predicted_CDS_6|1656_bp atgcaggatgtcactcacaccaagctcagatttcaggagcccagacccagatgggaaaac agacctgggagggaaaaggaatcacttgaagtgacccagggagttatcaaggaaggcagg gctgtctctgacgaggaggttgaaatagaaatagcagttcggaaagtgacaacaatgaca agtatagctgctaccttcaggacccggggctatgcaccttctgccactgttgcccatctc accctcaaacaacacagacagatgctggagccacccacggaaggacttgggggccggtgc attggcgtcagcccccggcacagggccgggcacccagaggcccagggatcaaggcggtcc agggataaaggggtctcccctcgagggattcatggaagaacaaagaaagatgccccctgg ggaaatggggatttcaccatcaagtgcctagaactgttgcagggatttaatgagatgaca atctgtttttcaaacagcagccagggaggcaggataaagaccaaggacatggtcaattcc tgggggccaggcctgttgctgctgaagagtcagtctcaaggacaccctgaacttgtcccc gtgggcagttgcattgggacatgggccagaaatgtcaaaatgtatatcttggaatccgca cagttcagtctccgacacaacggcctgatgaggcaggcagtgttgtcatctccatttgac agtggagggaacggaggcttggagattaaggacatgcagctggtcagcggccaggcattg gatgtgaccccaagcagcctgggcccagatgttgtcctcaactactgcgctacatcaacg ctaagtcaggcccacgtaggatggcgtgccctgcccatgggcacagatcaaaaaacttca gaaggatcagcaagaaaacatccgctgagcctctacctggaccacacccgggccaagccc tgtacttggatcctgtttgcctcacgttgggccctaaatccaatgaccaatgtctttaga agcagaggagaggacaccgagagacacacaggaaggagtgaggtgaagatgaaagcagag atgggtgtgacgcagccacaagtcaaggcacccccagacctggggggcaaaggggcaaag gtcgcaaatggggagaagtcgactcaacacattttaaacacatctgtgtttccagggcca gtggagctgagcccgcagcagctctctgctgccctgaagcgctacccctgcgacccgccc tgcctgtgtgcgaccccacctgccctcgggggggacatctatgatgagttgcaggccgac tgtgggctgactgacacttccacgggaacccagggaccccggcccccagccccatggaag ggtgcccacctgccccttagcccgcagggtggcctcaacgcagggaggatgctgcgaagc tacgatgtcaagggccgggtggtcattcaaggccttggcctccactggggcaagggttta cccaagtttgaagcacacaactcaggatctctagccccagaccttgacgggccgtggact gtggatccaggggaccacacgggctggcttgcccgaaatctcctctccagtcacatggaa gcctcgcatccactccccactcccaccaaggcctga >gi568815595r:10228817_10549543|GENSCAN_predicted_peptide_7|47_aa MTVLHCRIADAVTRRALTEGFVFIFGFHPENPSQLPPFPGEAAFQQE >gi568815595r:10228817_10549543|GENSCAN_predicted_CDS_7|144_bp atgactgtgctgcactgccgcattgcagatgctgttacacgcagggccctcaccgagggc ttcgtgttcatcttcgggttccatcctgaaaacccttcccagctgccccccttcccaggg gaggcagcctttcagcaagagtga >gi568815595r:10228817_10549543|GENSCAN_predicted_peptide_8|162_aa XLNFVTQSLEPARHGPQFPDPDTSHLTRRRGGARKGGSPAGSQASVPWPPCRVWPYAADF CSDFLRATLGADIIVTISAAEEPQALGENMRHPRSPPGVSVTLSKPHYLSEPPASSSVKG GSGHLPQAAVRLYELTSPTLASAEAMHTESYFQWNDQEVEGR >gi568815595r:10228817_10549543|GENSCAN_predicted_CDS_8|489_bp nagctgaacttcgtcacccagtccctagagccagcaagacatgggccccagtttccagac cctgacacctctcatttaaccagaagacgtggaggggccagaaagggaggcagccctgct ggcagccaggcctcggtgccgtggccaccttgcagggtgtggccatacgctgcagacttc tgcagtgacttcctcagagcaaccctgggtgccgacatcatcgtcaccatctctgcagct gaagaacctcaggctctgggagaaaacatgaggcatccaaggtcacccccaggtgtcagt gtgaccttgagcaagccccattacctctctgagcctccagcttcttcatctgtaaaagga ggttcaggacacctgccccaggctgctgtgaggctctatgagctaacaagtccaacactg gcttcagcagaagctatgcacacagaaagttacttccagtggaatgaccaggaggtagaa gggagatag