GENSCAN 1.0	Date run: 4-Nov-116	Time: 11:40:44

Sequence gi568815575f:153404632_153608442 : 203811 bp : 56.87% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6783   7072  290  2  2   40  110   103 0.474   2.29
 1.02 Intr +   7334   7382   49  2  1   75  105    42 0.620   3.77
 1.03 Term +   7733   7828   96  1  0   73   47    60 0.443  -1.33
 1.04 PlyA +   9492   9497    6                              -0.45

 2.00 Prom +  11176  11215   40                              -1.81
 2.01 Init +  12966  13199  234  2  0   83  116    52 0.863   5.63
 2.02 Intr +  13674  13724   51  2  0   62  113    29 0.837   2.39
 2.03 Intr +  14042  14168  127  1  1   40  106   141 0.999  11.86
 2.04 Term +  16012  16997  986  2  2  115   41  1678 0.999 158.53
 2.05 PlyA +  21821  21826    6                               1.05

 3.03 PlyA -  23009  23004    6                              -0.45
 3.02 Term -  23818  23583  236  2  2   51   40    91 0.015  -2.59
 3.01 Init -  34369  34261  109  1  1   77   73   138 0.884  11.55
 3.00 Prom -  37374  37335   40                              -5.61

 4.14 PlyA -  39863  39858    6                               1.05
 4.13 Term -  40833  40089  745  2  1   75   53  1042 0.328  92.78
 4.12 Intr -  45967  45802  166  2  1   31   77    53 0.130  -1.97
 4.11 Intr -  49877  49763  115  2  1  141   84   236 0.999  29.02
 4.10 Intr -  51135  50911  225  0  0   69   80   265 0.956  22.51
 4.09 Intr -  51733  51634  100  0  1  122   78   165 0.977  19.51
 4.08 Intr -  52020  51862  159  2  0   52   76   252 0.999  20.02
 4.07 Intr -  52597  52506   92  1  2   71   88   113 0.986   8.89
 4.06 Intr -  53747  53649   99  2  0   43   47   100 0.472   2.21
 4.05 Intr -  58040  57979   62  0  2   84  116    82 0.905   9.54
 4.04 Intr -  60424  60357   68  0  2  128  100    14 0.807   5.64
 4.03 Intr -  60992  60870  123  1  0   67   43    67 0.447   0.31
 4.02 Intr -  64630  64515  116  1  2   84   52    99 0.833   5.65
 4.01 Init -  65956  65819  138  1  0   67   48   291 0.813  21.11
 4.00 Prom -  67321  67282   40                              -5.31

 5.00 Prom +  68456  68495   40                              -3.81
 5.01 Init +  69379  69433   55  0  1   74   61    -2 0.431  -2.67
 5.02 Intr +  69711  69797   87  1  0  110   33   154 0.593  12.54
 5.03 Intr +  72925  73160  236  2  2  104   71    76 0.151   5.44
 5.04 Intr +  73352  73538  187  1  1   83   81    -9 0.118  -2.32
 5.05 Intr +  74630  74755  126  0  0   81  113    52 0.161   8.16
 5.06 Intr +  75949  76374  426  2  0   96  100   209 0.996  17.24
 5.07 Intr +  76730  77017  288  0  0   95   94   134 0.988  12.46
 5.08 Intr +  77179  77294  116  2  2   15   91   141 0.999   7.77
 5.09 Intr +  77696  77828  133  1  1   64   38   220 0.659  15.32
 5.10 Intr +  78085  78110   26  2  2  116   29    31 0.673  -1.77
 5.11 Intr +  78674  78813  140  1  2   83   86   189 0.999  17.97
 5.12 Intr +  81139  81465  327  1  0  113   91   583 0.991  56.46
 5.13 Term +  81999  82179  181  0  1  123   37   172 0.989  13.00
 5.14 PlyA +  82388  82393    6                               1.05

 6.00 Prom +  86577  86616   40                              -5.91
 6.01 Init +  90522  90586   65  2  2   79   -1   169 0.246   5.67
 6.02 Intr +  99622  99749  128  1  2   71   24    96 0.312   2.33
 6.03 Intr +  99990 100283  294  1  0  134   61   338 0.779  33.23
 6.04 Intr + 100607 100719  113  0  2   17   74   268 0.350  18.90
 6.05 Intr + 101232 101445  214  2  1   86   97   571 0.999  56.61
 6.06 Intr + 101898 102008  111  1  0   34   97   223 0.976  18.65
 6.07 Intr + 102199 102292   94  2  1  115   94   131 0.997  15.92
 6.08 Intr + 102416 102554  139  2  1   96   94   166 0.998  18.97
 6.09 Term + 103617 103814  198  2  0  120   44   412 0.850  37.72
 6.10 PlyA + 104892 104897    6                              -1.95

 7.04 PlyA - 105716 105711    6                              -3.94
 7.03 Term - 107465 107362  104  0  2   81   40    90 0.467   2.24
 7.02 Intr - 112538 112202  337  2  1   40   80   186 0.683   8.65
 7.01 Init - 113511 113386  126  0  0   34   51   281 0.965  17.12
 7.00 Prom - 117056 117017   40                              -6.30

 8.00 Prom + 118331 118370   40                              -0.81
 8.01 Init + 122721 122795   75  2  0   79   34   104 0.095   3.14
 8.02 Term + 125137 125217   81  0  0  119   47    73 0.076   4.29
 8.03 PlyA + 126513 126518    6                              -3.44

 9.00 Prom + 127297 127336   40                              -1.41
 9.01 Init + 131617 131824  208  0  1   98  110   386 0.535  40.76
 9.02 Intr + 132874 132928   55  2  1  110   71    -6 0.756  -1.57
 9.03 Intr + 133534 133662  129  1  0   79   86    77 0.950   6.91
 9.04 Intr + 134028 134246  219  0  0   84   53   113 0.875   5.34
 9.05 Intr + 135875 136020  146  2  2   60   47    77 0.873   1.24
 9.06 Intr + 136728 136925  198  1  0   29  116   418 0.991  38.54
 9.07 Intr + 137038 137295  258  2  0   99  113   514 0.976  52.97
 9.08 Intr + 137692 137817  126  2  0   43   94   338 0.920  31.06
 9.09 Intr + 138412 138650  239  2  2   98   35   243 0.848  17.86
 9.10 Intr + 139876 139960   85  0  1   47   43   122 0.596   3.49
 9.11 Intr + 141457 141498   42  2  0   93   92    40 0.653   3.60
 9.12 Intr + 143204 143368  165  0  0   71   91   216 0.993  20.65
 9.13 Intr + 144009 144223  215  1  2   94   84   323 0.999  31.46
 9.14 Intr + 144866 145108  243  1  0   85   49   450 0.999  39.03
 9.15 Intr + 145414 145655  242  0  2   79   87   479 0.841  43.78
 9.16 Intr + 148404 148704  301  0  1   96   58   483 0.593  43.38
 9.17 Intr + 151418 151597  180  1  0  141   54   379 0.908  40.38
 9.18 Intr + 151700 151787   88  1  1   94   63   102 0.884   8.34
 9.19 Intr + 152286 152392  107  1  2  120  103   226 0.960  27.73
 9.20 Intr + 153481 153672  192  0  0   82   94   459 0.994  46.11
 9.21 Intr + 155098 155311  214  0  1   76   74   364 0.922  32.61
 9.22 Intr + 156045 156256  212  1  2   66   91   508 0.999  48.06
 9.23 Intr + 157504 157611  108  0  0  117  109    90 0.999  14.98
 9.24 Intr + 157641 157778  138  2  0   94   33    38 0.493   0.07
 9.25 Intr + 160290 160472  183  2  0  104  115   419 0.966  46.70
 9.26 Intr + 164972 165125  154  1  1  115   64    56 0.602   6.06
 9.27 Intr + 170103 170290  188  1  2  112   47    16 0.194  -0.17
 9.28 Intr + 175347 175597  251  2  2   86    8   566 0.540  45.17
 9.29 Term + 175862 175991  130  2  1   -6   44   262 0.619  10.36
 9.30 PlyA + 176151 176156    6                              -0.45

10.03 PlyA - 176459 176454    6                              -0.45
10.02 Term - 177428 176946  483  2  0  105   48   133 0.515   6.14
10.01 Init - 180991 180980   12  1  0  110   96    17 0.606   3.77
10.00 Prom - 182464 182425   40                              -3.71

11.06 PlyA - 183315 183310    6                               1.05
11.05 Term - 183823 183734   90  1  0  132   45    82 0.991   6.32
11.04 Intr - 188072 187875  198  2  0   93   94   346 0.928  35.77
11.03 Intr - 190048 189916  133  0  1  101   83   103 0.986  12.25
11.02 Intr - 191556 191373  184  1  1  125   95   252 0.988  28.96
11.01 Init - 194442 194331  112  0  1   52   91   136 0.926   8.66
11.00 Prom - 197895 197856   40                              -7.20

12.02 PlyA - 198394 198389    6                              -0.45
12.01 Sngl - 200520 199894  627  0  0   78   55   875 0.295  79.78
12.00 Prom - 201099 201060   40                              -8.38

13.00 Prom + 201118 201157   40                              -7.00
13.01 Init + 201886 202293  408  0  0   71   57   433 0.413  35.08

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T
CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_1|144_aa MLHAPPTTRPVPTLPPPRPRPLPARSSAHQTETPLQDALRERYPEAKSRQAQEPPPPARV SWGRGLGGLLGGGGGLGLGPAGSRKGGRLPRLPAEPSHRSRGWLKSILSLICLIPPEQAF EICQGHGGSKPCPVVLVLVAERRQ >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_1|435_bp atgctccatgccccccccacaacgcgccccgttcccacgctgccgccgccccggccccgc ccgctcccggcccgctcgtcggcgcaccagactgagactccccttcaggacgctcttcgc gagcggtacccagaggcgaagtcgagacaagcccaggagcccccaccccccgcccgcgtt tcctggggacgcggcctcggagggcttctaggaggaggcggcggccttggcctcgggcct gccgggagcaggaagggaggccgactccctcggctgcccgccgagcccagccaccggtcc cgaggctggctaaagagcatcctgtcgctgatctgcctgatacctcctgagcaagcattt gagatctgccagggacacggcggtagtaagccctgccctgttgttctcgttctagtggct gagcgcagacaataa >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_2|465_aa MGRELRKAAKEEKKGKKERAPSKAWSGERQYGSGHGTSIAESVLGLNGAPQKDILKSQLL VSVTVTSFGKGDFAAVIKLLCALVMAAILLTTRPKVPVSFEDVSVYFTKTEWKLLDLRQK VLYKRVMLENYSHLVSLGDRTQSKTSTSTQKHSGRQLPGADPQGGKEGQAARSSVLQRGA QGLGQSSAAGPQGPKGAEKRYLCQQCGKAFSRSSNLIKHRIIHSGEKPYACPECGKLFRR SFALLEHQRIHSGEKPYACPECSKTFTRSSNLIKHQVIHSGERPFACGDCGKLFRRSFAL LEHARVHSGERPYACPECGKAFSRSSNLIEHQRTHRGEKPYACGQCAKAFKGVSQLIHHQ RSHSGERPFACRECGKAFRGRSGLSQHRRVHSGEKPYECSDCGKAFGRRANLFKHQAVHG ARRPAKAETARRLAGPGSTGPGSAVAATSPPRPSTAARPSRPSRR >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_2|1398_bp atggggagagagctaagaaaagcagcaaaagaagaaaagaaaggaaagaaggaacgggct ccctcgaaagcctggtcaggcgagcgccagtatggaagtgggcatggcacctcaattgca gagagtgtcctgggtttgaatggtgccccccaaaaagacatattgaagtctcaactcctg gtatctgtgactgtgacctcatttggaaagggagactttgcagctgtaatcaagctcctc tgcgccctggtgatggcagccattctcctgaccacgagacccaaggtgccagtatctttt gaggatgtgtccgtgtacttcacaaagacagaatggaagcttctggacctcagacaaaag gtcctctacaagcgggtgatgctggagaactatagccatttggtgtcactgggtgacaga acacagagcaagacgtcgacttcaacgcagaagcattctggacgacaactccccggggcc 
gatccacaaggtggcaaggaggggcaggcggcgaggtcgtctgtgctccagagaggtgcc cagggcttggggcagagttcggctgcggggccgcagggccccaaaggcgcggagaagcgg tacctgtgccagcagtgtgggaaggccttcagccgcagctccaacctcatcaagcaccgc atcatccacagtggcgagaagccttacgcgtgccccgagtgcggcaagctgtttcgccgc agcttcgcgctcctggagcaccagcgcatccacagcggcgagaagccctacgcctgcccc gagtgcagcaagaccttcacgcgcagctccaacctcatcaagcaccaggtcatccacagc ggcgagcggcccttcgcctgcggcgactgcggcaaactgttccgccgcagcttcgcgctc ctggagcacgcgcgcgtgcacagcggcgagcggccctacgcgtgcccagagtgcggcaag gccttcagccgcagctccaacctcatcgagcaccagcgcacgcaccgcggcgagaagccc tacgcctgcggccagtgcgccaaggccttcaagggcgtctcgcagctcatccaccaccag cgcagccacagcggcgagcggcccttcgcgtgccgcgagtgcggcaaggccttccgtggc cgttcgggcctcagccagcaccggcgcgtgcacagcggtgagaagccctacgagtgcagc gactgcggcaaggccttcggccggcgcgccaacctattcaagcaccaggcagtgcacggc gccaggcgccctgcgaaggcggagacggcgcggaggctagcgggccctgggagcaccggc cctgggagcgcggtggcggccaccagccccccgcggccgagcacagccgccaggccttcc aggcccagccgccgctga >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_3|114_aa MEDQAEGGWEEGYLEAEEEDDEDAEEEQEPEVTAAGILEAKSLKSRCQQIWFLVIVLFGL DTAAFLLYPHTQWRKEALVSLPLVVGNQSHHEGPTVMTSSKPNALPEAPTPNAM >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_3|345_bp atggaagaccaggctgagggaggctgggaagaggggtatctggaggctgaagaggaagac gatgaagatgccgaggaagaacaagaacctgaggtgacggcagctggcattctggaggct aaaagtctaaagtcaaggtgccagcagatttggttcttggtaatagttctctttggcttg gacacagctgccttcttgctgtatcctcacacacagtggaggaaggaagctctagtgtct cttcctcttgttgttgggaaccaatcccatcatgagggtcccaccgtcatgacctcatct aaacctaatgcgctcccagaggccccaactcctaatgccatgtga >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_4|735_aa MGGARLGARNMAGQDAGCGRGGDDYSEDEGDSSVSRAAVEVFGKLKDLNCPFLEGLYITE PKTIQELLCSPSEYRLEILEWMCTRQGQAFTVPKVMSLDCDHRLCCGRSPHMTECRPHKP FLQRAGVWPSLQDRFSSLKGVPTEVKIQEMTKLGHELMLCAPDDQELLKHVDRPGQCCGD VAARCPQITLEHMTVVSTTVQAGCACAQKQLHFMDQLLDTIRSLTIGCSSCSSLMEHFED TREKNEALLGELFSSPHLQMLLNPECDPWPLDMQPLLNKQSDDWQWASASAKSEEEEKLA ELARQLQESAAKLHALRTEYFAQHEQGAAAGAADISTLDQKLRLVTSDFHQLILAFLQVY 
DDELGECCQRPGPDLHPCGPIIQATHQNLTSYSQLLQVVMAVADTSAKAVETVKKQQGEQ ICWGGSSSVMSLETEAMLSVQEWDRRVEGTNSGEMPPTPAVVTGSAGGEGGSLEDSPRGS GVPGTGQCLRTCYPRGNITMSEAPRAETFVFLDLEATGLPSVEPEIAELSLFAVHRSSLE NPEHDESGALVLPRVLDKLTLCMCPERPFTAKASEITGLSSEGLARCRKAGFDGAVVRTL QAFLSRQAGPICLVAHNGFDYDFPLLCAELRRLGARLPRDTVCLDTLPALRGLDRAHSHG TRARGRQGYSLGSLFHRYFRAEPSAAHSAEGDVHTLLLIFLHRAAELLAWADEQARGWAH IEPMYLPPDDPSLEA >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_4|2208_bp atgggcggggcgcggctcggagcgcgaaacatggcggggcaggacgctggctgcggccgt ggcggcgacgactactcagaggacgagggcgacagcagcgtgtccagggcggctgtggag gtgttcgggaagctgaaggacctaaactgccccttcctcgagggtctgtatatcacagag ccaaagacaattcaggaactgctgtgcagcccctcagagtaccgcttggagatcctagag tggatgtgtacccgtcaagggcaagccttcaccgtgcccaaggtgatgagcctcgactgt gaccacagactctgttgcggcaggagcccccacatgactgaatgcaggcctcacaagcct tttctgcagagggccggggtctggccctcactgcaggacaggttcagctcactgaaaggg gtcccaacagaggtgaagatccaagaaatgacgaagctgggccacgagctgatgctgtgt gcgccagatgaccaggagctcctcaagcatgtagataggccaggtcagtgctgtggggac gtggcagcccgctgcccccagatcaccttggagcacatgactgtggtcagcaccaccgtg caggcaggctgtgcctgcgcccagaagcagctacacttcatggaccagttgctcgatacc atccggagcctgaccattgggtgctccagttgctcgagcctgatggagcacttcgaggac accagggagaagaacgaggccttgctgggggagctcttctctagcccccacctgcagatg ctcctgaatccagagtgcgacccgtggcccctggacatgcagcccctcctcaacaagcag agtgatgactggcagtgggccagtgcctctgccaagtccgaggaggaggagaagctggcg gagcttgccaggcagctgcaggagagtgctgccaagttgcacgcgcttagaacggagtac tttgcacagcatgagcaaggggctgctgcgggcgcagccgacatcagcaccctagaccag aagctgcgtctggtcacttccgacttccaccagctaatcttggcttttctccaagtctac gacgacgagctgggcgagtgctgccagcgcccaggccctgacctccacccgtgcggcccc atcatccaggccacgcaccagaatctgacttcctacagccaactgctgcaagtggtcatg gcagttgctgacacctctgcgaaggccgtggagaccgtgaagaagcagcaaggcgagcag atctgctggggtggcagcagctccgtcatgagtctagaaaccgaagcaatgctgtcagtg caagaatgggacaggagagttgagggtaccaacagtggagagatgccaccaaccccggca gtggtgacgggctcagcgggcggggaaggtggcagcctagaggacagcccaaggggcagc ggggtgccagggacagggcagtgtttgaggacttgctatccccgtgggaacatcaccatg 
tccgaggcaccccgggccgagacctttgtcttcctggacctggaagccactgggctcccc agtgtggagcccgagattgccgagctgtccctctttgctgtccaccgctcctccctggag aacccggagcacgacgagtctggtgccctagtattgccccgggtcctggacaagctcacg ctgtgcatgtgcccggagcgccccttcactgccaaggccagcgagatcaccggcctgagc agtgagggcctggcgcgatgccggaaggctggctttgatggcgccgtggtgcggacgctg caggccttcctgagccgccaggcagggcccatctgccttgtggcccacaatggctttgat tatgatttccccctgctgtgtgccgagctgcggcgcctgggtgcccgcctgccccgggac actgtctgcctggacacgctgccggccctgcggggcctggaccgcgcccacagccacggc acccgggcccggggccgccagggttacagcctcggcagcctcttccaccgctacttccgg gcagagccaagcgcagcccactcagccgagggcgacgtgcacaccctgctcctgatcttc ctgcaccgcgccgcagagctgctcgcctgggccgatgagcaggcccgtgggtgggcccac atcgagcccatgtacttgccgcctgatgaccccagcctggaggcctga >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_5|775_aa MENTKAGVGALQMRQGSGCHQLLQEHIPQRSQHGTTIAITIASSGPGGSWGKLHLVLAAY FKSAVPQCYPVPASAAAPRQDTGPQPSCVTHRTHKEMVASSLTPEAFADSLDGPCFWSVR GRVQIQDGDSCLMALAGPCPSSTASLLPSTQALPTINSFLKIASKPKSTLDRAVGKASSI LALKSRASAKRSVLLPILALWAGSCSGGAPPTPMGLATLQLLPSPPGAPDGQLQPIPGIG HPDKPEAGKLDQLRDQPTPKQGAQGTPTQSPSTGWKALPRPGLALRKESPPVTLEQEQGH NKGLVAEWAQPQATAAMRAGAGKPEALKLRPWQAGRDPQAQEGAAVTEEDQGQRTGGRED KGRGLKPRRPPKGTSHQPGLRIRRPQKDRSRGQGGGGSTSKTPGHGWKRPGSTHGHRHRH ADLGTTQQAMPSLPASCLLAQAVIACGNVKMKHVPALTHPGLTTLYLAENEIAKIPAHTF LGLPNLEWLDLSKNKLDPRGLHPHAFKNLMRLKRLNLVGNSLTTVPALPASLQELKLNDN LLQGLQGSSFRGLSQLLTLEELHLGTNLIEEVAEGALSHIHSLSVLVLSHNWLQEHWLAP RAWIHLPKLETLDLSYNRLVHVPRFLPRGLRRLTLHHDHIERIPGYAFAHMKPGLEFLHL SHNRLQADGIHSVSFLGLRASLAELLLDHNQVQAIPRGLLGLKGLQVLGLSHNRIRQVPL NSICDMRVAQDSNLTSTHLENNLIDRRRIPPTAFSCTRAYHSVVLQPQRRGEEGS >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_5|2328_bp atggagaacacgaaagcaggggtgggggccttgcaaatgagacaaggaagtggctgccac cagctcctgcaagagcacattccgcagcgcagtcagcatggtaccaccatcgccatcacc atcgcctccagtgggcctggaggatcctggggcaagcttcacttagtcttggcagcatac ttcaagtctgccgtgccccagtgctaccctgtcccagcttcggcagctgcccccaggcag gacactggcccacagccatcctgtgtcacccacaggacgcacaaggagatggtggcctca 
tccctgactccagaggcatttgctgactccctagatggtccctgtttctggtctgtacgt ggcagggtccaaattcaggatggagactcctgcctgatggcccttgccggcccctgcccc tcctccactgcttcccttctcccctccacccaagccttgcccacaattaactcatttctt aagatcgcttccaaacctaagtcaacgctggacagggctgtaggaaaagcttcctcaata ctggccctgaagagccgagccagcgccaagaggagtgtgctgctccccatcctggcactg tgggcggggagctgctcaggaggggccccaccaacccccatgggcttggctaccctgcag ctgctgcccagcccaccaggggcccccgacggtcagctgcagcccatccctggcatcggc cacccagacaagcctgaggctgggaagctggaccagttgcgggatcagcccaccccgaag cagggagctcaaggaacccccacccagtccccctccactggctggaaagcgcttcccagg ccagggctggccctgaggaaggagtcacccccagtgaccttggagcaggagcagggtcac aacaagggcctggtcgctgagtgggctcagccccaggccacagctgccatgagggctggg gcagggaagcccgaggccttgaagctgaggccctggcaggccggcagggaccctcaagct caagagggggcagcagtcaccgaggaggaccagggccagaggacaggaggccgggaagac aagggaaggggcctgaaacccaggaggccccccaaagggacctcccatcaacctgggctg aggatccggcgcccacagaaggaccgcagccgaggccagggtggcggcggcagcacctcc aagaccccaggccatgggtggaaaagaccaggaagcacacatgggcacaggcacaggcac gcagacctgggcaccacccagcaggccatgccctctctgccggcctcgtgcctcctggcc caggcagtcatcgcctgtggcaatgtcaagatgaagcatgtccctgccctgacccaccct ggtctgaccacactctacctggcagagaatgaaattgccaagatcccagcccacacgttc ctggggctgcccaacctggagtggctggatctcagcaagaacaagctggatccccgaggc ctgcacccccatgccttcaagaatctgatgcggctgaagcggctgaacctggttgggaac tcgctgaccacagtcccggccctacctgcctccctgcaggagctcaaactcaacgacaac ctcctgcagggcttgcaaggcagcagcttccgtgggctcagccagctgttgacgctggag gagctgcacctgggcaccaacctcatcgaggaggtggcggagggcgcactgagccacatc cacagcctcagcgtgctggtgctcagccacaactggcttcaggagcactggctggcaccc cgagcctggattcatctcccgaagctggagacccttgacctgtcctacaaccggctggtg cacgtgccccgcttcctgccgcggggcctgaggcgcctgacgctgcaccacgaccacatc gagcgcatccctggctacgcgttcgcgcacatgaagccaggcctagagttcctgcacctg tcccacaacaggctgcaggctgacggcatccacagcgtgtccttcctgggcctgcgcgcc tcgctggcggagctgctcctggatcataaccaggtgcaggccatcccacgcggcctcctg ggcctcaagggactgcaggtgctgggcctgagccacaacaggatcagacaagtgcccttg aattccatctgtgacatgcgcgtggctcaggactccaaccttacctccacacacctggag 
aacaacctcattgaccggcgccgcatcccgcccactgccttctcctgcacccgagcctat cacagcgtggtcctccagccccagcggcggggggaggagggctcctag >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_6|451_aa MPLALLPPLSQALVLLSPVPRGLMDAPALKQASGRLRGVGACRLADEETFHRLIKPSPVR DGEGGPSAMWPLWRLVSLLALSQALPFEQRGFWDFTLDDGPFMMNDEEASGADTSGVLDP DSVTPTYSAMCPFGCHCHLRVVQCSDLGLSLSDGERGMQGGSGLKSVPKEISPDTTLLDL QNNDISELRKDDFKGLQHLYALVLVNNKISKIHEKAFSPLRKLQKLYISKNHLVEIPPNL PSSLVELRIHDNRIRKVPKGVFSGLRNMNCIEMGGNPLENSGFEPGAFDGLKLNYLRISE AKLTGIPKDLPETLNELHLDHNKIQAIELEDLLRYSKLYRLGLGHNQIRMIENGSLSFLP TLRELHLDNNKLARVPSGLPDLKLLQVVYLHSNNITKVGVNDFCPMGFGVKRAYYNGISL FNNPVPYWEVQPATFRCVTDRLAIQFGNYKK >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_6|1356_bp atgcccctggctctgctgcctccgctgtcccaagccctggtcctgctgtccccagtgccg cgaggcctcatggacgcccccgccctaaagcaggcttctggcagactcagaggcgtgggg gcctgcagacttgctgatgaggagacatttcatcgcctgattaagccaagtcctgtccgg gatggggaggggggtccatccgccatgtggcccctgtggcgcctcgtgtctctgctggcc ctgagccaggccctgccctttgagcagagaggcttctgggacttcaccctggacgatggg ccattcatgatgaacgatgaggaagcttcgggcgctgacacctcgggcgtcctggacccg gactctgtcacacccacctacagcgccatgtgtcctttcggctgccactgccacctgcgg gtggttcagtgctccgacctgggtttgtccctgagtgatggggagcggggcatgcaggga ggctcaggtctgaagtctgtgcccaaagagatctcccctgacaccacgctgctggacctg cagaacaacgacatctccgagctccgcaaggatgacttcaagggtctccagcacctctac gccctcgtcctggtgaacaacaagatctccaagatccatgagaaggccttcagcccactg cggaagctgcagaagctctacatctccaagaaccacctggtggagatcccgcccaaccta cccagctccctggtggagctccgcatccacgacaaccgcatccgcaaggtgcccaaggga gtgttcagcgggctccggaacatgaactgcatcgagatgggcgggaacccactggagaac agtggctttgaacctggagccttcgatggcctgaagctcaactacctgcgcatctcagag gccaagctgactggcatccccaaagacctccctgagaccctgaatgaactccacctagac cacaacaaaatccaggccatcgaactggaggacctgcttcgctactccaagctgtacagg ctgggcctaggccacaaccagatcaggatgatcgagaacgggagcctgagcttcctgccc accctccgggagctccacttggacaacaacaagttggccagggtgccctcagggctccca gacctcaagctcctccaggtggtctatctgcactccaacaacatcaccaaagtgggtgtc aacgacttctgtcccatgggcttcggggtgaagcgggcctactacaacggcatcagcctc 
ttcaacaaccccgtgccctactgggaggtgcagccggccactttccgctgcgtcactgac cgcctggccatccagtttggcaactacaaaaagtag >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_7|188_aa MRVRARCGAQGAGCGALPAGLGPRGDARPQCRKDTQARRGAQRKRRLPEILTVGYHGNTT VRQNTETAHRWAPETDTQGHIHPRCEWRQTHRQTPVQTPVHTHKHAFPPTWKETPVPAGG PPGFPAPGAVEASDPRAGLCEHSEQRAENARWGTALQGGVVSLRGPARIVGPACQGLARC PFDKPGLV >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_7|567_bp atgcgcgtgcgcgcgcgatgcggggcgcagggtgcagggtgcggggcgctgcccgccggc ctggggccccggggtgacgcgcggccgcagtgccgcaaggacacgcaggcccggcgcggg gctcagaggaagcggcgcttaccggagatcctcaccgtcggttatcacgggaacaccacc gttaggcagaacacagagactgcacacaggtgggcaccagagacagacacgcaggggcac atacatccccggtgtgagtggagacagacacacagacaaaccccagtgcagacacctgtc cacacccacaagcacgcgttcccacctacctggaaagagacccccgtgcctgccggtgga cccccaggcttcccagctccaggggctgtggaggcctctgaccccagggccggcctttgc gagcattctgagcagcgagcagagaatgctcgctggggcacagccctccaaggaggggtc gtgtccctcaggggtcctgcccgcatcgtgggcccagcctgccaaggtctggctcgctgc ccctttgacaagcccggactcgtctaa >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_8|51_aa MPSLPSGLLGRPGLAGGAWGARRGRISQLSYEALTGVESYSVGLSVTGSFY >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_8|156_bp atgccgtccctgccttcggggctgctgggcaggcccgggctggcaggaggggcgtgggga gcacgtaggggcaggatttcacagctctcctatgaggccctcacaggagtggaatcctac agtgttggcctctctgtgactggctccttctactga >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_9|1671_aa MGDMANSSIEFHPKPQQQRDVPQAGGFGCTLAELRTLMELRGAEALQKIEEAYGDVSGLC RRLKTSPTEGGTQAAGRSEDGGGGLRASPAVHPDPGAQTCLPLSPPQTQGPEPRGQLALL CPKPHANPLTRPPICVHANRPAPAPMPWTWTALGPAGEACAQSQRGRSCVPPPRQWVLQW GSRRDPEPTGRLRRNEQGYSCKGRNFWKVSLRSLLPAATGTLETADSSALAVLMPQSLGP CPPSLQAVPAPTGLADNTNDLEKRRQIYGQNFIPPKQPKTFLQLVWEALQDVTLIILEVA AIVSLGLSFYAPPGEESEACGNVSGGAEDEGEAEAGWIEGAAILLSVICVVLVTAFNDWS KEKQFRGLQSRIEQEQKFTVIRNGQLLQVPVAALVVGDIAQVKYGDLLPADGVLIQANDL KIDESSLTGESDHVRKSADKDPMLLSGTHVMEGSGRMVVTAVGVNSQTGIIFTLLGAGGE EEEKKDKKGSAAVPPVPGAGGGSFHAAHSFFAGRCHWGLRESQPKWESELEGFLEEENPE 
GALEGEQQLASWVEGKQQDGAMESSQTKAKKQDGAVAMEMQPLKSAEGGEMEEREKKKAN APKKEKSVLQGKLTKLAVQIGKAGLVMSAITVIILVLYFVIETFVVEGRTWLAECTPVYV QYFVKFFIIGVTVLVVAVPEGLPLAVTISLAYSVKKMMKDNNLVRHLDACETMGNATAIC SDKTGTLTTNRMTVVQSYLGDTHYKEIPAPSALTPKILDLLVHAISINSAYTTKILPPEK EGALPRQVGNKTECALLGFVLDLKRDFQPVREQIPEDKLYKVYTFNSVRKSMSTVIRMPD GGFRLFSKGASEILLKKCTNILNSNGELRGFRPRDRDDMVRKIIEPMACDGLRTICIAYR DFSAGQEPDWDNENEVVGDLTCIAVVGIEDPVRPEVATTFSVSCPGNCALCPSLSGLVPE AIRKCQRAGITVRMVTGDNINTARAIAAKCGIIQPGEDFLCLEGKEFNRRIRNEKGEIEQ ERLDKVWPKLRVLARSSPTDKHTLVKGIIDSTTGEQRQVVAVTGDGTNDGPALKKADVGF AMGIAGTDVAKEASDIILTDDNFTSIVKAVMWGRNVYDSISKFLQFQLTVNVVAVIVAFT GACITQDSPLKAVQMLWVNLIMDTFASLALATEPPTESLLLRKPYGRDKPLISRTMMKNI LGHAVYQLAIIFTLLFVGELFFDIDSGRNAPLHSPPSEHYTIIFNTFVMMQLFNEINARK IHGERNVFDGIFSNPIFCTIVLGTFGIQIVIVQFGGKPFSCSPLSTEQWLWCLFVGVGEL VWGQTALQTASVMGPLALTLLQSLGVLTLGSTLRAQACDTPGPRDRTQGKVIATIPTSQL KCLKEAGHGPGKDEMTDEELAEGEEEIDHAERELRRGQILWFRGLNRIQTQMEVVSTFKR SGSVQGAVRRRSSVLSQLHDVTNLSTPTHAILSAANPTSAAGTLHCKLQHLKANQGHVSP CTWRAPPLQLAGSGIAHGTPHLHPFRFGSVWPGMGLVSLQRHPPCIRVVKAFRSSLYEGL EKPESKTSIHNFMATPEFLINDYTHNIPLIDDTDVDENEERLRAPPPPSPNQNNNAIDSG IYLTTHVTKLLIRTEEDEDREEEKEEEEDKKGEEKVLRPKEEGEEVESAKS >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_9|5016_bp atgggcgacatggccaatagttccatcgagttccaccccaagccccagcagcagcgggat gtcccccaggctggaggctttgggtgcacgctggcggagctgcgcaccctcatggagctg cgaggggccgaggcgctgcagaagatcgaggaggcctacggggatgtcagcgggctctgc cggaggctgaagacctcacccacagagggtggcactcaggctgcagggaggagtgaagat ggaggaggggggctgcgggccagcccggccgtgcacccggaccctggtgctcagacctgc ctgcccctcagtcctcctcagacacaagggcctgagccacgggggcagctggctttgttg tgccccaagccccatgccaatcccctgacaaggccgcccatctgtgtccatgccaaccgg ccagcccctgcccccatgccctggacctggactgccctcgggcctgctggggaagcatgt gctcagagccagaggggccgcagctgtgttcctcctccccgtcaatgggtgctgcagtgg ggcagccgtcgggaccccgagcccacaggaaggctaaggagaaacgagcaaggctacagc tgcaagggcagaaacttctggaaagtaagcctccgctccctgttgccagcagccacaggc accctggagactgctgattcctcggcactggccgtgctcatgccccaatccctcgggccc 
tgcccaccctctttgcaggctgtccctgcccccacaggcctggcggacaacaccaatgac ctggagaagcgcaggcagatctacgggcagaacttcatccccccaaagcaacccaagacc ttcctgcagctggtgtgggaggccctgcaggacgtgaccctcatcatcctggaggtggct gccatcgtctctctgggcctctcgttctatgcgccgccaggagaggagagtgaagcctgt gggaatgtgtcgggaggcgcagaagatgagggcgaggccgaagctggctggatcgagggg gctgccatcctgctgtccgtcatctgtgtggtgctggtcacggccttcaatgactggagc aaggagaagcagttccgaggcctgcagagccgaattgagcaggagcagaagttcacggtc atccggaacgggcagctcctccaggtccccgtggctgcgctggtggtgggggacattgcc caggtcaagtacggcgacctgctgccagccgacggcgtgctcatccaggccaatgacctc aagatcgacgagagctccctgacgggcgagtctgaccacgtgcgcaagtcagctgacaaa gatcccatgctgctctcaggcactcatgtcatggaaggttctggaagaatggtggtgacc gccgttggcgtgaattcccagacaggcatcatcttcacgctgcttggagctggcggagag gaggaagagaagaaagataagaaaggtagcgcagcagtgccgccagtccctggtgctggt ggcggctccttccacgctgcccattctttctttgcgggccggtgccactgggggctcagg gagagccagcccaagtgggagtcagagctagagggcttcctggaggaagagaaccctgag ggggccctggagggagagcagcagctggccagctgggtagagggcaagcagcaggatggg gccatggagagtagccagaccaaagctaagaagcaggatggtgcagtggccatggagatg cagcccctgaagagcgcggagggtggggagatggaggagcgggagaagaagaaagccaac gcacccaaaaaggagaagtctgtccttcaggggaagctcacaaagctagccgtgcagatc gggaaagcagggctggtgatgtctgccatcaccgtcatcatcctggtcctctactttgtg attgagacgtttgtcgtggaaggccggacatggctggcagagtgcacgccggtctatgta caatacttcgtgaagttcttcatcattggtgtcactgtgctggtcgtggctgtcccagag ggcctgcctcttgctgtcaccatctccttagcttactctgtcaagaaaatgatgaaagac aacaacctggtgcgccacctggatgcctgcgagaccatgggcaacgccacagccatctgc tccgacaagacgggcacgctcaccaccaaccgtatgaccgtggtccagtcctacctagga gacacccactacaaagagattccggcccccagcgccctgacccctaagatcctcgacctc ctggtccatgccatctccatcaacagtgcctataccaccaaaatactacctcctgagaag gaaggcgccctcccacgccaggtgggcaataagacggagtgcgccctgctgggcttcgtc ttggacctgaagcgggacttccagcccgtgcgcgagcagatcccggaagacaagctttac aaagtgtacaccttcaactcggtccgcaagtccatgagcacagtcatccgcatgcccgac ggtggcttccgcctcttcagcaagggggcctcagagatcctcttgaaaaagtgcaccaac atcttgaacagcaatggcgaactccggggctttcggcctcgggaccgggacgacatggtg 
aggaagatcatcgagccgatggcttgcgatggcctccgcaccatctgcatcgcctaccgg gacttctctgcaggccaggagcccgactgggacaacgagaatgaggtcgtgggtgacctc acctgcatagctgtcgtgggcattgaggaccctgtgcggcccgaggtagccaccaccttc tctgtgagctgccctggtaactgtgccctctgcccgagcttgtccggactggtccctgaa gctatccgaaaatgccagcgtgctggcatcacagtccgcatggtgactggggacaacatc aacacggcccgggccatcgcagccaaatgcggcatcatccagcccggggaggacttcctg tgcctagaagggaaggagttcaaccggcggatccgcaatgagaaaggcgagatagaacag gagcggctggacaaggtgtggcccaagctgagggtgctggcccggtcgtctcccaccgac aagcacacactggtcaaagggattatcgacagcaccactggtgagcagcggcaggtggtg gctgtgacaggggatggcaccaacgatgggccggccctcaagaaggcggacgtgggcttc gccatgggcatcgcagggaccgacgtggccaaggaggcctccgacatcatcctgaccgat gacaacttcaccagcatcgtcaaggcagtcatgtggggccgtaacgtctatgacagcatc tccaagttcctgcagtttcaactgacggtcaatgtggtggctgtgatcgtggccttcaca ggtgcctgcattactcaggactctcctctcaaagccgtgcagatgttgtgggtgaacttg atcatggacacatttgcctctctggccctggcgacggagccacccacagagtcgctgctg ctgcggaagccgtacggccgcgacaagcccctcatctcccgcaccatgatgaagaacatt ctgggccacgccgtgtaccagctcgccatcatcttcaccctgctgtttgtcggggagctc ttcttcgacatcgacagcgggaggaatgcgcccctgcactcgccaccctcagagcactac accatcatcttcaacacgttcgtcatgatgcagctctttaacgagatcaacgcccgcaag atccacggcgagaggaacgtgttcgacggcatcttcagcaaccccatcttctgcaccatc gttttgggcactttcgggattcagattgtcatcgtccagtttggcgggaagcccttcagc tgctccccactatccacagaacagtggctctggtgcctgtttgttggtgttggggagctg gtctggggacagactgccctgcagacagcttctgtgatgggcccacttgccctaaccctg cttcagagccttggggtcctcaccctaggaagcacactgagggctcaggcctgtgacaca ccaggacccagggacaggacgcaagggaaggtcattgccaccatccccaccagccagctc aagtgcctgaaggaagccgggcacgggcccgggaaggacgagatgaccgacgaggagctg gccgaaggcgaggaagagatcgaccatgccgagcgggagctccgcaggggccagatcctc tggttccggggcctgaaccggattcagacgcagatggaggtagtgagtaccttcaagaga agcggttcagttcagggtgctgtgcgccggcggtcttcggtcctcagccagcttcatgac gtaaccaatctttctacccctactcacgcaattctctctgctgccaatcctaccagtgct gctgggaccctgcactgcaaactccagcacttgaaggctaaccaggggcacgtctcccca tgcacatggcgggcacctccactccagctggccggcagtggcatcgcacatggcactcca 
cacctgcaccccttccgctttggctcagtgtggccagggatgggccttgtcagtctgcag cgtcacccgccctgtatccgggtggtgaaagcgttccgtagctcgctctatgaaggcctg gagaaaccagaatccaagacctccattcacaacttcatggccacgcccgagtttctgatc aatgactacacccacaacatcccgctcattgacgacacggacgtggacgagaacgaggag cgcctccgggcccccccgcccccgtcccccaaccagaacaacaacgccatagacagcggc atctacctgaccacgcatgtcaccaagctcctcatccggactgaggaagacgaggacagg gaggaggaaaaggaggaagaggaggacaaaaagggggaggagaaggttcttcgtccaaag gaggaaggagaagaagtagaaagtgcgaagagctga >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_10|164_aa MPGPVPTVSQWLSPRPKHLPAKRDTCKCGRQGSPMASLCGPRGFLRRVAKVGPLFIPFAH LGETEAWPPGAHTARIRSPQDLVLSHKSMTSWLKLGRWTQRLGWEECFGGGTSDVTADIK VVYRGLLGICQSKQWESIQVLRGQGLNCGCVPAVILPPPAPPGF >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_10|495_bp atgcctggcccggtcccaactgtctcccagtggctgagcccaaggcccaaacacctccct gcaaaacgtgacacctgcaagtgtggaaggcaagggtcccccatggcatcactgtgtgga cctaggggcttcctcaggagggtggcaaaagtcgggcccctctttatcccctttgctcat ttgggggaaactgaggcttggccaccaggtgcccacactgccaggatacgctcccctcag gacctggtgctgagccacaaatccatgacttcatggcttaagcttggccgctggacacaa agactagggtgggaagagtgctttggtggtggcacctctgatgtcacagcagacatcaag gtggtctaccgaggactcctcggcatttgccaaagcaaacagtgggagtccatccaggtg ctgaggggacagggtctgaactgtggctgcgtccctgccgtcattctcccaccaccagct cctccgggcttctga >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_11|238_aa MEAPEGGGGGPAARGPEGQPAPEARVHFRVARFIMEAGVKLGMRSIPIATACTIYHKFFC ETNLDAYDPYLIAMSSIYLAGKVEEQHLRTRDIINVSNRYFNPSGEPLELDSRFWELRDS IVQCELLMLRVLRFQVSFQHPHKNWLNRHSWQRTPVAVTAWALLRDSYHGALCLRFQAQH IAVAVLYLALQVYGVEVPAEVEAEKPWWQVFNDDLTKPIIDNIVSDLIQIYTMDTEIP >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_11|717_bp atggaagccccggagggcggcggaggggggcctgcagcgcggggcccggaggggcagccg gcgcccgaagccagggtgcacttccgagtggcgaggttcatcatggaggcaggtgtcaag ctagggatgcggtccattcccattgccactgcttgcaccatttaccataagttcttttgc gagaccaacctggacgcctatgacccttacctgattgccatgtcttcaatttacttggcc ggcaaagtggaagagcagcacctgcggactcgtgacatcatcaatgtgtccaacaggtac 
tttaacccaagcggtgagcccctggaattggactcccgcttctgggaactccgggacagc atcgtgcagtgtgagcttctcatgctgagagttctgcgcttccaggtctccttccagcat ccacacaagaactggctgaaccgccacagctggcagcggacccctgttgccgtcaccgcc tgggccctgctgcgggacagctaccatggggcgctgtgcctccgcttccaggcccagcac atcgccgtggcggtgctctacctggccctgcaggtctacggagttgaggtgcccgccgag gtcgaggctgagaagccgtggtggcaggtgtttaatgacgaccttaccaagccaatcatt gataatattgtgtctgatctcattcagatttataccatggacacagagatcccctaa >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_12|208_aa MAGGLAGMGGIQNKKETMQSLNDRLASYLGRVRSLETENQKLESKIREHLEKKGSQVRDW SHYFKTIEDLRAQISANTVDNARTVLQIDSACLAADDFRVKYETELAMCQSVENNIHGLC KVIDDTNVTRLQLETEIEALKEELLFMKKNHEEEVKGLQAQIASSELTVEVEAPNLRTLP KSWQTSGPNMTSWLGRKERSWTVLVSAD >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_12|627_bp atggccgggggtctggcaggaatgggaggcatccagaacaagaaggagaccatgcaaagc ctgaacgaccgcctggcctcctacctgggcagagtgaggagcctggagactgagaaccag aagctggagagcaaaatccgggagcatctggagaagaagggatcccaggtcagagactgg agccattacttcaagaccatcgaggacctgagggctcagatctctgcaaatactgtggac aatgcccgcaccgttctgcagatcgacagtgcctgtcttgctgctgatgactttagagtc aagtatgagacagagctggccatgtgccagtctgtggagaacaacatccatgggctttgc aaggtcattgatgacaccaatgtcactcggctgcagctggagacagagatcgaggccctc aaggaggagctgctcttcatgaagaagaaccacgaagaggaagtaaaaggcctacaagcc cagattgccagctctgagttgaccgtggaggtagaggccccaaatctcaggaccttgcca aaatcatggcagacatccgggcccaatatgacgagctggcttggaagaaaagagaggagc tggacagtactggtctcagcagattga >gi568815575f:153404632_153608442|GENSCAN_predicted_peptide_13|136_aa MRPLDVAELSEPEEVEVLELEGGFEQFLLPVIHEMPEDIASLTSTGGRTCGSGARCGRWT ICSSRSKPRWRLRRSILNRLHNPGDAVEGRVTKRCQKAEEKAKEMVKMAEMLVELAQRIE TNESRARSVVYSQWIL >gi568815575f:153404632_153608442|GENSCAN_predicted_CDS_13|408_bp atgcggcccctggatgtcgccgagctgtcagaaccggaggaagtggaggtgctggagctg gaggggggttttgagcagtttctgctcccggtcatccatgagatgccagaggacatcgcg tcgctgacaagcactggcgggcgtacctgtggaagcggagcacgctgtgggagatggaca atatgctcttccagatcaaaacccaggtggaggcttcggaggagcatcctcaaccgcctc cacaacccgggcgacgcagtggagggcagggtgaccaagaggtgccagaaggccgaggag 
aaggccaaggaaatggtgaagatggcagagatgctggtggagctggcccagaggatagag acgaacgagtcgagagcacggtcggtggtttacagccaatggattctg