GENSCAN 1.0  Date run: 5-Nov-116  Time: 17:01:25

Sequence gi568815580r:50127384_50366404 : 239021 bp : 43.37% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1513   1640  128  1  2  -23  127    75 0.310   0.62
 1.02 Intr +   3119   3203   85  0  1   52   68   106 0.363   3.88
 1.03 Intr +   5881   6103  223  1  1  104  100    -3 0.075   0.63
 1.04 Term +  20433  21263  831  2  0   11   42   353 0.076  16.02
 1.05 PlyA +  21272  21277    6                               1.05

 2.00 Prom +  21620  21659   40                              -8.96
 2.01 Init +  21749  22803 1055  1  2   82   53   277 0.387  17.28
 2.02 Intr +  32551  32707  157  1  1  104   28   125 0.361   8.11
 2.03 Intr +  42464  42500   37  1  1   88   77    57 0.283   2.44
 2.04 Term +  53115  53278  164  1  2   90   33    55 0.087  -1.70
 2.05 PlyA +  54493  54498    6                               1.05

 3.00 Prom +  63497  63536   40                              -5.36
 3.01 Init +  67409  67909  501  1  0  107   85   152 0.567   9.83
 3.02 Term +  88868  88894   27  1  0  125   41    48 0.545   1.87
 3.03 PlyA +  89070  89075    6                               1.05

 4.43 PlyA -  89465  89460    6                               1.05
 4.42 Term -  90320  90177  144  2  0   78   47   106 0.331   3.41
 4.41 Intr - 100226 100085  142  1  1   59   37   209 0.248  13.26
 4.40 Intr - 111322 111220  103  2  1   78   86   138 0.999  11.83
 4.39 Intr - 115733 115517  217  2  1   58   56   149 0.816   6.78
 4.38 Intr - 123593 123375  219  2  0   58   52   181 0.930  10.00
 4.37 Intr - 124401 124098  304  2  1   87   40   290 0.999  20.69
 4.36 Intr - 133854 133681  174  2  0   95   95    86 0.993   9.05
 4.35 Intr - 134836 134607  230  1  2   87   69   191 0.995  13.77
 4.34 Intr - 139032 138953   80  1  2   11   91    73 0.376  -0.83
 4.33 Intr - 140280 140089  192  1  0   60   22   133 0.346   3.36
 4.32 Intr - 140928 140796  133  0  1   37   28   138 0.656   2.92
 4.31 Intr - 145572 145441  132  0  0   96   49    87 0.717   6.44
 4.30 Intr - 146088 145951  138  0  0   71   81    69 0.741   5.16
 4.29 Intr - 146480 146181  300  2  0   72   94   166 0.977  12.53
 4.28 Intr - 146970 146803  168  0  0   69   64   358 0.942  31.64
 4.27 Intr - 147662 147594   69  2  0  111   75    88 0.997   9.18
 4.26 Intr - 147929 147746  184  1  1   71   93    14 0.554  -0.01
 4.25 Intr - 148061 147987   75  1  0   64   74    72 0.443   2.03
 4.24 Intr - 148345 148217  129  0  0   82  100   147 0.551  15.11
 4.23 Intr - 148523 148452   72  1  0   88   96    29 0.448   2.22
 4.22 Intr - 148734 148566  169  1  1    4   92    68 0.429  -2.20
 4.21 Intr - 149035 148868  168  2  0  131   48   108 0.439  11.02
 4.20 Intr - 149361 149279   83  2  2  102  105    42 0.990   6.58
 4.19 Intr - 149615 149449  167  2  2  111   92    99 0.977  11.36
 4.18 Intr - 149821 149707  115  0  1   52   75   112 0.961   6.75
 4.17 Intr - 152634 152500  135  2  0  117   69   188 0.936  19.48
 4.16 Intr - 154633 154543   91  2  1   82   59    37 0.010  -0.75
 4.15 Intr - 155356 155224  133  1  1   70  -49   247 0.011   9.42
 4.14 Intr - 155623 155471  153  1  0   77   66   219 0.501  18.87
 4.13 Intr - 155978 155882   97  1  1  100   78    65 0.991   6.71
 4.12 Intr - 156181 156132   50  1  2   63  110    64 0.998   3.58
 4.11 Intr - 156432 156322  111  0  0  104   53   190 0.923  17.68
 4.10 Intr - 156718 156511  208  0  1   88   83   378 0.999  36.38
 4.09 Intr - 157179 156995  185  0  2  117   89   194 0.999  20.99
 4.08 Intr - 157456 157337  120  1  0   41   65   210 0.924  14.69
 4.07 Intr - 157864 157619  246  1  0   66   93   221 0.955  18.06
 4.06 Intr - 157968 157942   27  0  0   81  109    22 0.788   1.91
 4.05 Intr - 158545 158366  180  1  0   83   88   319 0.999  31.46
 4.04 Intr - 158874 158639  236  1  2  115  109   267 0.999  28.91
 4.03 Intr - 159256 159156  101  0  2   90   78   182 0.654  17.15
 4.02 Intr - 159475 159357  119  1  2   46   50   231 0.838  14.36
 4.01 Init - 160716 160642   75  0  0  104   47    37 0.176   2.29
 4.00 Prom - 170125 170086   40                              -3.06

 5.03 PlyA - 171018 171013    6                               1.05
 5.02 Term - 197857 197314  544  0  1  -33   37   930 0.974  69.54
 5.01 Init - 220194 217235 2960  0  2   44   53   991 0.604  82.37
 5.00 Prom - 220287 220248   40                              -7.66

 6.02 PlyA - 220456 220451    6                               1.05
 6.01 Sngl - 221700 220684 1017  0  0   88   43   574 0.997  49.94
 6.00 Prom - 223389 223350   40                              -5.86

 7.02 PlyA - 223482 223477    6                               1.05
 7.01 Sngl - 225792 225589  204  0  0   52   54   280 0.946  16.19
 7.00 Prom - 237136 237097   40                              -1.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 155356 155210 147 1 0 70 52 286 0.972 21.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580r:50127384_50366404|GENSCAN_predicted_peptide_1|422_aa XIPNQRLPAGKAGFLLTNPQRITFTKHTQGLIKITEEPPAKSHGRSSWALAKAPSKGDKK NENEQYPPYLNCALLKLEKVKISTNKITTKKILHPKIREEFMRVLAALNEANVWALMLFQ SIKRTFNRKSESSYGADVRNMTQKEKELNKQLPCQILGSCEVNRRQKLLPALADVKNGAV EIDRPLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDT FLDTYTLPRLNQEEVESLNRPITEAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPF LLKLFQSIEKEGILPNSFYEASIILIPKPGRDTSKKENFRPISLMNIDAKILNKILANRI QQYIKKLIRHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKVFDKIQQP SC >gi568815580r:50127384_50366404|GENSCAN_predicted_CDS_1|1269_bp nnaatacccaatcagcggctgccggctggaaaagcaggctttttattgacaaacccgcag aggatcaccttcacaaaacacacccagggcctaattaaaatcacagaagaaccacctgcc aaatctcacggaaggtcttcctgggcattggcaaaggctccttcaaaaggagacaagaag aatgagaacgagcaatatccaccctacctgaactgtgctctcctcaaattggaaaaggta aaaataagcacaaacaaaataaccacaaagaagatactacatcccaagatcagagaggag ttcatgagagtgcttgctgccttaaacgaggctaatgtttgggccttaatgctattccaa agtattaaaagaaccttcaatagaaagagtgaaagctcatatggtgcagacgtcagaaac atgacccagaaggaaaaggagctcaataagcaactaccttgccaaatcctgggatcctgt gaggtaaacaggaggcagaaactgttaccagcattagcggatgtgaaaaatggggcagtg gaaattgatagaccactagcaagactaataaagaaaaaaagagagaagaatcaaatagac acaataaaaaatgataaaggggatatcaccaccgatcccacagaaatacaaactaccatc agagaatactacaaacacctctatgcaaataaactagaaaatctagaagaaatggataca ttcctcgacacatacactctcccaagactaaaccaggaagaagttgaatctctgaataga ccaataacagaagctgaaattgtggcaataatcaatagcttaccaaccaaaaagagtcca ggaccagatggattcacagccgaattctaccagaggtacaaggaggaactggtaccattc cttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattttatgag gccagcatcatcctgataccaaagcctggcagagacacaagcaaaaaagagaattttaga ccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatc cagcagtacattaaaaagcttatccgccatgatcaagtgggcttcatccctgggatgcaa ggctggttcaatatatgcaaatcaataaatgtaattcagcatataaacagaaccaaagac 
aaaaatcacatgattatctcaatagatgcagaaaaggtctttgacaaaattcaacaacct tcatgctaa >gi568815580r:50127384_50366404|GENSCAN_predicted_peptide_2|470_aa MKDLFKENYKPLLNVIKEDTNKWKNIPCSWIGRINIVKMAILPKVIYRFNAIPIKLPMTF FTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGIMLPDFKLYYKATVTKTAWYWYQKRDI DQWNRTEPSEIMPHIYNHLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPY TKINSRWIKDLHVRPKTIKTLEENLGITIQDTGTGKDFMSKTPKAMARAKIDKWDLIQLK CFCTAKETTIRVNRQPTKWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMN RHFSKEDICAAKKHMKKCSPSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRTGTGTQTL HMNNVATLDPGESQVGQYSRRRDGMARGANGSPLSMGPHANTHNNTDVRIKSHDEQVLLP TYPLPEFSFYIVVRYTKFSIVIFLNVQFSGIKYIHIVVQPSQPPVSKALF >gi568815580r:50127384_50366404|GENSCAN_predicted_CDS_2|1413_bp atgaaggacctcttcaaggagaactacaaaccactgctcaatgtaatcaaagaggataca aacaaatggaagaacattccatgctcatggataggaagaatcaatatcgtgaaaatggcc atactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccacatc gccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacctgacttcaaa ctatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaagagagatata gatcaatggaacagaacagagccctcagaaataatgccacatatctacaaccatctgatc tttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgc tgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacaccttat acaaaaatcaattcaagatggattaaagacttacatgttagacctaaaaccataaaaacc ctagaagaaaacctaggcattaccattcaggacacaggcacgggcaaggacttcatgtct aaaacaccaaaagcaatggcaagagccaaaattgacaaatgggatctaattcaactaaag tgcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaaaatgggag aaaattttcacaacctactcatctgacaaagggctaatatccagaatctacaatgaactc aaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaac agacacttctcaaaagaagacatttgtgcagccaaaaaacacatgaaaaaatgctcacca tcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcacaccagtt agaatggcaatcattaaaaagtcaggaaacaacagaacaggcacaggcacacagacgctc cacatgaacaacgtagcaacactggatcctggagagtcacaagttggacagtactcacgc agacgtgatggcatggccaggggtgcaaatggctcccctctgtccatgggaccacatgct aacacgcacaacaatacagatgtgaggatcaagtcccatgatgagcaagtcctccttccc 
acctacccgctccccgagtttagtttttacattgtggtaagatacacaaaatttagcatt gtcatctttttaaatgtacagttcagtggcatcaagtacattcacattgttgtacaaccc tcgcaaccacccgtctccaaagctttgttttaa >gi568815580r:50127384_50366404|GENSCAN_predicted_peptide_3|175_aa MARAGRGSGPGSWLPRGSQSAAGPPGAMFPGWPGVSRSSRSSPTCPAGFPQAPRPARSRR RDLYSRRGGAATAGQELRAPGEEAAPHHSPPRCGEEARAGEEGGPARVRADGPCAAAPLS PAGARPDAPSRAPAATWASGPGAGRSERSSGTGAGRPLLDAARAQAQAIDSPPNL >gi568815580r:50127384_50366404|GENSCAN_predicted_CDS_3|528_bp atggcccgggccgggcggggctcgggccccggctcctggctgccccgcggctctcagtcc gcggctggcccgcctggcgccatgttcccgggctggcctggagtttctcgatcttctcgc tcttctccgacctgccccgccggcttcccgcaggcgccgcggccggctcgctcccggcgg cgcgacctttactcccgccgcggcggcgcagctacggccggacaggagttgcgagcgccg ggggaggaggccgcgccgcaccactcccctcccaggtgtggggaggaggcgagggccggc gaggagggaggacccgctcgcgtcagagcggacggcccgtgcgccgccgcgcctctgagc cctgccggtgcccggcccgacgcgccctcccgcgcccccgctgccacctgggcctccggg ccgggggcggggcgctcagagaggagctccgggaccggcgcgggccgccccctgctggac gccgcgcgcgcgcaggcacaggccatcgacagtccacccaacctgtaa >gi568815580r:50127384_50366404|GENSCAN_predicted_peptide_4|2057_aa MAAPVKNPQLGRRHMSPRNVRNSTHEGDGSDPEPPDAGEDSKSENGENAPIYCICRKPDI NCFMIGCDNCNEWFHGDCIRITEKMAKAIREWYCRECREKDPKLEIRYRHKKSRERDGNE RDSSEPRDEGGGRKRPVPDPDLQRRAGSGTGVGAMLARGSASPHKSSPQPLVATPSQHHQ QQQQQIKRSARMCGECEACRRTEDCGHCDFCRDMKKFGGPNKIRQKCRLRQCQLRARESY KYFPSSLSPVTPSESLPRPRRPLPTQQQPQPSQKLGRIREDEGAVASSTVKEPPEATATP EPLSDEDLPLDPDLYQDFCAGAFDDHGLPWMSDTEESPFLDPALRKRAVKVKHVKRREKK SEKKVMERKEERYKRHRQKQKHKDKWKHPERADAKDPASLPQCLGPGCVRPAQPSSKYCS DDCGMKLAANRIYEILPQRIQQWQQSPCIAEEHGKKLLERIRREQQSARTRLQEMERRFH ELEAIILRAKQQAVREDEESNEGDSDDTDLQIFCVSCGHPINPRVALRHMERCYAKYESQ TSFGSMYPTRIEGATRLFCDVYNPQSKTYCKRLQVLCPEHSRDPKVPADEVCGCPLVRDV FELTGDFCRLPKRQCNRHYCWEKLRRAEVDLERVRVWYKLDELFEQERNVRTAMTNRAGL LALMLHQTIQHDPLTTDLRSTVRRGGGSGTRPPLGLMFGGTHEHRIDPSERLLLLPVASM AEDWLDCPALGPGWKRREVFRKSGATCGRSDTYYQSPTGDRIRSKVELTRYLGPACDLTL FDFKQGILCYPAPKAHPVAVASKKRKKPSRPAKTRKRQVGPQSGEVRKEAPRDETKADTD TAPASFPAPGCCENCGISFSGDGTQRQRLKTLCKDCRAQRIAFNREQRMFKVSTTCFLFT 
TLQWEQDVMELVESEGMMMGPQDEESFYRVLSIDLLRYGDEANGGLKFMVMPADVGSVDM GGPVLLEQARDLVPTIPGSVWAVGSVQPASCPMMWHRGCSASVNGDAASGLWKGAEGVEY AGAVRPKRIVAIAPSAFALPALVSGASGNVSSDVAYGTLLTACVAVIRDVSDALPWLWLP QLVWGVSVSAWCHQAETTLSDLSQGKHARRKGGCDSKMAARRRPGAQPLPPPPPSQSPEP TEPHPRALAPSPPAEFIYYCVDEDELQPYTNRRQNRKCGACAACLRRMDCGRCDFCCDKP KFGGSNQKRQKCRWRQCLQFAMKRLLPSVWSESEDGAGSPPPYRRRKRPSSARRHHLGPT LKPTLATRTAQPDHTQAPTKQEAGGGFVLPPPGTDLVFLREGASSPVQVPGPVAASTEAL LQEAQCSGLSWVVALPQVKQEKADTQDEWTPGTAVLTSPVLVPGCPSKAVDPGLPSVKQE PPDPEEDKEENKDDSASKLAPEEEAGGAGTPVKFRRGPLGCTTLGIPNGRPRICADFPKP YFGPKAFSLPPSRRMCGITTLLLTGNATVVTTCAAGPSWFPDLCFLFCLDPDSQKPVTGL LIHILTLVHCDPQTLNYAPPGDSKMYSQRFGTVQREVKGPTPKVVIVRSKPPKGQGAEHH LERIRRSHQKHNAILASIKSSERDRLKAEWDQHNDCKILDSLVRARIKDAVQGFIINIEE RRNKLRELLALEENEYFTEMQLKKETIEEKKDRMREKTKLLKEKNEKERQDFVAEKLDQQ FRERCEELRVELLSIHQKKVCEERKAQIAFNEELSRQKLVEEQMFSKLWEEDRLAKEKRE AQEARRQKELMENTRLGLNAQITSIKAQRQATQLLKEEEARLVESNNAQIKHENEQDMLK KQKAKQETRTILQKALQERIEHIQQEYRDEQDLNMKLVQRALQDLQEEADKKKQKREDMI REQKIYHKYLAQRREEEKAQEKEFDRILEEDKAKKLAEKDKELRLEKEARRQLVDEVMCT RKLQVQEKLQREAKEQEERAMEQKHINESLKELNCEEKENFARRQRLAQEYRKQLQMQIA YQQQSQEAEKEEKRREFEAGVAANKMCLDKVNERCDAAGALDLKSLTNPEANPGPCTPEK IARVCQPSLKTAKPKTL >gi568815580r:50127384_50366404|GENSCAN_predicted_CDS_4|6174_bp atggcggcgcccgtgaagaatccgcaattaggtcgccgtcatatgtcgcctaggaacgta cggaattcgacccacgagggagatggttcagacccagagcctccagatgccggggaggac agcaagtccgagaatggggagaatgcgcccatctactgcatctgccgcaaaccggacatc aactgcttcatgatcgggtgtgacaactgcaatgagtggttccatggggactgcatccgg atcactgagaagatggccaaggccatccgggagtggtactgtcgggagtgcagagagaaa gaccccaagctagagattcgctatcggcacaagaagtcacgggagcgggatggcaatgag cgggacagcagtgagccccgggatgagggtggagggcgcaagaggcctgtccctgatcca gacctgcagcgccgggcagggtcagggacaggggttggggccatgcttgctcggggctct gcttcgccccacaaatcctctccgcagcccttggtggccacacccagccagcatcaccag cagcagcagcagcagatcaaacggtcagcccgcatgtgtggtgagtgtgaggcatgtcgg cgcactgaggactgtggtcactgtgatttctgtcgggacatgaagaagttcgggggcccc aacaagatccggcagaagtgccggctgcgccagtgccagctgcgggcccgggaatcgtac 
aagtacttcccttcctcgctctcaccagtgacgccctcagagtccctgccaaggccccgc cggccactgcccacccaacagcagccacagccatcacagaagttagggcgcatccgtgaa gatgagggggcagtggcgtcatcaacagtcaaggagcctcctgaggctacagccacacct gagccactctcagatgaggacctacctctggatcctgacctgtatcaggacttctgtgca ggggcctttgatgaccatggcctgccctggatgagcgacacagaagagtccccattcctg gaccccgcgctgcggaagagggcagtgaaagtgaagcatgtgaagcgtcgggagaagaag tctgagaagaaggtgatggagaggaaggaggagcgatacaagcggcatcggcagaagcag aagcacaaggataaatggaaacacccagagagggctgatgccaaggaccctgcgtcactg ccccagtgcctggggcccggctgtgtgcgccccgcccagcccagctccaagtattgctca gatgactgtggcatgaagctggcagccaaccgcatctacgagatcctcccccagcgcatc cagcagtggcagcagagcccttgcattgctgaagagcacggcaagaagctgctcgaacgc attcgccgagagcagcagagtgcccgcactcgccttcaggaaatggaacgccgattccat gagcttgaggccatcattctacgtgccaagcagcaggctgtgcgcgaggatgaggagagc aacgagggtgacagtgatgacacagacctgcagatcttctgtgtttcctgtgggcacccc atcaacccacgtgttgccttgcgccacatggagcgctgctacgccaagtatgagagccag acgtcctttgggtccatgtaccccacacgcattgaaggggccacacgactcttctgtgat gtgtataatcctcagagcaaaacatactgtaagcggctccaggtgctgtgccccgagcac tcacgggaccccaaagtgccagctgacgaggtatgcgggtgcccccttgtacgtgatgtc tttgagctcacgggtgacttctgccgcctgcccaagcgccagtgcaatcgccattactgc tgggagaagctgcggcgtgcggaagtggacttggagcgcgtgcgtgtgtggtacaagctg gacgagctgtttgagcaggagcgcaatgtgcgcacagccatgacaaaccgcgcgggattg ctggccctgatgctgcaccagacgatccagcacgatcccctcactaccgacctgcgctcc actgttcggcggggaggagggagcgggacacgaccccccctggggttgatgttcggaggg acccacgaacacaggatagaccccagcgagaggctactgctgcttcctgtggcctccatg gctgaggactggctggactgcccggccctgggccctggctggaagcgccgcgaagtcttt cgcaagtcaggggccacctgtggacgctcagacacctattaccagagccccacaggagac aggatccgaagcaaagttgagctgactcgatacctgggccctgcgtgtgatctcaccctc ttcgacttcaaacaaggcatcttgtgctatccagcccccaaggcccatcccgtggcggtt gccagcaagaagcgaaagaagccttcaaggccagccaagactcggaaacgtcaggttgga ccccagagtggtgaggtcaggaaggaggccccgagggatgagaccaaggctgacactgac acagccccagcttcattccctgctcctgggtgctgtgagaactgtggaatcagcttctca ggggatggcacccaaaggcagcggctcaaaacgttgtgcaaagactgtcgagcacagaga 
attgccttcaaccgggaacagagaatgtttaaggtaagcacaacctgcttcctcttcaca actctgcagtgggagcaggatgtgatggagctggtagagtctgaagggatgatgatgggt ccacaggatgaagagtcattttatagggtgctctccatagaccttctcagatacggtgat gaggctaatgggggactaaaattcatggtgatgccagctgatgtagggtcagtagacatg ggaggaccagtgcttctggagcaggcccgtgatctcgtccctaccattcctggcagcgtg tgggctgtggggagtgtgcagcctgccagctgccccatgatgtggcatcggggctgttct gcaagtgtgaacggagacgctgcctccggattgtggaaaggagccgagggtgtggagtat gccggggctgtcagacccaagaggattgtggccattgccccatctgccttcgccctcccc gccctggtctcaggcgccagtggaaatgtgtccagcgacgttgcctacggcaccttgctc accgcctgcgtcgccgtcatcagagatgtcagcgacgcactcccctggctgtggctcccc caactggtgtggggagtttctgtgtctgcctggtgccaccaagctgaaaccaccctttct gatctttcccagggtaaacatgcccgccgcaagggaggctgtgactccaagatggctgcc aggcggcgccccggagcccagccactgcctccaccacccccatcacagtccccagagccc acagagccgcaccccagagccctggccccctcgccacctgccgagttcatctattactgt gtagacgaggacgagctacagccctacacgaaccgccggcagaaccgcaagtgcggggcc tgtgcagcctgcctacggcggatggactgtggccgctgcgacttctgctgcgacaagccc aaattcgggggcagcaaccagaagcgccagaagtgtcgttggcgccaatgcctgcagttt gccatgaagcggctgctgcccagtgtctggtcagagtctgaggatggggcaggatcgccc ccaccttaccgtcgtcgaaagaggcccagctctgcccgacggcaccatcttggccctacc ttgaagcccaccttggctacacgcacagcccaaccagaccatacccaggctccaacgaag caggaagcaggtggtggctttgtgctgcccccgcctggcactgaccttgtgtttttacgg gaaggcgcaagcagtcctgtgcaggtgccgggccctgttgcagcttccacagaagccctg ttgcaggaggcccagtgctctggcctgagttgggttgtggccttaccccaggtgaagcaa gagaaggcggatacccaggacgagtggacaccaggcacagctgtcctgacttctcccgta ttggtgcctggctgccctagcaaggcagtagacccaggcctgccttctgtgaagcaagag ccacctgacccagaggaggacaaggaggagaacaaggatgattctgcctccaaattggcc ccagaggaagaggcaggaggggctggcacacccgtgaaattccgacgaggccctttgggc tgcacaaccttgggaatccccaacgggcggccccgaatctgcgccgattttcccaagccg tacttcgggcccaaggcgttttcattgccgccttcccggaggatgtgtggtattaccacc ttgctcttgactggaaatgcaactgtggttaccacctgtgctgcaggtccgtcctggttc cctgatttgtgtttcctgttctgcctggatcctgattctcagaaaccagttactggactt ctgatccacatcctgaccctggtccactgtgaccctcagactcttaattatgctccacca 
ggggactcgaaaatgtacagccagcggtttggcaccgtacagcgggaggttaagggcccc acccccaaagtggtgatcgtgagatccaagcctcctaaaggccaaggagctgagcaccat ctagaaagaatccgacgcagccatcagaagcataatgctattttggcttccattaagtca agtgagcgggatcgcttgaaagctgagtgggaccagcacaatgactgcaagattttggac agccttgtgcgagcaagaatcaaggatgctgtgcaagggtttatcattaacattgaagaa agacgaaataagctacgtgagcttttagcattagaagaaaatgagtattttacagaaatg caattgaagaaagaaaccattgaggagaaaaaagataggatgagagagaaaactaaatta ctaaaagagaagaatgaaaaagagaggcaggattttgtggctgaaaagctagaccagcaa ttcagggaacgctgtgaggagctccgtgttgaattgttatctatccatcagaagaaggtg tgtgaggagcggaaagcacagattgcatttaatgaggagctgagcaggcaaaagctggtg gaagagcagatgttctccaaactctgggaggaagaccgattagccaaggaaaagcgagaa gcccaagaggcgaggagacagaaagagctgatggagaacacacgcctggggctgaatgcc cagatcaccagcatcaaggcacaaaggcaggcgacacagctgctgaaggaagaggaggca cgccttgtggaaagtaacaacgcacagattaaacatgagaatgaacaggatatgctaaag aaacagaaggcaaagcaggaaactaggaccattttgcaaaaagccctacaagagaggata gaacatattcagcaggaatacagagacgaacaggacttgaacatgaagctcgtgcaaagg gcccttcaagacttacaggaagaggcagataaaaagaaacaaaaaagagaagatatgata agagaacagaagatataccataaatatttggcacagagacgtgaggaagaaaaagctcag gagaaagaatttgacagaatattagaggaagacaaggcaaagaagttggctgagaaggac aaggagctgagacttgaaaaggaggcaaggagacagcttgtggatgaggtcatgtgtaca agaaaacttcaagttcaagaaaagttgcaacgagaagctaaagaacaggaagaacgtgct atggaacagaaacacataaatgaaagtcttaaagaacttaactgtgaagagaaggagaat tttgcaagacgccaacgtttagcccaggagtacaggaagcaacttcagatgcaaatcgcc taccagcagcagtcccaagaagcagagaaggaagagaaacgccgagagtttgaagcaggt gtagcagcaaacaagatgtgtttggacaaggtgaatgaaagatgtgatgctgctggggcc ctggatctgaaatctctgacaaatcccgaagccaatcctggtccatgtacaccggagaag atagccagggtctgccagccctcactcaaaactgcaaaacccaaaacgctctga >gi568815580r:50127384_50366404|GENSCAN_predicted_peptide_5|1167_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSKEYTFFSAPHHTYS KIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIREELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK 
KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLN QEEVESLNRPITGSEIVAIINSLPTKKSPGPDGLTAEFYQRYKEELVPFLLKLFQSIEKE GILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDTKILNKILANRIQQHIKKLIHHD QVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDNIQQPFMLKTLNKLGI DGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQE KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL YTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNI PCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILS QKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEK NKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLG ITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRPPTTWEKIFAT YSSDKGLISRIYNELKQIYKKKTNNPIKKWVKDMNRHFSKEDIYAAEKYMKKCSSSLAIR EMQIKTTMRYHLTPVRMAIIKKSGNNRRKKKKKKRKKRKRKKKKKERRRRKKKEEGEGGG RRRKEEEGGGRRRKEEEGEGEEEGEGEEGEEGEGGEGGEGGEGEGGEGGEGEEGEEEGGG GGGEGEEGEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE EEEEEEEEEIALGGDQILTGKHMAFLK >gi568815580r:50127384_50366404|GENSCAN_predicted_CDS_5|3504_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaaaagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaagccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacc acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcctacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagaagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaatagacacaataaaaaacgataaaggggatatcaccacc gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaatggatacattcctcgacacatacactctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggctctgaaattgtggcaataatc 
aatagtttaccaaccaaaaagagtccaggaccagatggactcacagccgaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcattctgataccaaagccgggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatacaaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa aaagcctttgacaatattcaacaacccttcatgctaaaaactctcaataaattaggtatt gatgggacgtatttcaaaataataagagctatctatgacaaacccacagccaatatcata ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcaccgctcctattcaacatagtgttggaagttctggccagggcaatcaggcaggag aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagac gacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctgata agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta tacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaatagct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatt tacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaaca gagccctcagaaataatgccgcatatctacaactatctgatctttgacaaacctgagaaa aacaagcaatggggaaaggattcactatttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaaga tggattaaagatttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggc attaccattcaggacataggcgtgggcaaggacttcatgtccaaaacaccaaaagcaatg gcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagca aaagaaactaccatcagagtgaacaggccacctacaacatgggagaaaatttttgcaacc tactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaag aaaaaaacaaacaaccccatcaaaaagtgggtgaaggacatgaacagacacttctcaaaa gaagacatttatgcagccgaaaaatacatgaagaaatgctcatcatcactggccatcaga 
gaaatgcaaatcaaaaccactatgagatatcatctcacaccagttagaatggcaatcatt aaaaagtcaggaaacaacagaagaaagaagaagaagaagaagaggaagaagaggaagagg aagaagaagaagaaagaaagaagaagaaggaagaagaaggaagaaggagaaggaggagga aggaggaggaaggaggaggaaggaggaggaaggaggaggaaggaggaggaaggagaagga gaagaagaaggagaaggagaagaaggagaagaaggagaaggaggagaaggaggagaagga ggagaaggagaaggaggagaaggaggagaaggagaagaaggagaagaagaaggaggagga ggaggaggagaaggagaggaaggggaggaagaggaggaagaggaggaagaggaggaggaa gaggaagaggaggaagaggaggaagaggaggaggaagaggaagaggaggaagaggaggag gaagaggaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagag gaagaggaagaagaagaagaagaagaaatagctttaggcggagatcaaatcctaacagga aaacacatggcatttttgaaataa >gi568815580r:50127384_50366404|GENSCAN_predicted_peptide_6|338_aa MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSCMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLARQANVQIQEIQRTSQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFIDK QMLRDFVTTRPALKELLKEALNMERNNRYQPLHNHAKM >gi568815580r:50127384_50366404|GENSCAN_predicted_CDS_6|1017_bp atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacagaacaaagctgtatggagaatgattttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata cagagaacgtcacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt accctcaaaggaaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt 
tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttatagacaag caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcg ctaaacatggaaaggaacaaccggtaccagccgctgcacaatcatgccaaaatgtaa >gi568815580r:50127384_50366404|GENSCAN_predicted_peptide_7|67_aa MSICVSCFEEEEEEEEEERRRRKKKKKKDKEKEEERRKEEERRKEEEGRRRKKKKKEEEE ESNYLFF >gi568815580r:50127384_50366404|GENSCAN_predicted_CDS_7|204_bp atgtccatctgtgtctcctgcttcgaagaagaagaagaggaggaagaagaagaaagaaga agaaggaagaaaaagaagaagaaggataaggagaaggaagaagaaagaagaaaagaagaa gaaagaaggaaggaggaagaaggaagaagacggaagaagaagaagaaagaagaagaagaa gaaagcaattacctttttttttga