GENSCAN 1.0	Date run: 5-Nov-116	Time: 14:12:47

Sequence gi568815576r:30143654_30344936 : 201283 bp : 49.48% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -   3229   3224    6                               1.05
 1.04 Term -  27712  27616   97  0  1   62   44    84 0.053  -1.16
 1.03 Intr -  48577  48480   98  1  2   92   95    64 0.457   6.31
 1.02 Intr -  52974  52806  169  2  1   79   14   136 0.363   5.25
 1.01 Init -  65499  65372  128  0  2   37   84    93 0.194   3.43
 1.00 Prom -  67289  67250   40                              -7.46

 2.00 Prom +  67764  67803   40                              -4.66
 2.01 Init +  73358  73430   73  1  1   49   75    79 0.660   1.95
 2.02 Intr +  74114  74284  171  0  0  107   72    69 0.933   7.11
 2.03 Intr +  74692  74924  233  2  2   55   46   147 0.920   4.69
 2.04 Term +  79692  79994  303  2  0   55   42   155 0.772   2.77
 2.05 PlyA +  80502  80507    6                               1.05

 3.05 PlyA -  80776  80771    6                               1.05
 3.04 Term -  85650  85496  155  1  2   94   50    24 0.147  -2.72
 3.03 Intr -  88773  88668  106  0  1  103   93    99 0.535  11.69
 3.02 Intr -  89238  89206   33  0  0   82   94    51 0.868   3.52
 3.01 Init -  92910  92812   99  0  0   87   44    33 0.214  -0.94
 3.00 Prom -  95291  95252   40                              -7.46

 4.08 PlyA -  96818  96813    6                               1.05
 4.07 Term - 100408  99998  411  1  0  121   43   884 0.999  82.44
 4.06 Intr - 101280 101102  179  1  2   95   92    88 0.998   9.54
 4.05 Intr - 107431 107402   30  2  0  121   81     1 0.052   0.80
 4.04 Intr - 109201 109066  136  1  1   58   75    67 0.142   2.54
 4.03 Intr - 120811 120296  516  1  0  103   45   545 0.015  44.86
 4.02 Intr - 121491 121349  143  1  2  102   66   132 0.872  12.47
 4.01 Init - 123146 123113   34  2  1   97   89    33 0.416   3.10
 4.00 Prom - 138412 138373   40                              -1.36

 5.10 PlyA - 141484 141479    6                               1.05
 5.09 Term - 142035 141967   69  0  0  118   54   137 0.989  11.24
 5.08 Intr - 142273 142119  155  2  2   76   51   283 0.663  23.19
 5.07 Intr - 142445 142363   83  1  2  112   73   104 0.997  10.58
 5.06 Intr - 142723 142610  114  0  0   71   56   173 0.999  11.96
 5.05 Intr - 143298 143172  127  1  1   81   43   187 0.609  13.24
 5.04 Intr - 143634 143502  133  0  1   64   77   171 0.992  13.82
 5.03 Intr - 143907 143720  188  1  2  112  105   212 0.977  24.71
 5.02 Intr - 145123 145053   71  0  2   40   81   120 0.959   5.33
 5.01 Init - 145844 145732  113  2  2  110   78   158 0.996  14.97
 5.00 Prom - 146554 146515   40                             -12.49

 6.10 PlyA - 148358 148353    6                               1.05
 6.09 Term - 149198 148722  477  2  0  100   40   407 0.999  31.94
 6.08 Intr - 150200 149998  203  0  2   72   77   308 0.510  27.10
 6.07 Intr - 150457 150268  190  1  1   73   79   195 0.985  16.26
 6.06 Intr - 151208 151143   66  2  0   81  105   131 0.987  13.20
 6.05 Intr - 151402 151288  115  0  1  101   66   209 0.974  20.45
 6.04 Intr - 152190 152084  107  0  2   95   88   153 0.998  15.01
 6.03 Intr - 155898 155791  108  0  0   75  105   106 0.978  11.48
 6.02 Intr - 160977 160878  100  2  1  118  100   141 0.931  18.41
 6.01 Init - 183228 183020  209  0  2   81   77   431 0.977  37.59
 6.00 Prom - 187948 187909   40                              -4.66

 7.14 PlyA - 188360 188355    6                               1.05
 7.13 Term - 191042 190941  102  2  0   91   42   111 0.993   5.08
 7.12 Intr - 191885 191814   72  2  0   85  105    56 0.939   6.60
 7.11 Intr - 192100 191999  102  1  0   90   80   112 0.979  10.97
 7.10 Intr - 193527 193373  155  1  2  118   95    90 0.871  12.49
 7.09 Intr - 194244 194037  208  0  1  101   82   175 0.999  16.85
 7.08 Intr - 195381 195136  246  0  0   76   72   288 0.996  23.66
 7.07 Intr - 195598 195477  122  2  2   77   86   219 0.999  20.81
 7.06 Intr - 196728 196543  186  1  0   70   94   295 0.929  27.96
 7.05 Intr - 196958 196845  114  0  0  101   87     3 0.787   1.92
 7.04 Intr - 197159 197042  118  2  1  119   97   157 0.999  19.74
 7.03 Intr - 198232 198039  194  2  2   69   99   269 0.919  25.31
 7.02 Intr - 198697 198547  151  1  1   93  100   265 0.999  27.94
 7.01 Intr - 199226 199152   75  2  0   93   97    42 0.700   5.31

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 120811 120230  582  1  0  103   38   551 0.983  46.10

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576r:30143654_30344936|GENSCAN_predicted_peptide_1|163_aa
MGKRLGVIQEEAWKTGESSGLGAENHGPLSGPSWASYRPRGRGPDLAPLFLGLIYEVLTH
IAKQLSELLANRKVIVTEYFLCQTLTLATAMVMTLAMAMGLLLLHQMGCLNSSGLPFIGP
SQCLAQNPFPRRSRTMGVLNLLNLECLQNLRVDMLNQWLSVEV

>gi568815576r:30143654_30344936|GENSCAN_predicted_CDS_1|492_bp
atggggaaacgacttggtgtgatccaagaggaagcttggaaaaccggggaaagctccggc
ctgggagctgagaaccacggtcctctttccgggcccagctgggcctcctaccggccacgt
ggccggggcccagacctggcccctctgtttcttggtttaatttatgaggttctcacacac
attgcaaagcaattatctgagctgctggccaacaggaaggtaattgtaactgagtacttc
ctttgtcaaacgctgacgctggccaccgcgatggtgatgacgctggccatggcaatgggc
ttattgctgctgcatcagatgggctgcctgaacagttcaggacttcctttcatcggaccc
agccagtgccttgcacagaacccattcccaagaaggtcgagaacaatgggagttttgaac
ctgctgaatttggagtgcctacaaaaccttcgtgtggacatgctgaatcagtggctatct
gtggaagtctga

>gi568815576r:30143654_30344936|GENSCAN_predicted_peptide_2|259_aa
MAGWKGGLASIFFCPPPLATPGWGEGTTFALRQTARPLPGQELSQTSVHQHAAGVDEPGQ
PEESCPGSHTAASALSGPGRVGGPAGRQSPCEALGTTIVTVTGVHVDDHVLQASNMLSTS
HFISIYRSRTPQKRVARPAERKGPCPKSPSTEELGFQPQPSSPLNAPLGSSPLSGSSVHR
SLTQREAHSVHRKDEGQDVWAGASPVGGNDLDNEGDHSLVHETPDTRPGLCLQSQKEDAL
GETREVPRGNGQAEGVPVN

>gi568815576r:30143654_30344936|GENSCAN_predicted_CDS_2|780_bp
atggccggctggaagggaggcctcgccagcatcttcttctgcccacctcccctggccact
ccgggttggggagagggcaccacctttgccctgcgccagacagccaggcccttgccaggc
caagagttgagtcagacttcagttcaccaacatgctgccggggtggacgagcctggacag
ccagaggaatcttgtcccggctcccatacggcagccagtgctttgtcaggcccagggcgg
gtgggagggccggctgggcgccagagcccatgcgaggctctcggcacgaccattgtcact
gttactggggttcacgtggatgatcacgtgctgcaggccagcaacatgctgagtacttca
catttcatatccatctaccggagtcgcactccacagaagagagtggcaagacccgcagag
aggaaaggaccttgcccaaagtcccccagcactgaagagctgggcttccagccccagccc
tcgagtcccctgaatgcacccctgggaagcagcccgctgtccggctcctccgtgcacagg
tctctaacccagagagaggcccacagtgtccaccggaaggacgaggggcaggacgtgtgg
gcaggagcctcccctgtaggaggaaatgacctggataatgagggtgaccacagcctcgtt
catgaaaccccagatacccgacctggtctctgtcttcaaagccagaaggaggacgcttta
ggggaaacaagggaagtcccacggggaaatggtcaggcggagggcgttcctgttaactga

>gi568815576r:30143654_30344936|GENSCAN_predicted_peptide_3|130_aa
MAKTQSNETGPKRGQDDTGTNWGFSLTEGGLLKIQAAAKGSSKQGFPIMPGTILGAQNVF
ANGLGKFVASRAEDECRFPEGASCPEPAAGQADLTPAGGLCCVGFLQGVARTPWMAGCPW
QNGPEFPSQP

>gi568815576r:30143654_30344936|GENSCAN_predicted_CDS_3|393_bp
atggctaagacacagtccaatgaaacgggtcctaaaaggggccaggatgacacaggcaca
aactggggctttagcctgactgagggtggacttttgaagatccaagcagccgccaagggc
tcttccaagcagggtttcccgataatgcctggtaccattttgggtgcccagaatgtcttt
gccaacggcctggggaagttcgtggcctccagagcagaggatgagtgcaggttccctgaa
ggggcgtcctgtcccgaacccgcagctggccaggcggacctgaccccagctggagggctg
tgctgcgtgggcttcctgcagggtgtagctcggaccccctggatggctggttgcccttgg
caaaatgggccagagtttccctcacagccttag

>gi568815576r:30143654_30344936|GENSCAN_predicted_peptide_4|482_aa
MGVLLTQRTLLSLVLALLFPSMASMAAIGSCSKEYRVLLGQLQKQTDLMQDTSRLLDPYI
RIQGLDVPKLREHCRERPGAFPSEETLRGLGRRGFLQTLNATLGCVLHRLADLEQRLPKA
QDLERSGLNIEDLEKLQMARPNILGLRNNIYCMAQLLDNSDTAEPTKAGRGASQPPTPTP
ASDAFQRKLEGCRFLHGYHRFMHSVGRVFSKWGESPNRSRRHSPHQALRKGAVALGLRLP
WEGGCGHFLFPHPVVRMKDSLVEYERKACDCPGLHPGPPEGSAHFPGVVPLLLVLHWKHG
AGSPLPITPVNATCAIRHPCHNNLMNQIRSQLAQLNGSANALFILYYTAQGEPFPNNLDK
LCGPNVTDFPPFHANGTEKAKLVELYRIVVYLGTSLGNITRDQKILNPSALSLHSKLNAT
ADILRGLLSNVLCRLCSKYHVGHVDVTYGPDTSGKDVFQKKKLGCQLLGKYKQIIAVLAQ
AF

>gi568815576r:30143654_30344936|GENSCAN_predicted_CDS_4|1449_bp
atgggggtactgctcacacagaggacgctgctcagtctggtccttgcactcctgtttcca
agcatggcgagcatggcggctataggcagctgctcgaaagagtaccgcgtgctccttggc
cagctccagaagcagacagatctcatgcaggacaccagcagactcctggacccctatata
cgtatccaaggcctggatgttcctaaactgagagagcactgcagggagcgccccggggcc
ttccccagtgaggagaccctgagggggctgggcaggcggggcttcctgcagaccctcaat
gccacactgggctgcgtcctgcacagactggccgacttagagcagcgcctccccaaggcc
caggatttggagaggtctgggctgaacatcgaggacttggagaagctgcagatggcgagg
ccgaacatcctcgggctcaggaacaacatctactgcatggcccagctgctggacaactca
gacacggctgagcccacgaaggctggccggggggcctctcagccgcccacccccacccct
gcctcggatgcttttcagcgcaagctggagggctgcaggttcctgcatggctaccatcgc
ttcatgcactcagtggggcgggtcttcagcaagtggggggagagcccgaaccggagccgg
agacacagcccccaccaggccctgaggaagggggctgtggctcttggcctcaggctgccc
tgggaaggtggctgtggccacttcctcttcccgcaccctgtggtgaggatgaaagactcc
ctggtggagtacgaaagaaaggcctgtgactgtcctgggctccacccaggcccacctgag
ggctctgctcattttccaggagttgtgcccctgctgttggttctgcactggaaacatggg
gcggggagccccctccccatcacccctgtcaacgccacctgtgccatacgccacccatgt
cacaacaacctcatgaaccagatcaggagccaactggcacagctcaatggcagtgccaat
gccctctttattctctattacacagcccagggggagccgttccccaacaacctggacaag
ctatgtggccccaacgtgacggacttcccgcccttccacgccaacggcacggagaaggcc
aagctggtggagctgtaccgcatagtcgtgtaccttggcacctccctgggcaacatcacc
cgggaccagaagatcctcaaccccagtgccctcagcctccacagcaagctcaacgccacc
gccgacatcctgcgaggcctccttagcaacgtgctgtgccgcctgtgcagcaagtaccac
gtgggccatgtggacgtgacctacggccctgacacctcgggtaaggatgtcttccagaag
aagaagctgggctgtcaactcctggggaagtataagcagatcatcgccgtgttggcccag
gccttctag

>gi568815576r:30143654_30344936|GENSCAN_predicted_peptide_5|350_aa
MELHILEHRVRVLSVARPGLWLYTHPLIKLLFLPRRSRCKFFSLTETPEDYTLMVDEEGF
KELPPSEFLQVAEATWLVLNVSSHSGAAVQAAGVTKIARSVIAPLAEHHVSVLMLSTYQT
DFILVREQDLSVVIHTLAQEFDIYREVGGEPVPVTRDDSSNGFPRTQHAGPSPTVHPIQS
PQNRFCVLTLDPETLPAIATTLIDVLFYSHSTPKEAASSSPEPSSITFFAFSLIEGYISI
VMDAETQKKFPSDLLLTSSSGELWRMVRIGGQPLGFDECGIVAQIAGPLAAADISAYYIS
TFNFDHALVSTKGWGRGESEAQKCGRQEVPEDGIGSVIEVLQRRQEGLAS

>gi568815576r:30143654_30344936|GENSCAN_predicted_CDS_5|1053_bp
atggagctgcacatcctagaacaccgggtgcgggtgctgagcgtcgcccgtcccggtctc
tggctctacacccacccgctcatcaagctgctcttcctgccccgccgcagccggtgcaag
ttcttcagcctgacggagacccctgaggattacacgcttatggtggacgaggagggcttt
aaagagctgcccccatctgagttcctgcaagtagctgaggccacatggctggtgctgaac
gtgtcgtctcacagcggtgcggcagtgcaggctgctggggtcaccaagatcgcccgttcg
gtcatcgcgccactggccgagcaccacgtgtctgtgctgatgctgtccacttaccagacg
gacttcatcctggtgcgggagcaggacctgtccgtggtgatccacacgctggcccaggag
ttcgacatttaccgcgaggtgggcggagagcctgtgcctgtgacgagggatgattccagc
aatggctttccccgcactcagcatgcagggcccagccccacggtgcatcccatccagagc
ccacagaaccgcttctgtgtcctcacactggaccctgagacgcttccagccatcgccacc
accctcatagatgtcctcttctactcgcacagcacccccaaggaggcagcctctagcagt
cctgaacccagctccatcacgttctttgccttctccctcatcgagggttatatctccatt
gtcatggatgctgaaacacagaaaaagttccccagtgacctcctgctgaccagctcctcg
ggggagctgtggaggatggtgcgcatcggtggacagcccctgggctttgatgaatgtggc
atcgtggcacagattgcaggtcccctggctgccgctgacatctctgcctactacatcagc
accttcaacttcgaccacgccctggtgagcaccaagggctggggcagaggggagagtgag
gcccagaaatgtgggcggcaggaggtgcccgaggacggtatcggcagcgtcatcgaggtc
ctccagcggcggcaggaaggcctggcttcctga

>gi568815576r:30143654_30344936|GENSCAN_predicted_peptide_6|524_aa
MAKSNGENGPRAPAAGESLSGTRESLAQGPDAATTDELSSLGSDSEANGFAERRIDKFGF
IVGSQGAEGALEEVPLEVLRQRESKWLDMLNNWDKWMAKKHKKIRLRCQKGIPPSLRGRA
WQYLSGGKVKLQQNPGKFDELDMSPGDPKWLDVIERDLHRQFPFHEMFVSRGGHGQQDLF
RVLKAYTLYRPEEGYCQAQAPIAAVLLMHMPAEQAFWCLVQICEKYLPGYYSEKLEAIQL
DGEILFSLLQKVSPVAHKHLSRQKIDPLLYMTEWFMCAFSRTLPWSSVLRVWDMFFCEEK
PQKASLYLLPIPHAGVKIIFRVGLVLLKHALGSPEKVKACQGQYETIERLRSLSPKIMQE
AFLVQEVVELPVTERQIEREHLIQLRRWQETRGELQCRSPPRLHGAKAILDAEPGPRPAL
QPSPSIRLPLDAPLPGSKAKPKPPKQAQKEQRKQMKGRGQLEKPPAPNQAMVVAAAGDAC
PPQHVPPKDSAPKDSAPQDLAPQVSAHHRSQESLTSQESEDTYL

>gi568815576r:30143654_30344936|GENSCAN_predicted_CDS_6|1575_bp
atggcgaagagcaacggagagaatgggccgcgcgcgcccgcggccggggaaagcctgtcg
ggaacccgggagagcctggcccagggccccgacgccgcaaccaccgacgaactcagctct
ctcgggtctgactcggaggccaacggcttcgccgagcgccgcatcgacaagttcggcttc
atcgtgggctcgcagggcgccgagggcgcgctggaggaagtacccctggaggtgctgagg
cagagggagtccaagtggctggacatgctcaacaactgggacaaatggatggccaagaag
cacaaaaagattcgtctgcggtgccaaaagggcatcccgccttctctgcggggccgtgct
tggcagtacctgtcaggaggcaaggtgaagttacagcagaaccctggaaagtttgacgag
ctggacatgtcccctggggaccccaagtggctggacgtgattgagcgtgacctgcaccgg
cagttcccattccatgagatgtttgtgtcccgggggggccacggccagcaggacctattc
cgtgtgctgaaggcctacacgctgtaccggcccgaggagggctactgccaggcccaggcg
cccattgccgctgtcttgctcatgcatatgcctgctgagcaagccttctggtgcctggta
cagatctgtgagaagtacctgcccggctactacagcgagaaactggaggcgatccagctg
gacggggagatccttttctcgctgttgcagaaggtgtcgccggtggcccacaagcacctc
agccgtcagaagatcgacccgctcctctatatgacagaatggttcatgtgcgccttctcc
cgaaccttgccctggagctctgtgctgcgtgtctgggacatgttcttctgtgaagagaaa
ccccaaaaagcttccttgtacttactgcccatcccccatgcaggggtcaagatcatcttc
cgggtggggctggtgctgctgaagcacgcgctgggctcccctgagaaggtcaaagcctgc
cagggccagtacgagaccatcgagcgactgcggagcctcagccccaagatcatgcaggag
gcctttctggtccaggaggtggtggagttgcccgtgacagagcgccagattgagcgcgaa
cacctcattcagctgcggcgctggcaggagacccggggtgagctgcagtgccgctccccg
cccaggctgcatggtgccaaggctatcttggatgcagaacctggtccccggcctgcccta
caaccttcaccatccatccgcctgcccctagatgcccccctccctggctccaaagccaag
cccaagccacccaagcaggcccagaaggagcagcggaaacagatgaaggggagagggcag
ctggagaagcccccagccccaaatcaagccatggtggtggccgctgcaggagatgcatgt
cccccacagcatgtgcccccgaaggactcagcccccaaggactcagcccctcaggatttg
gctccccaggtctcagcccaccaccgctcccaggagagcttgacgtcccaagagagtgag
gacacctacttgtaa

>gi568815576r:30143654_30344936|GENSCAN_predicted_peptide_7|614_aa
ILIPPKGLFSKLKKEAENPREVLDQVCYRVEWAKFQERERKKEEEEKEKERVAYAQIDWH
DFVVVETVDFQPNEQGNFPPPTTPEELGARILIQERYEKFGESEEVEMEVESDEEDDKQE
KAEEPPSQLDQDTQVQDMDEGSDDEEEGQKVPPPPETPMPPPLPPTPDQVIVRKDYDPKG
GLEVQLIVKALQARLLQVSWWEQGPVISGALVPFTVPASKPLPPAPAPDEYLVSPITGEK
IPASKMQEHMRIGLLDPRWLEQRDRSIREKQSDDEVYAPGLDIESSLKQLAERRTDIFGV
EETAIGKKIGEEEIQKPEEKVTWDGHSGSMARTQQAAQANITLQEQIEAIHKAKGLVPED
DTKEKIGPSKPNEIPQQPPPPSSATNIPSSAPPITSVPRPPTMPPPVRTTVVSAVPVMPR
PPMASVVRLPPGSVIAPMPPIIHAPRINVVPMPPSAPPIMAPRPPPMIVPTAFVPAPPVA
PVPAPAPMPPVHPPPPMEDEPTSKKLKTEDSLMPEEEFLRRNKGPVSIKVQVPNMQDKTE
WKLNGQVLVFTLPLTDQVSVIKVKIHEATGMPAGKQKLQYEGIFIKDSNSLAYYNMANGA
VIHLALKERGGRKK

>gi568815576r:30143654_30344936|GENSCAN_predicted_CDS_7|1845_bp
atcttgattccacccaaaggtttattttcaaagctcaagaaagaggctgaaaacccccga
gaagttttggatcaggtgtgttaccgagtggaatgggccaaattccaggaacgtgagagg
aagaaggaagaagaggagaaggagaaggagcgggtggcctatgctcagatcgactggcat
gattttgtggtggtggaaacagtggacttccaacccaatgagcaagggaacttccctccc
cccaccacgccagaggagctgggggcccgaatcctcattcaggagcgctatgaaaagttt
ggggagagtgaggaagttgagatggaggtcgagtctgatgaggaggatgacaaacaggag
aaggcggaggagcctccttcccagctggaccaggacacccaagtacaagatatggatgag
ggttcagatgatgaagaagaagggcagaaagtgcccccacccccagagacacccatgcct
ccacctctgcccccaactccagaccaagtcattgtccgcaaggattatgatcccaaagga
gggcttgaggtgcagcttattgtaaaagcattgcaagcccgtctgttacaggtttcatgg
tgggaacagggcccagtcatctctggtgccctagtgcctttcacagtgccagcctccaag
cccttgcctccagcccctgctccagatgagtatcttgtgtcccccattactggggagaag
atccccgccagcaaaatgcaggaacacatgcgcattggacttcttgaccctcgctggctg
gagcagcgggatcgctccatccgtgagaagcagagcgatgatgaggtgtacgcaccaggt
ctggatattgagagcagcttgaagcagttggctgagcggcgtactgacatcttcggtgta
gaggaaacagccattggtaagaagatcggtgaggaggagatccagaagccagaggaaaag
gtgacctgggatggccactcaggcagcatggcccggacccagcaggctgcccaggccaac
atcaccctccaggagcagattgaggccattcacaaggccaaaggcctggtgccagaggat
gacactaaagagaagattggccccagcaagcccaatgaaatccctcaacagccaccgcca
ccatcttcagccaccaacatccccagctcggctccacccatcacttcagtgccccgacca
cccacaatgccacctccagttcgtactacagttgtctccgcagtacccgtcatgccccgg
cccccaatggcatctgtggtccggctgcccccaggctcagtgatcgcccccatgccgccc
atcatccacgcgcccagaatcaacgtggtgcccatgcctccctcggcccctcctattatg
gccccccgcccaccccccatgattgtgccaacagcctttgtgcctgctccacctgtggca
cctgtcccagctccagccccaatgccccctgtgcatcccccacctcccatggaagatgag
cccacctccaaaaaactgaagacagaggacagcctcatgccagaggaggagttcctgcgc
agaaacaagggtccagtgtccatcaaagtccaggtgcccaacatgcaggataagacggaa
tggaaactgaatgggcaggtgctggtcttcaccctcccactcacggaccaggtctctgtc
attaaggtgaagattcatgaagccacaggcatgcctgcagggaaacagaagctacagtat
gagggtatcttcatcaaagattccaactcactggcttactacaacatggccaatggcgca
gtcatccacctggccctcaaggagagaggcgggaggaagaagtag