GENSCAN 1.0 Date run: 17-Feb-117 Time: 10:50:19

Sequence gi568815587f:62142308_62343478 : 201171 bp : 43.75% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2675   2784  110  1  2   99   87   239 0.951  23.98
 1.02 Intr +   2862   2982  121  0  1  108   81   306 0.947  32.40
 1.03 Intr +   3322   3456  135  0  0   44   80   223 0.704  17.86
 1.04 Intr +   4351   4595  245  0  2  108   78   518 0.994  49.10
 1.05 Intr +   6169   6247   79  1  1   52  100   219 0.727  19.05
 1.06 Intr +   6432   6539  108  2  0   89   50   231 0.607  19.88
 1.07 Intr +   7750   7900  151  0  1   70   99   138 0.760  12.84
 1.08 Term +   9455   9669  215  0  2   89   54   263 0.999  20.39
 1.09 PlyA +  10000  10005    6                               1.05

 2.00 Prom +  16939  16978   40                              -3.86
 2.01 Init +  17866  18029  164  0  2   70   71    70 0.040   2.71
 2.02 Intr +  20309  20461  153  2  0   61   94    49 0.046   1.99
 2.03 Intr +  26925  27389  465  0  0   91  -69   579 0.450  34.74
 2.04 Term +  27398  27557  160  2  1   19   51   234 0.696  10.21
 2.05 PlyA +  30375  30380    6                               1.05

 3.00 Prom +  36241  36280   40                              -4.96
 3.01 Init +  47978  48032   55  1  1  112   84   116 0.998  13.15
 3.02 Intr +  49749  49936  188  1  2   72   19   112 0.483   2.11
 3.03 Intr +  50158  50206   49  0  1  107   38    87 0.788   3.85
 3.04 Term +  64002  64135  134  1  2   88   42    99 0.192   3.55
 3.05 PlyA +  64535  64540    6                               1.05

 4.00 Prom +  66329  66368   40                              -6.26
 4.01 Init +  66425  66479   55  1  1   91  109   177 0.999  19.56
 4.02 Intr +  68106  68293  188  1  2  115   46   174 0.674  15.31
 4.03 Intr +  85289  85325   37  1  1   73   82    28 0.052  -1.46
 4.04 Intr +  87430  87539  110  2  2   68   76    59 0.124   2.70
 4.05 Term +  89903  90112  210  1  0   54   48    81 0.137  -2.01
 4.06 PlyA +  91705  91710    6                               1.05

 5.00 Prom +  98418  98457   40                              -1.96
 5.01 Init + 100001 100055   55  1  1  114   96   121 0.993  15.33
 5.02 Intr + 100982 101169  188  0  2   88   53   108 0.927   6.71
 5.03 Intr + 102036 102199  164  2  2  101   40    10 0.121  -3.73
 5.04 Intr + 104190 104235   46  0  1   92  110    32 0.099   4.21
 5.05 Intr + 109865 109931   67  1  1   61   52    83 0.014   0.48
 5.06 Intr + 127634 127769  136  0  1   51   13    92 0.029  -2.37
 5.07 Intr + 127855 127964  110  1  2   29  109   186 0.067  14.73
 5.08 Intr + 128574 128761  188  1  2   72   46   145 0.452   8.11
 5.09 Term + 147408 147941  534  2  0  -18   42   231 0.181   2.15
 5.10 PlyA + 148119 148124    6                               1.05

 6.00 Prom + 148361 148400   40                              -7.26
 6.01 Init + 148744 149870 1127  0  2   60   53   286 0.898  15.97
 6.02 Term + 150797 150917  121  2  1   49   50    97 0.649  -0.05
 6.03 PlyA + 151694 151699    6                               1.05

 7.03 PlyA - 153993 153988    6                               1.05
 7.02 Term - 155351 155161  191  0  2   86   28   103 0.441   1.71
 7.01 Init - 156703 156649   55  1  1  112   96    90 0.998  11.67
 7.00 Prom - 159843 159804   40                              -5.06

 8.03 PlyA - 160337 160332    6                               1.05
 8.02 Term - 161924 161608  317  0  2   21   42   181 0.669   1.90
 8.01 Init - 164803 164749   55  1  1   92  100    67 0.927   9.75
 8.00 Prom - 167332 167293   40                              -4.26

 9.00 Prom + 172315 172354   40                              -6.26
 9.01 Init + 174519 174577   59  2  2   67   72    56 0.218   2.68
 9.02 Intr + 179685 179770   86  0  2   80   61    82 0.364   4.16
 9.03 Term + 188959 189227  269  2  2   22   50   416 0.846  26.86
 9.04 PlyA + 189804 189809    6                               1.05

10.00 Prom + 192984 193023   40                              -6.46
10.01 Init + 195671 195860  190  1  1   99   99   347 0.048  36.07
10.02 Intr + 198303 198416  114  1  0   57   81    55 0.352   2.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 109865 109954   90  1  0   61   45   112 0.924   1.92
S.002 Intr + 195660 195860  201  1  0   94   99   388 0.917  39.76

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:62142308_62343478|GENSCAN_predicted_peptide_1|387_aa

EKERQRLENLRRKEEAEQLRRQKVEEDKRRRLEEVKLKREERLRKVLQARERVEQMKEEK
KKQIEQKFAQIDEKTEKAKEERLAEEKAKKKAAAKKMEEVEARRKQEEEARRLRWLQQVR
AQEEEERRHQELLQKKKEEEQERLRKAAEAKRLAEQREQERREQERREQERREQERREQE
RREQERQLAEQERRREQERLQAERELQEREKALRLQKEQLQRELEEKKKKEEQQRLAERQ
LQEEQEKKAKEAAGASKALNVTVDVQSPACTSYQMTPQGHRAPPKINPDNYGMDLNSDDS
TDDEAHPRKPIPTWARGTPLSQAIIHQYYHPPNLLELFGTILPLDLEDIFKKSKPRYHKR
TSSAVWNSPPLQGARVPSSLAYSLKKH

>gi568815587f:62142308_62343478|GENSCAN_predicted_CDS_1|1164_bp

gagaaggagcggcagcgcctggagaatctgcggcggaaggaggaggccgagcagctgcgc
aggcagaaggtggaggaggacaagcggcggcggctggaggaggtgaagctgaagcgtgag
gaacgcctccgcaaggtgctgcaggcccgcgagcgggtggagcagatgaaggaggagaag
aagaagcagattgagcagaagtttgctcagatcgacgagaagactgagaaggccaaggag
gagcggctggcagaggagaaggccaagaaaaaggcggcggccaagaagatggaggaggtg
gaagcacgcaggaagcaggaagaggaggcacgtaggctcaggtggctgcagcaggtgcga
gcacaggaggaggaagagcggcggcaccaagagctgctgcagaagaagaaggaagaggag
caggagcggctgcggaaggcggccgaggctaagcggctggcagagcagcgggagcaggag
cggcgggagcaggagcggcgcgagcaggagcggcgcgagcaggagcggcgggagcaggag
cggcgcgagcaggagcgacagctggcagagcaggagcgtcggcgggagcaggagcggctc
caggccgagagggagctgcaggagcgggagaaggccctgcggctgcagaaggagcagctg
cagagggaactggaggagaagaagaagaaggaagagcagcagcgtctggctgagcggcag
ctgcaggaggagcaagagaagaaagccaaggaggcagcaggggccagcaaggccctgaat
gtgactgtggacgtgcagtctccagcttgtacctcatatcagatgactccgcaagggcac
agggcccctcccaagatcaacccagataactacgggatggatctgaatagcgacgactcc
accgatgatgaggcccatccccggaagcccatccccacctgggcccgaggcaccccgctc
agccaggctatcattcaccagtactaccacccaccgaaccttctggagctctttggaacc
attctcccactggacttggaggatatcttcaagaagagcaagccccgctatcacaagcgc
accagctctgctgtctggaactcaccgcccctgcagggcgccagggtccccagcagcctg
gcctacagcctgaagaagcactga

>gi568815587f:62142308_62343478|GENSCAN_predicted_peptide_2|313_aa

MDKFLDTYTLPSLNQEEVESLNRPITSSEIEAAINSLPTKKSSGPDGFTAEFYQRDMNEA
GNHHPQQTNTGTENQTLHVLTHKWEMNKENTWTQRGEHHTLGPVGRRPTSTDSSSLVAES
LAGIRKMATNFSVYEKIWFDKFKYDIAERRLYEQMNGPVASTSRQKNGASVILHDIARAR
ENIQKSLARSSGPRVSSGPNGEHSELVVLIASLEVENQRLSSVVQELQQAMSRLEAQLNV
LEKSSAGHRATVPQTQHVSPIGALAKKPTTPAEDDKGNDIDLFGSNNEEEDEEAAQLREE
WLPFPPVIRKALL

>gi568815587f:62142308_62343478|GENSCAN_predicted_CDS_2|942_bp

atggataaattcctggacacatacaccctcccaagtctaaaccaggaagaagtcgaatcc
ctaaatagaccaataacaagttctgaaattgaggcagcaattaatagcctaccaaccaaa
aaaagttcaggaccagatggattcacagccgaattctaccagagggacatgaatgaagct
ggaaaccatcatcctcagcaaactaacacaggaacagaaaaccaaacactgcacgttctc
actcataagtgggagatgaacaaggagaacacatggacacagcgaggggaacatcacaca
ctggggcctgtcggcaggcgtcccacgtccaccgattcctcctccctcgttgctgagtcc
ttggctggcatcagaaaaatggctacaaacttctcagtatatgagaagatctggtttgac
aagttcaaatatgacattgcagaaaggagattgtacgagcaaatgaacgggcctgtggcc
agcacctcccgccagaagaatggcgccagcgtgatcctccatgacattgcgagagccaga
gagaacatccagaaatccctggccagaagctcaggccccagggtctccagcggccccaac
ggagaacacagcgagcttgttgtcctgatcgccagtctggaagtggagaaccagaggctg
agcagcgtggtgcaggagctgcagcaggccatgtccaggctggaggcccagctgaacgtg
ctggagaagagctctgctggccaccgggccacagtccctcagacccagcacgtgtctccc
attggagccctggccaagaagccaaccacaccagcagaggatgacaagggcaatgacatt
gacctttttggcagcaacaatgaggaggaggacgaggaggcagcacagctgcgggaggaa
tggctgccatttcccccagtaattagaaaagccttattgtga

>gi568815587f:62142308_62343478|GENSCAN_predicted_peptide_3|141_aa

MRLSVCLLLLTLALCCYRANAVVCQALGSEITGFLLAGKPVFKFQLAKFKAPLEAVAAKM
EVKKCVDTMAYEKRVLITKTLVTYLDAVSPECAEDMGNVSLDITIHSSKSQLKQHLLLQA
ALGGLQATAPGQAIYDSIQLL

>gi568815587f:62142308_62343478|GENSCAN_predicted_CDS_3|426_bp

atgaggctgtcggtgtgtctcctgctgctcacgctggccctttgctgctaccgggcaaat
gcagtggtctgccaagctcttggttctgaaatcacaggcttcttattagctggaaaacct
gtgttcaagttccaacttgccaaatttaaggcacctctggaagctgttgcagccaagatg
gaagtgaagaaatgcgtggatacgatggcctatgagaaaagagtgctaattacaaaaaca
ttggtcacctacctggatgccgtgtccccagagtgtgctgaggacatggggaatgtgtcc
ttggacatcaccatccactcatccaagtctcagctgaaacagcacctgctcctacaggct
gccttggggggcttacaggccactgcccctgggcaggccatctatgacagcattcagctg
ttgtga

>gi568815587f:62142308_62343478|GENSCAN_predicted_peptide_4|199_aa

MKLLMVLMLAALLLHCYADSGCKLLEDMVEKTINSDISIPEYKELLQEFIDSDAAAEAMG
KFKQCFLNQSHRTLKNFGLMMMNQRAPENILKRETFKLQIIMQQRFQPIPGEDTAPMPSR
SYSTSTRQNRCTPAVCKHNIGSFVGCRVESLMKSTSLNEDLVHRDYGPADTMLPVPVSES
IILWVLNILTIVTVIPEIG

>gi568815587f:62142308_62343478|GENSCAN_predicted_CDS_4|600_bp

atgaagctgctgatggtcctcatgctggcggccctcctcctgcactgctatgcagattct
ggctgcaaactcctggaggacatggttgaaaagaccatcaattccgacatatctatacct
gaatacaaagagcttcttcaagagttcatagacagtgatgccgctgcagaggctatgggg
aaattcaagcagtgtttcctcaaccagtcacatagaactctgaaaaactttggactgatg
atgatgaatcagagggcccctgaaaacatcttgaaaagagagaccttcaagcttcagata
atcatgcaacaaaggttccagccaattccaggtgaagacaccgccccaatgccatcaaga
agctactccacctccactagacagaacaggtgcacacctgctgtgtgcaagcataacatt
gggtcctttgttggctgcagagttgagtcactgatgaagtccacctctctgaatgaggat
ttggttcatagggattatggaccagcagatacgatgctgccagtccctgtctcagaatcc
attattctttgggtattgaatatcctcaccattgtcactgttatccctgaaattgggtga

>gi568815587f:62142308_62343478|GENSCAN_predicted_peptide_5|495_aa

MKLSVCLLLVTLALCCYQANAEFCPALVSELLDFFFISEPLFKLSLAKFDAPPEAVAAKL
GVKRCTDQMSLQKRSLIAEVLEPGDSKPWDSPGNNLLLLSPTLVTCFHEDPGPHSRITPG
LLAWQEKAAPSPAWVLLYELFLLYILLIHARSNRDWGCVKWRLNRDFEDMNKKERDREEK
EGEQVEKEYKKEEEKQEDSTYRLAVFPLGHAPVYWSHASGFLDPCHPRLNTDSSSLTMKL
LMVLMLAALSQHCYAGSGCPLLENVISKTINPQVSKTEYKELLQEFIDDNATTNAIDELK
ECFLNQTDETLSNVEVFMRKQERSKIDTLTSQLKELEKQEQTHSKASRRQETTKIRAELK
EIETQKTLQKINKSRSWFFEKINKIDRPLERLIKKKREKNQIYAIKNDKGDITIEPTEIQ
TTIREYYKHLYSNKPENLEDMDKFLDTYTLPRLNQEEVESLKRPITGCEIEAIINSLPTK
KSPGPDGFTAEFYQR

>gi568815587f:62142308_62343478|GENSCAN_predicted_CDS_5|1488_bp

atgaagctgtcggtgtgtctcctgctggtcacgctggccctctgctgctaccaggccaat
gccgagttctgcccagctcttgtttctgagctgttagacttcttcttcattagtgaacct
ctgttcaagttaagtcttgccaaatttgatgcccctccggaagctgttgcagccaagtta
ggagtgaagagatgcacggatcagatgtcccttcagaaacgaagcctcattgcggaagtc
ctggaaccaggagactccaagccctgggacagcccaggcaataacctcttgctgctctct
ccaaccctcgtcacttgcttccatgaggatcctgggcctcattcccggatcaccccaggg
ctattggcctggcaagaaaaggctgctccatcaccggcatgggtgctcctctatgaactc
ttcttactatacatcctcctcattcatgccaggtccaacagggactggggctgtgtcaag
tggcgcctgaacagggacttcgaggacatgaacaaaaaagaaagagacagagaagaaaag
gaaggggagcaagtggagaaggaatacaagaaagaggaagagaagcaggaagattctaca
tacaggctggctgtgtttcccctggggcatgctcctgtttactggtcccatgccagcggc
ttccttgatccttgccacccgcgactgaacaccgacagcagcagcctcaccatgaagttg
ctgatggtcctcatgctggcggccctctcccagcactgctacgcaggctctggctgcccc
ttattggagaatgtgatttccaagacaatcaatccacaagtgtctaagactgaatacaaa
gaacttcttcaagagttcatagacgacaatgccactacaaatgccatagatgaattgaag
gaatgttttcttaaccaaacggatgaaactctgagcaatgttgaggtgtttatgagaaag
caggaaagatctaaaattgacaccctaacatcacaattaaaagaactagagaagcaagag
caaacacattcaaaagctagcagaaggcaagaaacaactaagatcagagcagaactgaag
gaaatagagacacaaaaaacccttcaaaaaatcaataaatccaggagctggttttttgaa
aagatcaacaaaatcgatagaccgctagaaagactaataaagaagaaaagagagaagaat
caaatatacgcaataaaaaatgataaaggggatatcaccatcgagcccacagaaatacaa
actaccatcagagaatactataaacacctctactcaaataaaccagaaaatctagaagac
atggataaattcctggacacatacaccctcccaagactaaaccaggaagaagttgaatct
ctgaagagaccaataacaggctgtgaaattgaggcaataattaatagcttgccaaccaaa
aaaagtccaggaccagatggattcacagctgaattctaccagaggtaa

>gi568815587f:62142308_62343478|GENSCAN_predicted_peptide_6|415_aa

MSELPFTIASKRVKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPSSWVGRINIV
KMAILPKVIYRFNAIPIKLPMTFFTQLEKTTLNFIWNQKRACIAKSILSQKNKAGGITLP
DFKLYYKATVTKTAWYWYQNRDIDQWNRTQPSEIIPHIYNYLIFDKPDKNKQWGKDSLFN
KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTVKTLEENLGNTIQDIGMGKD
FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFVIYSSDKGLISRI
YSELQQICKKKTNNPINKWAKDMNRHFSKEDIYAANRHMKKCSSSLAIREMQIKTKMRYH
PTPVRMAIIKKSGNNRCSSKLVISLAKKTSRMSINEHMNTKKADYEMLAGLGRRF

>gi568815587f:62142308_62343478|GENSCAN_predicted_CDS_6|1248_bp

atgagtgaactcccattcacaattgcttcaaagagagtaaaatacctaggaatccaactt
acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa
gaggatacaaacaaatggaagaacattccatcctcatgggtaggaagaatcaatatcgtg
aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca
atgactttcttcacacaattggaaaaaactactttaaatttcatttggaaccaaaaaaga
gcctgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacct
gacttcaaactatactacaaggctacggtaaccaaaacagcatggtactggtaccaaaac
agagatatagaccaatggaacagaacacagccctcagaaataataccacatatctacaac
tatctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaat
aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt
acaccttatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaact
gtaaaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggac
ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta
attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct
acagagtgggagaaaatttttgtaatctactcatctgacaaagggctaatatccagaatc
tacagtgaactccaacaaatttgcaagaagaaaacaaacaaccccatcaacaagtgggca
aaggatatgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaa
aaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaaaatgagataccat
cccacaccagttagaatggccatcattaaaaagtcaggaaacaacagatgttcttcaaaa
ttagtaatatcattagcaaagaagacatccaggatgtcaatcaatgagcacatgaacacc
aagaaagctgactatgagatgctggcaggtctaggaaggcgtttctag

>gi568815587f:62142308_62343478|GENSCAN_predicted_peptide_7|81_aa

MRLSVCLLMVSLALCCYQAHALVCPAVASEITVFLFLSDAAVNLQVAKLNPPPEALAAKL
EVKHCTDQISFKKRLSLKKSW

>gi568815587f:62142308_62343478|GENSCAN_predicted_CDS_7|246_bp

atgaggctgtcagtgtgtctcctgatggtctcgctggccctttgctgctaccaggcccat
gctcttgtctgcccagctgttgcttctgagatcacagtcttcttattcttaagtgacgct
gcggtaaacctccaagttgccaaacttaatccacctccagaagctcttgcagccaagttg
gaagtgaagcactgcaccgatcagatatcttttaagaaacgactctcattgaaaaagtcc
tggtaa

>gi568815587f:62142308_62343478|GENSCAN_predicted_peptide_8|123_aa

MAEQEQLQSAAPSEINTEDRSTRQKVNKDIQDLNAALDQVDLTDIYRTLHPKSTEYTFFS
APQCTYSKIDHIIGSKTLVSKCKRMEIITNSLSDHSAIKLELTIKKLTQNLHNYMETGQP
APE

>gi568815587f:62142308_62343478|GENSCAN_predicted_CDS_8|372_bp

atggctgaacaggaacagctccagtctgcagctcccagtgagatcaacacagaagacaga
tcaacaagacagaaggttaacaaggatattcaggacttgaacgcagctctggaccaagtg
gacctaacagacatctacagaactctccaccccaaatcaacagaatatacattcttctca
gcaccacaatgcacttattctaaaattgaccacataattggaagtaaaacactcgtcagc
aaatgcaaaagaatggaaatcataacaaacagtctctcagatcacagtgcgatcaaatta
gaactcacgattaagaaactcactcaaaacctgcacaactacatggaaactggacaacct
gctcctgaatga

>gi568815587f:62142308_62343478|GENSCAN_predicted_peptide_9|137_aa

MGSEQKSVSTGDMQEKVNIEAEHNTSSREGAQDNLEEDISWWLERLLQGLGHISGQHLVA
VEEDARKDEEEEDVKLLSVSGKRSAPGGGSKVPQKKVKLAADEDDDYEDDDDDEDDDDDD
DFDDKETKEKAPVKQSI

>gi568815587f:62142308_62343478|GENSCAN_predicted_CDS_9|414_bp

atgggatctgaacaaaagtctgtgtccacaggagacatgcaggagaaagtgaatatcgag
gcagaacacaacacgagcagcagagaaggagcccaggataacctggaggaagacatcagc
tggtggctggaaaggctgctccaggggctagggcatattagtggacagcacttagtagct
gtggaggaagatgcaagaaaagatgaagaggaggaggatgtgaaactcttaagtgtatct
ggaaagcgatcagcccctggaggtggtagcaaggttccacagaaaaaagtaaaacttgct
gccgatgaagatgatgattatgaagatgatgatgatgatgaagatgatgatgatgatgat
gattttgatgataaggaaaccaaagaaaaagcaccagtgaagcaatctatatga

>gi568815587f:62142308_62343478|GENSCAN_predicted_peptide_10|102_aa

MNPIVVVHGGGAGPISKDRKERVHQGMVRAATVGYGILREGGSAVDAVEGAVVALEDDPE
FNAEAFLLNVCPLIFQMSSIPHIEEEKSRAEKRESKDYGGKX

>gi568815587f:62142308_62343478|GENSCAN_predicted_CDS_10|306_bp

atgaatcccatcgtagtggtccacggcggcggagccggtcccatctccaaggatcggaag
gagcgagtgcaccagggcatggtcagagccgccaccgtgggctacggcatcctccgggag
ggcgggagcgccgtggatgccgtagagggagctgtcgtcgccctggaagacgatcccgag
ttcaacgcagaggccttccttctcaacgtctgccctctcatctttcaaatgtcctctatt
ccgcatattgaggaagaaaagagtagagctgaaaagagggaaagcaaagattatgggggg
aaagnn