GENSCAN 1.0    Date run: 3-Nov-116    Time: 09:30:53

Sequence gi568815576f:29878441_30125798 : 247358 bp : 41.05% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5128   5290  163  0  1   54   90   174 0.980  12.14
 1.02 Term +   7388   7497  110  0  2   52   40   152 0.938   4.49
 1.03 PlyA +  12801  12806    6                               1.05

 2.04 PlyA -  14695  14690    6                               1.05
 2.03 Term -  17623  17313  311  2  2   72   42   230 0.960  11.04
 2.02 Intr -  18218  17901  318  0  0  120   62   434 0.945  39.01
 2.01 Init -  20726  20657   70  2  1   21  101    41 0.348  -0.04
 2.00 Prom -  28216  28177   40                              -1.65

 3.00 Prom +  57184  57223   40                              -6.65
 3.01 Init +  63403  63408    6  0  0   92   89     4 0.056   1.69
 3.02 Intr +  77630  77742  113  1  2   48   67    51 0.006  -2.74
 3.03 Intr +  92536  92622   87  1  0   91  115    56 0.971   6.77
 3.04 Intr +  93202  93374  173  1  2   86  -27   141 0.072   1.06
 3.05 Intr +  99985 100091  107  2  2    8   91    85 0.128  -0.19
 3.06 Intr + 100496 100612  117  1  0   31  101   113 0.167   6.64
 3.07 Intr + 106060 106178  119  0  2   59  100    60 0.120   2.64
 3.08 Intr + 113064 113230  167  0  2   26  101   165 0.138  10.28
 3.09 Intr + 120321 120417   97  1  1   91   89    70 0.965   5.55
 3.10 Intr + 123580 123654   75  1  0   47   95    75 0.778   1.81
 3.11 Intr + 124440 124553  114  0  0  121  111    93 0.999  13.54
 3.12 Intr + 128674 128879  206  1  2   92   99   267 0.999  26.12
 3.13 Intr + 129461 129592  132  0  0  108   94   222 0.954  24.50
 3.14 Intr + 130578 130689  112  1  1   77  106    80 0.999   7.22
 3.15 Intr + 133928 134123  196  2  1   69   83   184 0.978  14.50
 3.16 Intr + 134916 135101  186  2  0   71   89   188 0.995  16.16
 3.17 Intr + 138088 138258  171  0  0   75   44   118 0.919   5.32
 3.18 Intr + 139487 139632  146  1  2  136  105    54 0.999  10.06
 3.19 Intr + 141040 142444 1405  1  1   79   89  1016 0.962  89.13
 3.20 Intr + 143589 143699  111  2  0   77   77    82 0.698   5.66
 3.21 Intr + 144169 144257   89  0  2  120   81    -2 0.737   0.15
 3.22 Intr + 145017 145043   27  0  0  134   77    43 0.907   4.11
 3.23 Intr + 146634 146757  124  0  1   53   53   110 0.865   3.67
 3.24 Term + 148869 148994  126  2  0   80   53   100 0.435   3.00
 3.25 PlyA + 149183 149188    6                               1.05

 4.10 PlyA - 149236 149231    6                               1.05
 4.09 Term - 149866 149736  131  2  2   64   54    96 0.049   1.16
 4.08 Intr - 151286 151178  109  2  1   53   55    98 0.112   1.94
 4.07 Intr - 153843 153727  117  0  0   78   20   108 0.093   2.74
 4.06 Intr - 161838 161737  102  0  0   62  110    27 0.282   1.75
 4.05 Intr - 162193 162114   80  2  2  101   94    45 0.315   4.75
 4.04 Intr - 168248 168008  241  2  1   46   20   334 0.435  18.60
 4.03 Intr - 168666 168374  293  1  2   27   81   244 0.048  13.53
 4.02 Intr - 202499 202451   49  2  1   87  110    37 0.056   3.13
 4.01 Init - 214160 214104   57  2  0   65   92    35 0.184   2.96
 4.00 Prom - 219001 218962   40                              -5.05

 5.00 Prom + 220943 220982   40                              -2.95
 5.01 Init + 222081 222255  175  2  1   71    9   123 0.049   2.26
 5.02 Intr + 226616 226999  384  0  0   47   95   276 0.021  18.10
 5.03 Intr + 243524 243774  251  0  2   85   75   227 0.681  17.33
 5.04 Term + 246111 246125   15  2  0  121   48    12 0.559  -2.24
 5.05 PlyA + 247347 247352    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  93202  93397  196  1  1   86   42   151 0.910   6.20
S.002 Init - 168753 168469  285  0  0   87   49   237 0.923  16.82
S.003 Intr + 243192 243349  158  0  2   52   96    94 0.975   5.33

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:29878441_30125798|GENSCAN_predicted_peptide_1|90_aa
MGARRPSSARTLDAGEAGSRARAAFPDLAGMGPCGSGCGTYPRSARSYSHELPGVKWNAV
TKGSIRVLEGGVVKVLTSLEEVEENVVTIV

>gi568815576f:29878441_30125798|GENSCAN_predicted_CDS_1|273_bp
atgggagctcggcggccgagctcggcccggaccctagatgcgggggaggcggggtcccgg
gctcgggctgccttcccagacctggcggggatgggcccgtgcggctctgggtgtgggacg
taccctcggagcgcccggagttattcccacgaactcccgggagtaaagtggaatgctgtt
accaaaggaagtattcgtgttttagaggggggtgtggtgaaggttcttacaagtttggaa
gaggtggaagaaaatgttgttacaattgtgtag

>gi568815576f:29878441_30125798|GENSCAN_predicted_peptide_2|232_aa
MHVYNFVGLKAPHLNPTGGKCYKYYGNYVRQKANSSDFLEFKMGREAVETTHNINYTSGP
ETVQWWFKKCCKGDESLEDEECSGRPEVGNDQLRAIIEADPLTTTREIAEELNVDHSTLV
WQAIEANWKVLGIQAWATTPSQKVKKLDKWVPHELTENFKNCRFEMLSSLILRNDDEPFL
GWIVMCDKKWILYNNSDDQLSGWTEKTLQSTSQSRTCTKKWSWSLFGGLLLV

>gi568815576f:29878441_30125798|GENSCAN_predicted_CDS_2|699_bp
atgcatgtatacaactttgttggattgaaggctcctcatctcaaccccactggaggtaaa
tgctataaatactatggaaattatgttagacaaaaagcaaattcaagtgattttcttgag
ttcaaaatgggtcgtgaagcagtggagacaactcacaacatcaactacacatctggccca
gaaactgtgcagtggtggttcaagaagtgttgcaaaggagacgagagccttgaagatgag
gagtgtagtggccggccagaagttggcaatgaccaactgagagcaatcatcgaagctgat
cctcttacaactacacgagaaattgccgaagaactcaatgtcgaccattctacccttgtt
tggcaggcaattgaagccaactggaaagtgctgggaatacaggcgtgggctaccacgccc
agccaaaaggtgaaaaagcttgataagtgggtgcctcatgagctgaccgaaaattttaaa
aattgtcgatttgaaatgttgtcttctcttattctacgtaacgacgacgaaccatttctt
ggttggattgtgatgtgcgacaaaaagtggattttatacaacaacagtgatgaccagcta
agtggctggaccgagaagacactccaaagcacttcccaaagccgaacttgcaccaaaaag
tggtcatggtcactgtttggcggtctgctcctggtctga

>gi568815576f:29878441_30125798|GENSCAN_predicted_peptide_3|1401_aa
MVSFDILSNLVHLCDSMSFKQLKWMSRVCTTLEKLSQRNRLHHKGKKRALLDTEEGTGIS
PPNLGLLSWQYFDPIVGGAALHTPSCIFNQKENLMDNLMTYGPFRAQLKQRRWSNKPATF
SSILRKCFGLFQDEETRHSLECIQANQIFPRKQLIREDENLQVPFLELHGESTEFVGRAE
DAIIALSNYRLHIKFKESLVNGVSVYNVTWLFEELVVALNVFDALFEHEFGEIYIRKSIV
RCQFSTFEQCQEWLKRLNNAIRPPAKIEDLFSFAYHAWCMEVYASEKEQHGDLCRPGEHV
TSRFKNEVERMGFDMNNAWRISNINEKYKIWIKGLIAFTIEKRRLDYPERDIVLLCGSYP
QELIVPAWITDKELESVSSFRSWKRIPAVIYRHQSNGAVIARCGQPEVSWWGWRNADDEH
LVQSVAKACASDSRSSGSKLSTRNTSRDFPNGGDLSDVEFDSSLSNASGAESLAIQPQKL
LILDARSYAAAVANRAKGGGCECPEYYPNCEVVFMGMANIHSIRRSFQSLRLLCTQMPDP
GNWLSALESTKWLHHLSVLLKSALLVVHAVDQDQRPVLVHCSDGWDRTPQIVALAKLLLD
PYYRTIEGFQVLVEMEWLDFGHKFADRCGHGENSDDLNERCPVFLQWLDCVHQLQRQFPC
SFEFNEAFLVKLVQHTYSCLFGTFLCNNAKERGEKHTQERTCSVWSLLRAGNKAFKNLLY
SSQSEAVLYPVCHVRNLMLWSAVYLPCPSPTTPVDDSCAPYPAPGTSPDDPPLSRLPKTR
SYDNLTTACDNTVPLASRRCSDPSLNEKWQEHRRSLELSSLAGPGEDPLSADSLGKPTRV
PGGAELSVAAGVAEGQMENILQEATKEESGVEEPAHRAGIEIQEGKEDPLLEKESRRKTP
EASAIGLHQDPELGDAALRSHLDMSWPLFSQGISEQQSGLSVLLSSLQVPPRGEDSLEVP
VEQFRIEEIAEGREEAVLPIPVDAKVGYGTSQSCSLLPSQVPFETRGPNVDSSTDMLVED
KVKSVSGPQGHHRSCLVNSGKDRLPQTMEPSPSETSLVERPQVGSVVHRTSLGSTLSLTR
SPCALPLAECKEGLVCNGAPETENRASEQPPGLSTLQMYPTPNGHCANGEAGRSKDSLSR
QLSAMSCSSAHLHSRNLHHKWLHSHSGRPSATSSPDQPSRSHLDDDGMSVYTDTIQQRLR
QIESGHQQEVETLKKQVQELKSRLESQYLTSSLHFNGDFGDEVTSIPDSESNLDQNCLSR
CSTEIFSEASWEQVDKQDTEMTRWLPDHLAAHCYACDSAFWLASRKHHCRDTDRVDQTWL
LVPGFACCQQSSPKWQQSICHVAAFDPLEKWPLGKAPYSPQFPDQGQGVSQEGGSVHAWV
WLRRGPEGFPGALSPGKAQSI

>gi568815576f:29878441_30125798|GENSCAN_predicted_CDS_3|4206_bp
atggtgagttttgatattctgagcaatcttgttcatttatgtgactccatgtcatttaaa
cagctcaaatggatgagtcgtgtttgcacaactttagagaaattgtcacaaaggaacaga
cttcaccataagggaaagaagagagccttgttggacactgaagaagggacggggatttct
cctcctaatctgggcctcttgtcatggcagtattttgatcccatagtaggtggagcagcc
ctgcacaccccctcctgtatctttaaccagaaagagaatttaatggacaatttaatgact
tacgggccatttcgggcccagctcaagcagagaagatggagcaacaagccagccacattt
tcatcaatcctgaggaaatgttttggacttttccaggatgaagagactcggcacagcctt
gagtgcatccaggccaatcagatctttcccaggaagcagctgatccgggaggatgagaat
cttcaggttcctttccttgaacttcatggagagagcacagagtttgtgggccgtgccgag
gatgccatcattgccctttccaattacagacttcacatcaagttcaaggagtctcttgtt
aatggtgttagtgtctataatgtaacttggctttttgaggaattggttgtagcactgaat
gtgtttgatgcattgtttgagcatgagtttggtgaaatttatattaggaaaagcattgta
aggtgtcagttttcaacctttgagcagtgtcaagagtggctgaagagactgaacaacgca
atccgaccacctgctaaaatagaagatctcttctcatttgcataccatgcttggtgcatg
gaggtctatgccagtgaaaaagagcaacatggagacctgtgcagaccaggggagcatgta
acttcaaggtttaaaaacgaggtggagaggatgggttttgatatgaacaacgcctggagg
atttccaacatcaatgagaagtacaagatttggattaaaggccttattgccttcactatt
gaaaaaagaagattggattatccagagagagacattgtcctattatgtggtagctatcct
caagagctcatagtgcctgcctggatcactgacaaagaactggaaagtgtatcaagtttc
aggtcctggaagcgcatccctgccgtcatctacaggcaccagagcaatggagctgtcatt
gcccgctgtggacagccagaggttagctggtggggctggcgaaatgcagatgatgagcat
ctggtacagtcagtagccaaagcttgtgcctctgactcccgatcgagtggcagcaagctg
tcaactaggaacacttctcgagactttcccaatgggggagacctttctgacgtggagttc
gattcttctctgtcaaatgcttcaggagcagagagtttagccatccaaccgcagaagctt
ttgatcttggatgcacgctcctatgcagctgctgtggcaaaccgagccaaaggaggaggc
tgcgaatgcccagagtattacccaaactgtgaagttgtgtttatggggatggcaaacatt
cattctattcggaggagttttcagtctctgcggttgctgtgcactcagatgccagatccg
ggaaattggctatcagctcttgaaagcacaaaatggctccatcacttgtctgtgcttctg
aaatcagcgcttctggtagtgcatgctgtggatcaggatcagcggccggtgctagtacac
tgctcagatggctgggaccgcaccccccagattgtggcattggctaagctcttgctggac
ccttattaccgaaccatagagggtttccaggtcctcgtggaaatggagtggctggatttt
ggccataaatttgctgaccggtgtggtcatggggagaactcggatgatctgaatgaacgt
tgcccagtgtttctgcagtggcttgactgtgttcatcagcttcagaggcaatttccttgc
tcttttgagttcaatgaagcattccttgtgaaactggtgcagcatacctattcctgcctg
tttggaacattcctgtgcaacaacgccaaggagagaggggaaaagcatactcaggaacgg
acatgttccgtgtggtcacttcttcgggcaggcaacaaggctttcaaaaacctactgtat
tcctctcagtcagaagccgtgctgtaccctgtgtgccatgtgcgtaacctgatgctgtgg
agtgcagtgtacctgccctgcccatccccaaccacccctgtggacgacagctgtgcacca
tacccagccccaggcaccagccctgatgatccccccctgagccggctaccaaagactaga
tcatacgacaatctgaccacagcctgtgacaacacagtgcctctggccagccggcgctgc
agcgaccccagcctgaacgagaagtggcaggagcaccggcgctcactagagctgagcagc
ctggctggccctggagaggatcccctttctgccgacagcctagggaagcccaccagagtg
ccggggggtgccgagctttctgttgcagccggagtagctgaggggcagatggagaacatc
ttgcaggaggccaccaaagaggagagtggagtagaggaacctgcccacagggcaggcatt
gagatacaggagggtaaagaggaccctctcttagaaaaggagagcaggaggaagacacct
gaggcctcagccattggacttcaccaagacccagaactgggtgatgctgctctgaggagc
catctggatatgagctggcctctgttctcacagggcatttctgaacagcagagtgggctc
agtgttctcctcagttctctccaggtcccccccaggggagaggattccctggaggtccct
gtggagcagtttcgaatagaagagattgcagagggtagggaggaagcagttcttccaatc
ccagtagatgcaaaagttggctatggtacctcacagtcatgttctctgctaccttcccaa
gtcccttttgagaccagaggaccaaacgtggacagttctacagacatgttagtggaagat
aaggtgaagtcagtaagtgggccccaaggtcatcatagatcttgccttgtaaatagtggc
aaggacaggcttcctcagaccatggaacccagcccttcagagacaagcctggtcgagagg
ccccaagtggggtctgtggtgcataggacttcccttggcagcactctcagcctgacacgt
tccccttgtgccttgcctttagccgaatgtaaagaggggcttgtgtgcaatggtgcccca
gagactgaaaacagggcctcagagcagcccccaggtcttagcaccctccagatgtacccc
acacccaatgggcattgcgccaatggggaggctggtaggagcaaggactcactgagccgt
cagctgtctgctatgagctgcagctctgcccacttacactcaaggaacttgcaccacaag
tggctgcatagccactcaggaaggccatctgcaaccagcagccccgaccagccttcccgc
agccacctggacgatgatggcatgtcagtgtacacagacacgatccaacagcgcctgcgt
cagattgagtcaggccaccagcaggaagtagaaactttgaagaaacaagtccaggagctg
aagagtcgcctggagagccagtacctgaccagctccctacactttaatggagactttggg
gatgaggtgacttcaatccccgactcggaaagcaatctggatcagaactgtttgtctcgc
tgcagcacagagattttctctgaagccagctgggagcaggtggataaacaggacacagag
atgacccgttggcttcctgaccacctggccgcccactgctatgcgtgcgacagtgccttc
tggcttgccagcaggaagcaccactgcagggacactgaccgtgttgatcaaacgtggcta
cttgtccctggctttgcatgctgccagcagagcagtccaaaatggcagcagtctatctgc
catgttgctgcctttgatcctctggagaagtggccgttgggcaaggctccttactcccca
cagttccctgaccaagggcaaggtgtgtctcaggaaggtggttctgtgcatgcctgggtg
tggttaaggcgtggcccagaaggcttccctggtgctctcagtccaggcaaagcccagagc
atctga

>gi568815576f:29878441_30125798|GENSCAN_predicted_peptide_4|392_aa
MLQDIGLGKDFMGKTSNAQASILAENLNGWGESDPEGGRAPQLDQGTHRTLHRSRLPEGP
EGRDYLMRTHEQATARLSPQDQSLHAELAPARKPLQLHQGHGQLRMNPMDLFEANDLFES
RSMTQLQVSLLALMGTNKCASQSGMTAYGMRRHFYDPNNYILPPMDHSIISVQMRINKCV
SQVAMTAPGTQRHISDTKLGTDKCGNSSISLQMGWASIKRFVFGKFRDHWLHIQTLAHME
IWVRGGMLLPWSCQAAWQTSRIPAASHGDYTGTQMRKCLRVPHTPMSNSAGAAGLGSHPR
SVGSKTQGLSTKLVDTAAALGTGHLNWQQLNEILQQHLQISPHSQKINRVYPSRTTGRSI
FSPPPHISFHSPTEGARVLAALKSEQQLKMEP

>gi568815576f:29878441_30125798|GENSCAN_predicted_CDS_4|1179_bp
atgcttcaggacattggtctgggtaaagattttatgggtaagacttcaaatgcacaggca
tcgatcctggcagaaaacctcaatggctggggagaatctgacccagaaggaggcagagct
ccacagctggatcaagggactcacaggactcttcatcggtcccgacttccagaagggcct
gaaggacgggattatcttatgcgcactcatgaacaagctacagccaggctcagtccccaa
gatcaatcactccatgcagaactggcaccagctagaaaacctctccaacttcatcaaggc
catggtcagctacggatgaaccccatggacctgttcgaggccaacgacctgtttgagagt
agaagtatgacacagctgcaggtgtctcttctcgccctgatgggcaccaacaaatgcgcc
agccagtcagggatgaccgcatacggcatgaggaggcatttctacgaccccaacaactac
atcctgccccccatggaccactccatcatcagtgtccagatgcgtataaacaagtgtgtc
agccaggtggccatgacggctcctgggacgcagcggcacatctctgacaccaagctggga
accgacaagtgtggtaactcctccatatccctgcagatgggatgggcatccatcaagagg
tttgtatttgggaagttcagggatcactggctgcacatccagacactggcccacatggag
atatgggtaagaggagggatgctgttgccttggagctgccaggctgcatggcagacaagc
aggattcctgctgcctcccatggagactatactggaacgcagatgaggaaatgtctccga
gttcctcatacacccatgagcaactcagctggtgctgcagggctaggatcccaccccagg
tctgtgggctctaaaacacaggggctttccaccaaactggtggatacagctgcagccctg
gggacaggtcacttgaactggcagcagctaaatgaaatcctgcaacagcaccttcaaatc
agccctcattcccaaaagatcaatagagtctatccttcaagaacaacaggcagaagcatt
ttctcacccccaccccacatctccttccactccccgactgaaggggccagagtactggct
gccctgaaatcagagcagcagctaaaaatggagccttga

>gi568815576f:29878441_30125798|GENSCAN_predicted_peptide_5|274_aa
MGKDFMTKAPKAIATKAKIDKWELIKPKSFCTAKEIVIRVNRKHTEWEKMFPIYPSDKAF
IMNRGQGKAGGSVLGSGFPHGGQDIAQLLLLPLGADVCSHLFLDELEGPFVLGDLELLHG
MLPIQGEATHLSDHIPHELGVFGQAPAMAAGPGLAHILGHLVALVEAHGCRIKQNHGCCS
PVTAKALTPHDYQPLGFKEGVNSHFLLFDKEPINVQVGFVSTGFHSMKVKVMTEATKVID
LENNLFRENSTTEIAHQGLDCDEEEECNDHFCHG

>gi568815576f:29878441_30125798|GENSCAN_predicted_CDS_5|825_bp
atgggcaaagacttcatgactaaagcaccaaaagcaattgcaacaaaagccaaaatagac
aaatgggaactaattaaaccaaagagcttctgcacagcaaaagaaattgtcatcagagtg
aacaggaaacatacagaatgggagaaaatgtttccaatctatccatctgacaaagctttt
attatgaacagagggcagggcaaggcagggggctcagtccttggcagcggctttcctcat
ggtggccaggacattgcgcagctcctcctacttcctcttggtgcggatgtgtgttcccac
ctttttcttgatgaacttgaaggcccttttgtccttggagaccttgagctactccatggg
atgctgcccatacagggtgaagccacacacctctcagatcatatcccgcatgaacttggt
gtgtttggtcaggcacctgcgatggcagctgggcctgggcttgctcacattcttggtcac
cttgtggcccttgttgaggcccatggctgtaggataaagcagaaccatggctgctgctct
ccagtgacagccaaggccttgaccccacatgattaccaacccctcggttttaaagaaggg
gtaaattcacacttcctgctgtttgacaaggagcctatcaacgtgcaagtgggatttgtc
tccactggctttcatagcatgaaagtaaaagtcatgacagaggctacaaaagtgattgat
ttggagaacaatctgtttcgggagaacagcactactgagatcgcccatcagggtctagac
tgtgatgaggaagaagaatgcaatgaccatttttgccatggatga