GENSCAN 1.0 Date run: 8-Nov-116 Time: 02:08:20 Sequence gi568815592f:82264962_82466221 : 201260 bp : 38.37% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 688 683 6 1.05 1.04 Term - 15981 15221 761 1 2 17 43 249 0.063 5.90 1.03 Intr - 18815 18560 256 2 1 4 71 236 0.176 9.59 1.02 Intr - 19401 19249 153 0 0 73 36 140 0.957 6.65 1.01 Init - 31788 31729 60 0 0 77 44 53 0.043 1.10 1.00 Prom - 32103 32064 40 -8.25 2.00 Prom + 32239 32278 40 -4.75 2.01 Init + 32940 33196 257 2 2 46 80 164 0.652 8.15 2.02 Intr + 58053 58122 70 0 1 36 93 110 0.015 4.57 2.03 Intr + 66615 66717 103 2 1 21 106 88 0.262 2.73 2.04 Intr + 67615 67675 61 2 1 104 76 42 0.534 1.47 2.05 Term + 72738 72996 259 0 1 33 49 183 0.712 2.94 2.06 PlyA + 73012 73017 6 1.05 3.02 PlyA - 73423 73418 6 1.05 3.01 Sngl - 76540 76109 432 1 0 86 42 265 0.702 17.73 3.00 Prom - 85087 85048 40 -4.05 4.00 Prom + 96124 96163 40 -6.45 4.01 Sngl + 100001 101263 1263 1 0 97 54 1233 0.999 114.42 4.02 PlyA + 101901 101906 6 1.05 5.03 PlyA - 102833 102828 6 1.05 5.02 Term - 109524 109388 137 1 2 89 38 109 0.947 3.20 5.01 Init - 111862 111763 100 1 1 76 87 84 0.713 5.68 5.00 Prom - 121622 121583 40 -3.75 6.03 PlyA - 122293 122288 6 1.05 6.02 Term - 126209 125724 486 2 0 16 50 222 0.498 4.91 6.01 Init - 128071 127034 1038 1 0 40 41 493 0.519 33.83 6.00 Prom - 128164 128125 40 -6.15 7.02 PlyA - 128333 128328 6 1.05 7.01 Sngl - 129577 128561 1017 1 0 88 43 776 0.999 69.87 7.00 Prom - 138730 138691 40 -4.15 8.03 PlyA - 138745 138740 6 1.05 8.02 Term - 148726 148510 217 0 1 72 45 130 0.419 2.73 8.01 Init - 149929 149781 149 1 2 82 45 107 0.815 3.75 8.00 Prom - 156172 156133 40 -2.85 9.00 Prom + 159443 159482 40 -2.95 9.01 Init + 167705 167928 224 1 2 98 81 97 0.912 8.08 9.02 Intr + 170758 170857 100 1 1 -27 37 125 0.005 -4.91 9.03 Intr + 178182 178360 179 2 2 56 74 151 0.041 8.30 9.04 Term + 181777 181945 169 1 1 106 49 79 0.688 2.17 9.05 PlyA + 182461 182466 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 178258 178360 103 2 1 85 74 69 0.936 3.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:82264962_82466221|GENSCAN_predicted_peptide_1|409_aa MEKVPSMNQEIDLYQTPSLPGLHNRQNRRYPPVPGSAGPTSMEPSKIRPTGLKFSLTTQQ QSEIDLGCSSLESSGWHLTGAPLGRSFQRKERAAIFAVLQPLLVIPRQTGSGVDLQQTPA DLQQTGLIVRRKTNKQKKIAHPLRDPIRRSRTSKTKEKEGILPNSFYEASIILIPKPGRD TTKKENFMPISLMNIDAKILNKILANQIQQHIKKLLRHDQISFIPGMQSWFNIRKSINLI HHINRTNDNNHMIISIDEEKAFKKIQHRFMLKTLNKLGTDGSYLKIIRAIYDKPTANVIL NGLKLEAFPLKTGTRQGCSLSPLLFNIVLEVLARAIRQEKEIKGIQIGKEEVKSSLFADD MIIYLENPIVSSLNLLKLISNFSKVSGYKINVQKSQAFLYTNNREPDHE >gi568815592f:82264962_82466221|GENSCAN_predicted_CDS_1|1230_bp atggagaaggtgccatctatgaatcaggaaatagacctctaccagacacctagtctgcca ggccttcacaatcggcagaacaggcgataccctcccgtgcctggctcagctggtcccaca tccatggagcccagcaagataagacccactggcttgaaattctcgctgacaacacagcag cagtctgagatagacctgggatgctccagcttggagagctctggctggcatctgacgggt gcccctctgggacgaagcttccagaggaaagaacgggcagcaatctttgctgttctgcag cctctgctggtgatacccaggcaaacagggtctggagtagacctccagcaaactccagca gacctgcagcagacgggcttgattgttagaaggaaaactaacaaacagaaaaaaatagca catccacttagagaccccatccgaaggtcacgaacatcaaagactaaagaaaaagaggga atcctccctaactcattttatgaggccagcatcatcctgataccaaaacctggcagagac acaacaaaaaaagaaaatttcatgccaatatccctgatgaacattgatgcaaaaatcctc aataaaatactggcgaaccaaatccagcagcacatcaaaaagcttctccgccacgatcaa atcagcttcatccctgggatgcaaagctggttcaacatacgcaaatcaataaacttaatc catcacataaacagaaccaatgacaataaccacatgattatctcaatagatgaagaaaag gccttcaaaaaaattcagcaccgcttcatgctaaaaactctcaataaactaggtactgac gggtcttatctcaaaataataagagctatttatgacaaacccacagccaatgttatactg aatggactaaaactggaagcattccctttgaaaactggcacaagacaaggatgctctctc tcaccactcctattcaacatagtattggaagttctggccagggcaatcaggcaagagaaa gaaataaagggtattcaaataggaaaagaggaagtcaaatcgtctctgtttgcggatgac atgattatatatttagaaaaccccattgtctcatccctaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaggcattcctatac accaataatagagagccagatcatgagtga >gi568815592f:82264962_82466221|GENSCAN_predicted_peptide_2|249_aa MVVYVEETSPTPPSGYPESRGDKGVRKRQNKHLKGGSRGPEHRRLARGPELSGSPQFIGL QAVCSSGRWELGGRDEEKDESVEENSDWGYTREQNKVPSTYELTLRFEKILFLLTTDGQE ESVNSTEKFHRNELCSAFAIRGQISGNCQSTSYLQEIHFLSFRLFLAQRLKGLMRQAAEI SICSDKGCLSAMKTHHVLQVLATFMSPMQNCGFLPRIYLPILQGQVSEYPESLWRVLKHE AAHLQEVLQ >gi568815592f:82264962_82466221|GENSCAN_predicted_CDS_2|750_bp atggtggtttatgttgaggaaaccagccccacaccacccagcgggtaccccgagtcccgg ggcgacaaaggagttagaaagagacagaataagcatttaaaaggcgggtccagaggaccg gagcatcggaggcttgctcgcggcccagagctctcgggctccccacaatttattggttta caagctgtttgttcttcgggcagatgggagttgggaggaagggatgaggaaaaggatgaa tcagtggaggagaactcagactggggatatacccgtgaacagaacaaagtccccagtact tatgaacttacattgcggtttgagaagattcttttcctgctgacaacggatggccaggaa gaaagtgtgaattcaacagagaagtttcacagaaatgagctgtgctcagcctttgccata agagggcaaatctctggtaactgccagtctacttcctatcttcaggaaatccacttttta agcttccgccttttcctagcacagcgactcaagggcctgatgcgtcaggctgctgagata tctatctgttccgacaaaggctgcctcagtgccatgaaaacacatcacgtgctgcaagtc ctggcaacatttatgtctccaatgcaaaattgtggatttctgccaagaatttacctgcct attctccaaggacaagtatcagaatatcctgaaagcctctggagggtgctaaaacatgaa gcagctcacttacaggaagtacttcagtga >gi568815592f:82264962_82466221|GENSCAN_predicted_peptide_3|143_aa MEPCITAAPAPAVAKTSQGTAWAIASEGVSPKPWWLPHGIGPVGVQRARVEAWEPLLRFH RMYGNAWMSRQKFAAGVEPLWRTSTRTVWRGSVGLEPPHRVPTGVLPSGAVRRGPPSSRP QMVNPWTAWNVHLEKPQALNATL >gi568815592f:82264962_82466221|GENSCAN_predicted_CDS_3|432_bp atggagccctgcatcacagctgctccagcaccagctgtggctaaaacgagccaaggtaca gcttgggccattgcttcagagggtgtaagccccaagccttggtggcttccacatggtatt gggcctgtgggtgtacagagggcaagagttgaggcttgggagcctctgcttagatttcac aggatgtatggaaatgcctggatgtctaggcagaagtttgctgcaggggtggagcccttg tggagaacttctactaggacagtgtggaggggaagtgtgggtttggagcccccacacaga gtccccactggggtattgcctagtggagctgtaagaagagggccaccatcctccagaccc cagatggtaaatccatggacagcttggaatgtgcacctggaaaaaccacaggcactcaat gccaccctgtaa >gi568815592f:82264962_82466221|GENSCAN_predicted_peptide_4|420_aa MPGGCSRGPAAGDGRLRLARLALVLLGWVSSSSPTSSASSFSSSAPFLASAVSAQPPLPD QCPALCECSEAARTVKCVNRNLTEVPTDLPAYVRNLFLTGNQLAVLPAGAFARRPPLAEL AALNLSGSRLDEVRAGAFEHLPSLRQLDLSHNPLADLSPFAFSGSNASVSAPSPLVELIL NHIVPPEDERQNRSFEGMVVAALLAGRALQGLRRLELASNHFLYLPRDVLAQLPSLRHLD LSNNSLVSLTYVSFRNLTHLESLHLEDNALKVLHNGTLAELQGLPHIRVFLDNNPWVCDC HMADMVTWLKETEVVQGKDRLTCAYPEKMRNRVLLELNSADLDCDPILPPSLQTSYVFLG IVLALIGAIFLLVLYLNRKGIKKWMHNIRDACRDHMEGYHYRYEINADPRLTNLSSNSDV >gi568815592f:82264962_82466221|GENSCAN_predicted_CDS_4|1263_bp atgcctggggggtgctcccggggccccgccgccggggacgggcgtctgcggctggcgcga ctagcgctggtactcctgggctgggtctcctcgtcttctcccacctcctcggcatcctcc ttctcctcctcggcgccgttcctggcttccgccgtgtccgcccagcccccgctgccggac cagtgccccgcgctgtgcgagtgctccgaggcagcgcgcacagtcaagtgcgttaaccgc aatctgaccgaggtgcccacggacctgcccgcctacgtgcgcaacctcttccttaccggc aaccagctggccgtgctccctgccggcgccttcgcccgccggccgccgctggcggagctg gccgcgctcaacctcagcggcagccgcctggacgaggtgcgcgcgggcgccttcgagcat ctgcccagcctgcgccagctcgacctcagccacaacccactggccgacctcagtcccttc gctttctcgggcagcaatgccagcgtctcggcccccagtccccttgtggaactgatcctg aaccacatcgtgccccctgaagatgagcggcagaaccggagcttcgagggcatggtggtg gcggccctgctggcgggccgtgcactgcaggggctccgccgcttggagctggccagcaac cacttcctttacctgccgcgggatgtgctggcccaactgcccagcctcaggcacctggac ttaagtaataattcgctggtgagcctgacctacgtgtccttccgcaacctgacacatcta gaaagcctccacctggaggacaatgccctcaaggtccttcacaatggcaccctggctgag ttgcaaggtctaccccacattagggttttcctggacaacaatccctgggtctgcgactgc cacatggcagacatggtgacctggctcaaggaaacagaggtagtgcagggcaaagaccgg ctcacctgtgcatatccggaaaaaatgaggaatcgggtcctcttggaactcaacagtgct gacctggactgtgacccgattcttcccccatccctgcaaacctcttatgtcttcctgggt attgttttagccctgataggcgctattttcctcctggttttgtatttgaaccgcaagggg ataaaaaagtggatgcataacatcagagatgcctgcagggatcacatggaagggtatcat tacagatatgaaatcaatgcggaccccagattaacgaacctcagttctaactcggatgtc tga >gi568815592f:82264962_82466221|GENSCAN_predicted_peptide_5|78_aa MVSWARSKAPLLCAASDLVPCVPAAPAVAKSGQDSQNINKCHTSAHTKTKRKGLTKTLLI PFRVPEIIDDDQGLHFTS >gi568815592f:82264962_82466221|GENSCAN_predicted_CDS_5|237_bp atggtttcctgggccaggtccaaggcccccctgctgtgtgcagcctcagacttggtgccc tgtgtcccagctgctccagctgtggctaaaagtggccaagatagccaaaatataaacaaa tgtcacacatcagcccacactaaaactaaacgaaaaggcctcaccaagacacttcttatc cccttcagagttccagaaattattgatgatgaccaaggcctgcatttcacttcttaa >gi568815592f:82264962_82466221|GENSCAN_predicted_peptide_6|507_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRILHPKSTEYTFFSAPHHTYS KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNHSTTWKLNNLLLNDYWI HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKELETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN REEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYEEELRRKYLGIQLTRDVK DLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFT ELEKTTLKFIWNQKRAPIAKSILSQKNKAGGITVPDCKLYYKATVTKTAWYWYQNRDIDQ WNRTEPSEIMPHIYNYLIFDKPEKNKQ >gi568815592f:82264962_82466221|GENSCAN_predicted_CDS_6|1524_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagttaac aaggatacccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga attctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatactgggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaaaccactcaactacatggaaactgaacaacctgctcctgaatgactactggata cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggatgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaattagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccactagcaagactaataaag aaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccacc gatcccacagaaatacaaactaccatcagagaatactacaaacacctctatgcaaataaa ctagaaaatctagaagaaatggataaatttctcgacacatacactctcccaagactaaac cgggaagaagttgaatctctgaatagaccaataacaggatctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctatcag aggtatgaggaggaactgagaagaaaatacctaggaatccaacttacaagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaa tggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactg cccaaagtaatttatagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccccattgccaag tcaatcctaagccaaaagaacaaagctggaggcatcacagtacctgactgcaaactatac tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa tggaacagaacagagccctcagaaataatgccgcatatctacaactatctgatctttgat aaacctgagaaaaacaagcaatag >gi568815592f:82264962_82466221|GENSCAN_predicted_peptide_7|338_aa MGKKQNRKTGNSKTQSASPPPKERSSSPATEKSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERISA MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLARQANIQIQEIQRTPQRYSLRRATPRHIIVIFTKVEMKEKMLRAAREKGWV TLKGKPIRLTADLSAETLQAKREWGPIFNILKEKNFQPTISYPAKLSFISEGEIKYFTDK QMLRDFVTSKPALKELLKEALNMERNNRYQLLQNHAKM >gi568815592f:82264962_82466221|GENSCAN_predicted_CDS_7|1017_bp atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaaaaaagctggatggagaatgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaaggatatcagcg atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacattcagattcaggaaata cagagaacgccacaaagatactccttgagaagagcaactccaagacacataattgtcata ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggttgggtt accctcaaagggaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc aaaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccacaatt tcatatccagccaaactaagcttcataagcgaaggagaaataaaatactttacagacaag caaatgctgagagattttgtcaccagcaagcctgccctaaaagagctcctgaaggaagcg ctaaacatggaaaggaacaaccggtaccagctgctgcaaaatcatgccaaaatgtaa >gi568815592f:82264962_82466221|GENSCAN_predicted_peptide_8|121_aa MAPQPLPLLLLQQQSVMSLTKHSSGCWRCDEGLIERLLMTYEMSVGLRRILDLVAHCFST FLLRINASAQALLIEDLSQIYAPMARLSQTNAEIHAKNILPLTSAKLPGPLSHRATEPPF G >gi568815592f:82264962_82466221|GENSCAN_predicted_CDS_8|366_bp atggccccacaacccctcccactgctgctcctccagcagcagagtgtcatgagcctgaca aagcattcttcaggctgctggagatgtgatgaaggattaatcgagaggcttttgatgact tatgagatgtctgtaggacttagaagaatcctagacttggtggcacattgtttcagcaca tttcttctcagaatcaatgcctctgctcaagccctgctaattgaggacctttctcagatc tatgcccctatggccagactctcgcaaactaatgcagaaattcatgctaaaaatatactg cctctcacctcagccaagctccctgggccactgagccaccgagccaccgagccacctttt ggttga >gi568815592f:82264962_82466221|GENSCAN_predicted_peptide_9|223_aa MRGSVQMFTNGKGAEHHPSFLQAWPSLALQALTWLSTALPAWVMLLTCMGNLHPGILFVA PFIDMSSASTSAEARTQGEDAIYEPGNGSSPDTESAGSLLLDSLASRTFSGKPLPLPSLV SATQDLTTDVTPAAWLQRLKVDQRSTSPEILLYLLGKGDLFLLEFLSCYSGCLQCMSICS RMSRRQNSSDTELKKEGVYSAGGIGKTPVSKSQAPRVSNSCPF >gi568815592f:82264962_82466221|GENSCAN_predicted_CDS_9|672_bp atgaggggcagtgttcaaatgttcacaaatgggaagggagctgagcaccatccttcattc ctgcaagcgtggccatcattggccttgcaagccctgacctggctgtctacagccttgcca gcatgggtcatgcttcttacatgcatgggcaatctacacccagggatcttgtttgttgct cctttcattgatatgtcatcagcaagcacctctgcagaggccaggacacagggagaagac gccatctatgaaccaggaaacgggtcctcaccagacactgaatctgctggttctctgctc ttggactctctggcctccagaactttttcgggcaaacccctacctctgccttctctggtg tctgcgacccaggatctgaccactgacgttacccctgcagcctggctgcagagactcaaa gtggaccaacgatcaacgtcaccagaaattttactctacctgttgggaaagggggatctt tttctgctagagtttctaagttgctatagtggatgcctccagtgtatgtcaatttgtagc aggatgagccgcagacaaaactcctcagacaccgagttaaagaaggaaggggtttattca gctgggggcattggcaagactcctgtctccaagagccaagctccccgagtgagcaattcc tgtcccttttaa