GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:23:25 Sequence gi568815581r:33937624_34139872 : 202249 bp : 43.48% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14922 15005 84 2 0 72 47 70 0.151 2.12 1.02 Intr + 20470 20608 139 0 1 77 87 62 0.264 5.04 1.03 Intr + 26497 26656 160 2 1 71 71 125 0.640 8.15 1.04 Intr + 40157 40424 268 2 1 75 81 77 0.357 3.23 1.05 Intr + 47317 47413 97 0 1 67 76 59 0.364 2.18 1.06 Term + 48228 48259 32 1 2 114 48 22 0.351 -1.18 1.07 PlyA + 49957 49962 6 1.05 2.00 Prom + 58070 58109 40 -3.06 2.01 Init + 63773 63927 155 1 2 72 47 97 0.774 3.46 2.02 Intr + 66514 66599 86 1 2 64 87 66 0.749 3.56 2.03 Intr + 67622 67746 125 0 2 76 79 21 0.279 0.30 2.04 Term + 93950 94444 495 1 0 -18 46 336 0.225 13.37 2.05 PlyA + 94660 94665 6 1.05 3.00 Prom + 95195 95234 40 -4.46 3.01 Sngl + 96146 97711 1566 1 0 70 43 338 0.764 22.88 3.02 PlyA + 98903 98908 6 1.05 4.03 PlyA - 98965 98960 6 1.05 4.02 Term - 101293 99998 1296 1 0 38 53 652 0.966 48.30 4.01 Init - 102249 101308 942 0 0 67 -11 391 0.584 20.75 4.00 Prom - 105600 105561 40 -6.26 5.08 PlyA - 105689 105684 6 1.05 5.07 Term - 108119 108055 65 0 2 126 48 40 0.814 1.85 5.06 Intr - 112308 112131 178 0 1 44 110 50 0.029 2.29 5.05 Intr - 128977 128851 127 0 1 60 38 109 0.188 3.78 5.04 Intr - 129306 129132 175 1 1 -3 84 121 0.236 1.60 5.03 Intr - 132840 132806 35 2 2 125 76 -17 0.055 -1.33 5.02 Intr - 139946 139788 159 1 0 61 55 130 0.193 6.10 5.01 Init - 146476 146058 419 1 2 71 53 138 0.123 4.50 5.00 Prom - 146678 146639 40 -9.55 6.02 PlyA - 147090 147085 6 1.05 6.01 Sngl - 149031 147616 1416 0 0 44 42 579 0.828 44.84 6.00 Prom - 169282 169243 40 -0.96 7.00 Prom + 169343 169382 40 -6.26 7.01 Init + 171765 171824 60 2 0 55 115 24 0.574 3.05 7.02 Intr + 177035 177191 157 1 1 67 86 65 0.499 3.78 7.03 Intr + 188172 188279 108 1 0 41 78 56 0.168 0.16 7.04 Term + 190023 190108 86 1 2 83 52 91 0.483 2.82 7.05 PlyA + 190462 190467 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 112362 112131 232 0 1 66 110 89 0.879 7.32 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:33937624_34139872|GENSCAN_predicted_peptide_1|259_aa MMEDAWVLDDIIKSLNSSKYSLICTFSHVMLLQEVSSHSLGQLHHCGFAGTAILLAALMA GIECLWLFQVHDASCEASESVPGNHYPLAGMNVLDETNLSVSVDACTANSSFDFLTPRVY HTYSNLTRSLMTLIVPRDREGLPCYRTPAPCEQTGEEFGASGLVGPESTVSAAACLAHGF KQPCQLMASWCSMEVPGHGFSAVYDSWAVESWCQPLKRTPETSDTERKDILKGGKKVAFG DCYRQSEYRCPQAFPEKTQ >gi568815581r:33937624_34139872|GENSCAN_predicted_CDS_1|780_bp atgatggaagatgcctgggtccttgatgatatcatcaagtcactgaactcgtccaaatac agcctcatttgcacttttagtcatgttatgctgttgcaagaagtgagctcccacagcctt gggcagctccatcactgtggctttgcaggtacagctatcctcctggctgctttaatggct ggcattgagtgtctgtggcttttccaggtgcacgatgcaagctgtgaggcttcagaatct gtccctggaaaccactacccattggctggcatgaacgtgttagatgaaaccaatttgtca gtttcagtagatgcctgtacagccaactccagcttcgactttctgactccacgagtctat cacacatacagcaacctgaccagaagcctcatgacactgattgttcccagggacagagaa ggcctgccctgttacaggactcctgcaccctgtgagcagactggagaggaatttggggcc agcggtcttgttggccctgaaagcacggtgtctgcagctgcctgcctggcacatgggttc aagcagccttgccagctcatggcatcatggtgcagtatggaggtcccgggccatggtttc agtgctgtctatgactcatgggctgtggagtcctggtgccagcccttgaagaggacacca gaaacttcagacactgaaaggaaagacattctaaaaggtggaaagaaagtggcatttggg gactgctataggcagtctgaatatagatgtccacaggccttccctgagaagacacaatag >gi568815581r:33937624_34139872|GENSCAN_predicted_peptide_2|286_aa MGAGLKTESIGVGLDPGSSGAGAGLASASTEVVLATGSMEMHLESGSSRELGRDDQLIKG EGVKYGRWEKEKEKEKEKEGFPACSNPFLILVLDTLPVGLTYMLATDHWFLDTFLLSSNF STEEIRTNGKEVKSFEKKLDKWITRITNGEKSLRDLMELKTKARELRDECRSLSSQCDQL EERVSVMEDEMNEMKREEKFREKRIKRNKQSLQDLWQYVERPNLRLIDLPESDGENGTKL ENTLQDIIQENFPNLARQTNIQIQEIQRMPQRYSSRRATPRHIIVR >gi568815581r:33937624_34139872|GENSCAN_predicted_CDS_2|861_bp atgggggcaggcctgaaaactgaatctataggtgtaggcctggaccctgggtccagtggg gctggtgctggtctggcatcagcatccactgaggtggtcctagcaactgggtccatggag atgcacctggagtctggatcttcaagagaactgggccgtgatgaccagttgataaaagga gaaggagtaaagtatgggagatgggagaaggaaaaggaaaaggaaaaggaaaaggaaggc ttcccagcctgctccaacccctttctcatcctggtgctggacacactgccagtggggctc acctacatgctggccacagatcattggtttctcgatacgtttctattatcttcaaacttt tctacagaggaaattcgaaccaatggcaaagaagttaaaagctttgaaaaaaaattagac aaatggataactagaataaccaatggagaaaagtccttaagggacctaatggagctgaaa accaaggcacgagagctacgtgacgaatgcagaagcctcagtagccaatgcgatcaactg gaagaaagggtatcagtgatggaagatgaaatgaatgaaatgaagcgagaagagaagttt agagaaaaaagaataaaaagaaacaaacaaagcctccaagacttatggcagtatgtggaa agaccaaatctacgtctgattgatttacctgaaagtgacggggagaatggaaccaagttg gaaaacactctgcaggatattatccaggagaacttccccaatctagcaaggcagaccaac attcaaattcaggaaatacagagaatgccacaaagatactcctcgagaagagcaactcca agacacataattgtcagatga >gi568815581r:33937624_34139872|GENSCAN_predicted_peptide_3|521_aa MDKFLDTYILPRLNQEVESLNRPITGSQIEAIINSLPTKKSPGPDGFTAEFYQRYKEELI SLLLKLFQSIVKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLININTKILNKILAN RIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKAFDKIQ QPFMLKTLNKLGIDRTYLKIIRAIYDKPTTNIILNGQKLEAFPLKTGTRQGCPLSPFLFN IVLEILARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLRLISNFSKVS GYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLIRDVKDLFKENYKPLL NEIKEDTNKWKNIPCLWIGRINIMKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIW NQKRACIAKSILSQKKAGGITLPDFKLYYKSTVTKTAWYWYQNRDIDQWNRTEPSEIMLH IYNYLIFDKPDRNKQWGKDSLFNKWCWENWLAICGKLASHM >gi568815581r:33937624_34139872|GENSCAN_predicted_CDS_3|1566_bp atggataaatttcttgacacatacatcctcccaagactaaaccaggaagttgaatctctg aatagaccaatcacaggctctcaaattgaggcaataatcaatagcttaccaaccaaaaaa agtccaggaccagatggattcacagccgaattctaccagaggtacaaagaggagctgata tcactccttctgaaattattccaatcaatagtaaaagagggaatcctccctaactcattt tatgaggccagcatcatcctgataccaaagcctggcagagacacaaccaaaaaagagaat tttagaccaatatccttgataaacatcaatacaaaaattctcaataaaatactggcaaac cgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctgga atgcaaggctggttcaacatatgcaaatcaataaatgtaatccagcatataaacagaacc aaagacaaaaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaattcaa caacccttcatgctaaaaactctcaataaattaggtattgataggacgtatctcaaaata ataagagctatctatgacaaacccacaaccaatatcatactgaatgggcaaaaactcgaa gcattccctttgaaaacaggcacaagacagggatgccctctctcaccattcctattcaac atagtcttggaaattctggccagggcaatcaggcaggagaaggaaataaagggtattcaa ttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaa aaccccatcgtctcagcccaaaatctcctcaggctgataagcaacttcagcaaagtctca ggatacaaaatcaatgtacaaaagtcacaagcattcttatacaccaataacagacaaaca gagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaataccta ggaatccaacttataagggatgtgaaggacctcttcaaggagaactacaaaccactgctc aatgaaataaaagaggacacaaacaaatggaagaacattccatgcttatggataggaaga atcaatatcatgaaaatggccatactgcccaaggtaatttatagattcaatgccatcccc atcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatgg aaccaaaaaagagcctgcatcgccaagtcaatcctaagccaaaagaaagctggaggcatc acgctacctgacttcaaactatactacaagtctacagtaaccaaaacagcatggtactgg taccaaaacagagatatagatcaatggaacagaacagagccctcagaaataatgctgcat atctacaactatctcatctttgacaaacctgacagaaacaagcaatggggaaaggattcc ctatttaataaatggtgctgggaaaactggctagccatatgtggaaaactggctagccac atgtag >gi568815581r:33937624_34139872|GENSCAN_predicted_peptide_4|745_aa MMEELHSLDPRRQKLLEARFTGVGVSKGPLNSESSNQSLCSVGSLSDKEVETPKKKQNDQ RNRKRKAEPYESSQGKGTPRGHKISDYFEFAGGSGPGTSPGRSVPPVARSSLQHSLSNPL PRRVEQPLYGLDGSAAKEATEEQSALPTLMSVMLAKPRLDTEQLAQRGAGLCFTFVSAQQ NSPSSTGSGNTEHSCSSQKQISIQHRQTQSDLTIEKISALENSKNSDLEKKEGRIDDLLR AICDLRRQIDEQQKMLEKYKERLNRCVTMSKKLLIEKSKQEKMACRDKSMQDRLRLGYFT TSDTEPNLLSSGQMNLIKQQERINSQREEIERQRKMLAKRKPPAMGQAPPATNEQKQWKS KTNGAENETLTLKEYHEQEEIFKLRLGHLKKEEAEIQAELERLERVRKLHIREVKRIHNE DNSQFKYHPTLNDRYLLLHLLGRGGFSEVYKAFDLTEQRYVAVKIHQLNKNWRDEKKENY HKHACREYRIHKELDHPRIVKLYDYFSLDTDSFCTVLEYCEGNDLDFYLKQHKLMSEKEA RSIIMQIVNALKYLNEIKPPIIHYDLKPGNILLENGTGCGEIKITDFGLSKIMDDDSYNS VDGMELTSQGAGTYWYLPPECFVVGKEPPKISNKVDVWSVGVIFYQCLYGRKPFGHNQSQ QDILQENTILKATEVQFPPKPVVTPEAKALIRRCLAYRKEDRIDVQQLACDPYLLPHIQK SVSTSSPAGAAIASTSGASNNSSSN >gi568815581r:33937624_34139872|GENSCAN_predicted_CDS_4|2238_bp atgatggaagaattgcatagcctggacccacgacggcagaaattattggaggccaggttt actggagtaggtgttagtaagggaccacttaatagtgagtcttccaaccagagcttgtgc agcgtcggatccttgagtgataaagaagtagagactcccaagaaaaagcagaatgaccag cgaaatcggaaaagaaaagctgaaccatatgaaagtagccaagggaaaggcactcctagg ggacataaaattagtgattactttgagtttgctgggggaagcgggccgggaaccagccct ggcagaagtgttccaccagttgcacgatcctcactgcaacattccttatccaatccctta ccgcgacgagtagaacagcccctctatggtttagatggcagtgctgcaaaggaggcaacg gaggagcagtctgctctgccaaccctcatgtcagtgatgctagcaaaacctcggcttgac acagagcagctggcgcaaaggggagctggcctctgcttcacttttgtttcagctcagcaa aacagtccctcatctacgggatctggcaacacagagcattcctgcagctcccaaaaacag atctccatccagcacagacagacccagtccgacctcacaatagaaaaaatatctgcacta gaaaacagtaagaattctgacttagagaagaaggagggaagaatagatgatttattaaga gccatctgtgatttgagacggcagattgatgaacagcaaaagatgctagagaaatacaag gaacgattaaatagatgtgtgacaatgagcaagaaactccttatagaaaagtcaaaacaa gagaagatggcgtgtagagataagagcatgcaagaccgcttgagactgggctactttact acgtccgacacggagccaaatttactgagcagtggacagatgaatcttatcaagcaacag gaaaggataaattcacagagggaagagatagaaagacaacggaaaatgttagcaaagcgg aaacctcctgccatgggtcaggcccctcctgcaaccaatgagcagaaacagtggaaaagc aagaccaatggagctgaaaatgaaacgttaacgttaaaagaataccatgaacaagaagaa atcttcaaactcagattaggtcatcttaaaaaggaggaagcagagatccaggcagagctg gagaggctagaaagggttagaaaactacatatcagggaagtaaaaaggatacataatgaa gataattcacaatttaaatatcatccaacgctaaatgacagatatttgttgttacatctt ttgggtagaggaggtttcagtgaagtttacaaggcatttgatctaacagagcaaagatac gtagctgtgaaaattcaccagttaaataaaaactggagagatgagaaaaaggagaattac cacaagcatgcatgtagggaataccggattcataaagagctggaccatcccagaatagtt aagctgtatgattacttttcactggatactgactcgttttgtacagtattagaatactgt gagggaaatgatctggacttctacctgaaacagcacaaattaatgtcagagaaagaggcc cggtccattatcatgcagattgtgaatgctttaaagtacttaaatgaaataaaacctccc atcatacactatgacctcaaaccaggtaatattcttttagaaaatggtacagggtgtgga gagataaaaattacagattttggtctttcgaagatcatggatgatgatagctacaattca gtggatggcatggagctaacatcacaaggtgctggtacttattggtatttaccaccagag tgttttgtggttgggaaagaaccaccaaagatctcaaataaagttgatgtgtggtcggtg ggtgtgatcttctatcagtgtctttatggaaggaagccttttggccataaccagtctcag caagacatcctacaagagaatacgattcttaaagctactgaagtgcagttcccgccaaag ccggtagtaacacctgaagcaaaggcgttgattcgacgatgcttggcctaccgaaaggag gaccgcattgatgtccagcagctggcctgtgatccctacttgttgcctcacatccaaaag tcagtctctacgagtagccctgctggagctgctattgcatcaacctctggggcgtccaat aacagttcttctaattga >gi568815581r:33937624_34139872|GENSCAN_predicted_peptide_5|385_aa MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTSKETTIGVNRQPTEWEKIFTTYSSDKGL ISRIHNELKQIYKKKTNNPVKKWAKDLNRHFSKEDIYAAKTHMKKCSPSLAIREMQIKTT MGYHLTPVRMAIIKKSGNNSEGCIVAVFHGNRFQEEQHRRSAGSEDFQSAPKLSQTKRRV AGFTASADTKGRGPWGLTSSEFLEELELDNKESHLSEELKTLCVVSWVVLLTFSDFEFPG NFTKPVTHISMPNHPKQSTLALLLKRLCENVFAGHVKGFQLLGRVGTPLSDDLVHSYLHQ WFTAQLRSQAGKQRISICEKQGLLKVQHEHRGESNYHRGAGLLEIISLGFLSEEDSLKGF EVSKGPILLSARGKLVFVDVLLYDP >gi568815581r:33937624_34139872|GENSCAN_predicted_CDS_5|1158_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacatcaaaagaaactaccatcggagtg aacaggcaacctacagaatgggagaaaattttcacaacctactcatctgacaaagggcta atatccagaatccacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccgtc aaaaagtgggcgaaggacctgaacagacacttctcgaaagaagacatttatgcagccaaa acacacatgaaaaaatgctcaccatcactggccatcagagaaatgcaaatcaaaaccaca atgggatatcatcttacaccagttagaatggcaatcattaaaaagtcaggaaacaacagt gaaggctgcatcgtggctgtgttccatggtaaccgtttccaggaagagcagcaccggagg agtgcaggttctgaagactttcaatcagccccaaagcttagccagacaaaacgtcgtgtt gctggcttcacagcctccgcggacaccaaagggagaggaccatggggtttaacctcctct gagtttctagaagagctggagcttgataataaagaatcccacctgtccgaggagctgaag acattgtgtgttgtttcctgggtggtcttgctgaccttctctgatttcgagttccctgga aacttcaccaaaccagtgactcacatctccatgcccaaccacccaaagcagtccactttg gcccttctgctgaagaggctttgtgagaatgtctttgcaggccatgtgaaaggctttcag ctgctgggcagagttggcactccactcagtgatgacctcgtccacagttacctgcatcag tggttcacagcccagttgaggagtcaggcaggtaaacagagaatttcaatatgtgagaaa cagggtttattgaaggtccaacatgaacacagaggagagagtaattaccaccgtggagca ggtttgctggagattatcagcttgggattcctgtcggaggaagattctctgaagggtttt gaagtttcaaagggaccaattttgctttcagccagaggtaaactagtctttgtcgatgtg ttactttatgacccttaa >gi568815581r:33937624_34139872|GENSCAN_predicted_peptide_6|471_aa MGDFNTPLSTLDRSVRHKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKALLRKCKRTEIITNYLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYLV HNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKETETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDTIKNDKGDVTTDPTEIQTTIREYYKHLYTNKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAKFYQRYKEELVPFLLKLFQSIEKE GILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKIVNKILANRIQQHIRKLIHHD QVGFIPGMQVWFNIHKSVNVIQHINRTKDKNHMIISIDAEKAFDKIQQPSC >gi568815581r:33937624_34139872|GENSCAN_predicted_CDS_6|1416_bp atgggagactttaacaccccactgtcaacattagacagatcagtgagacacaaagttaac aaggatacccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagctctcctcagaaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaattagaactcaggattaagaaactc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactacttggta cataatgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaacagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccactagcaagactaataaag aaaaaacgagagaagaatcaaatagacacaataaaaaatgataaaggggatgtcaccacc gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacacaaataaa ctagaaaatctagaagaaatggataaattcctcgacacgtacactctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccaaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaattctccccaactcattttatgaggccagcatcatcctgataccaaagcctggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacatcgatgcaaaaatc gtcaataaaatactggcaaaccgaatccagcagcacatcagaaagcttatccaccatgat caagtgggcttcatccctgggatgcaagtctggttcaatatacacaaatcagtaaatgta atccagcatataaacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaa aaggcctttgacaaaattcaacaaccttcatgctaa >gi568815581r:33937624_34139872|GENSCAN_predicted_peptide_7|136_aa MPKVYSPVHVFYVHVCSKTKNNVSVLKTVRVKCYPEKWGAGSAYDAKGGGTDTASIIPQS SWSNRLEPPKAVAAKTWFAGELELLKLRIQVRAQIPNPELQRAARGQGVGQLGASAPLLF LLQDGCNSSREESDAT >gi568815581r:33937624_34139872|GENSCAN_predicted_CDS_7|411_bp atgccaaaagtatacagtcctgtgcatgtcttctacgtacatgtatgttccaagacaaag aataatgtttctgtgttgaaaacagtgagggttaagtgttatcctgagaagtggggagca ggttcagcctatgatgcaaagggtggaggcacagacacagccagcattattccccagagc tcatggagcaatcgcctggagccaccaaaagctgtggccgctaagacatggtttgcgggt gagctggagcttctgaaactgaggatccaggtacgagctcagataccaaatcctgagctc cagagagcagccagagggcagggagtggggcagctgggggcttcagctcccctgctcttc ctgctgcaggacggatgcaacagctccagggaggagtccgatgccacctga