GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:23:27

Sequence gi568815595f:4879719_5083689 : 203971 bp : 43.46% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7380   7524  145  2  1   86   39    73 0.059   2.06
 1.02 Intr +  25789  25869   81  2  0   60   66    79 0.022   2.51
 1.03 Intr +  30465  30518   54  1  0  110   42    43 0.007   0.85
 1.04 Term +  37632  38113  482  1  2   14   54   394 0.062  23.56
 1.05 PlyA +  38137  38142    6                               1.05

 2.00 Prom +  38840  38879   40                              -4.06
 2.01 Init +  39230  42189 2960  1  2   44   53   970 0.546  80.27
 2.02 Intr +  78143  78226   84  2  0   61  115    20 0.019   0.94
 2.03 Intr +  79509  79654  146  0  2    9  115    46 0.028  -0.67
 2.04 Term +  86974  87497  524  2  2  -10   42   333 0.090  13.34
 2.05 PlyA +  87958  87963    6                               1.05

 3.00 Prom +  88207  88246   40                              -5.06
 3.01 Sngl +  88580  89008  429  1  0   49   42   169 0.625   4.69
 3.02 PlyA +  89293  89298    6                               1.05

 4.00 Prom +  91714  91753   40                              -2.46
 4.01 Init + 100001 100080   80  1  2  105  110    80 0.974  11.13
 4.02 Intr + 100244 100313   70  2  1  128  121    62 0.941  12.68
 4.03 Intr + 100583 100690  108  1  0  113   98   184 0.999  22.38
 4.04 Intr + 101674 101797  124  0  1  126  105   102 0.999  15.86
 4.05 Term + 103118 103974  857  0  2  129   37   719 0.966  63.85
 4.06 PlyA + 104607 104612    6                               1.05

 5.04 PlyA - 105041 105036    6                               1.05
 5.03 Term - 140299 140163  137  2  2   85   36   109 0.693   3.58
 5.02 Intr - 142169 142053  117  0  0   42   60    90 0.454   2.04
 5.01 Init - 157558 157435  124  1  1   72   25    89 0.189   1.33
 5.00 Prom - 191848 191809   40                              -3.56

 6.04 PlyA - 193357 193352    6                               1.05
 6.03 Term - 201731 201439  293  0  2   55   38   444 0.771  31.61
 6.02 Intr - 203097 202911  187  0  1   15   34   167 0.290   3.26
 6.01 Intr - 203832 203724  109  2  1   46   31   176 0.529   7.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  37724  38113  390  1  0   88   54   327 0.810  25.52
S.002 Sngl - 200664 200494  171  0  0  110   54   249 0.996  18.53

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:4879719_5083689|GENSCAN_predicted_peptide_1|253_aa
EYRPLHLPNEPHLLIIFTSVEIPSFQKPPLTSDAKELLRRPARTFTDAVSSFNKGRSGFC
DTAVKEEHVKREESLAPNVVPFSEKVHKAPGPGKGKLTTRKDIYTENPSVHHHHQRPKVD
KTTKMGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSSYSELR
EDIQTKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSQCDQLEE
RVSVMEDEMNEMK

>gi568815595f:4879719_5083689|GENSCAN_predicted_CDS_1|762_bp
gaatatcgtcctctccatttgcccaatgagccccacctcctcatcatttttacctctgtg
gaaatcccttccttccagaagcctccactcacttctgatgccaaggagcttctgcggcgc
cctgcacgcacctttacagatgcagtttcttctttcaacaagggtagaagcggtttctgt
gacacagctgtcaaagaagaacacgtgaagagggaagaaagtctagctcccaacgtggtc
ccattctctgagaaagtacacaaggccccggggccaggcaaaggaaaactaacaaccaga
aaggacatctacaccgaaaacccatctgtacatcaccatcatcaaagaccaaaagtagat
aaaaccacaaagatggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagc
gcctctcctcctccaaaggaacgcagttcctcaccagcaacagaacaaagctggatggag
aatgattttgacgagctgagagaagaaggcttcagacgatcaagttactctgagctacgg
gaggacattcaaaccaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgt
ataactagaataaccaatacagagaagtgcttaaaggagctgatggagctgaaaaccaag
gctcgagaactacgtgaagaatgcagaagcctcaggagccaatgcgatcaactggaagaa
agggtatcagtgatggaagatgaaatgaatgaaatgaagtga

>gi568815595f:4879719_5083689|GENSCAN_predicted_peptide_2|1237_aa
MGDFNTPLSTLDRSTRQKVNKDTQELKSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS
KIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWV
HNEMKAEIKMFFETNENKDTTYQNRWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLK
ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK
KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLN
QEEVESLNRPITGSEIVAIINSLPTKKSPGLDGFTAEFYQRYKEELVPFLLKLFQSIEKE
GILPNSFYEASIILIPKLGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHD
QVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGI
DGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQE
KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL
YTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNI
PCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILS
QKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEK
NKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLG
ITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFAT
YSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKRPMKKFSSSLAIR
EMQIKTTMRYHLTPVRMAIIKKSGNNSKRQLSGGFLPDVQGYVRLFVPYQPKQDRIKKFH
IHVVCQGRVGGGRRTKNLNAPTRRINKEKAAPKQITAKLLKSSERNSININKKDIHTKTP
SVGHHHQRPKVDKTTKVGRIQSRKAENSKNQSASSPPKDHSSSPAMEQTWTENDFDELTE
VGFRRSVITNFSELKKDVRTHCKEAKNLEKRLHEWLTRIKSVEKTLKDLMELKTMAQKLC
DACTSFRSQSDQVEERVSVIKDQMNEMKKRRSLEKKE

>gi568815595f:4879719_5083689|GENSCAN_predicted_CDS_2|3714_bp
atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac
aaggatacccaggaattgaaatcagctctgcaccaagcagacctaatagacatctacaga
actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc
aaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt
ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc
actcaaagccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacacc
acataccagaatcgctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta
aatgcctacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa
atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc
aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag
aaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccacc
gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa
ctagaaaatctagaagaaatggatacattcctcgacacatacactctcccaagactaaac
caggaagaagtcgaatctctgaatagaccaataacaggatctgaaattgtggcaataatc
aatagtttaccaaccaaaaagagtccaggactagatggattcacagccgaattctaccag
aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag
ggaatcctccctaactcattttatgaggccagcatcattctgataccaaagctgggcaga
gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc
ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat
caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta
atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa
aaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt
gatgggacgtatttcaaaataataagagctatctatgacaaacccacagccaatatcata
ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct
ctctcaccgctcctattcaacatagtgttggaagttctggccagggcaatcaggcaggag
aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagac
gacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctgata
agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta
tacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct
tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag
gagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaatggaagaacatt
ccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatt
tacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa
actactttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagc
caaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctaca
gtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaaca
gagccctcagaaataatgccacatatctacaactatctgatctttgacaaacctgagaaa
aacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc
atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaaga
tggattaaagatttaaacgttagacctaaaacaataaaaaccctagaagaaaacctaggc
attaccattcaggacataggcgtgggcaaggacttcatgtccaaaacaccaaaagcaatg
gcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagca
aaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaaattttcgcaacc
tactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaag
aaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctcaaaa
gaagacatttatgcagccaaaagacccatgaaaaaattctcatcatcactggccatcaga
gaaatgcaaatcaaaaccactatgagatatcatctcacaccagttagaatggcaatcatt
aaaaagtcaggaaacaacagtaagaggcagttatctgggggatttctaccagatgtacag
ggttacgtgagactatttgtaccttatcaaccaaaacaagacagaatcaagaagttccac
atccacgttgtgtgtcaggggagagttgggggtgggaggagaaccaagaaccttaacgca
cccacaaggagaataaataaggaaaaagcagcacctaaacaaatcacagccaaactgctg
aaaagcagcgaaaggaatagcatcaacatcaacaaaaaggacatccacaccaaaacccca
tctgtaggacaccatcatcaaagaccaaaggtagataaaaccacaaaggtggggagaatc
cagagcagaaaagctgaaaattctaaaaatcagagcgcctcttctcctccaaaggatcac
agctcctcaccagcaatggaacaaacctggacggagaatgactttgacgagttgacagaa
gtaggcttcagaagatcggtaataacgaacttctccgagctaaagaaggatgttcgaacc
cattgcaaagaagctaaaaaccttgaaaaaagattacacgaatggctgactagaataaag
agcgtagagaagaccttaaaggacctgatggagctgaaaaccatggcacaaaaactatgt
gacgcatgcacaagcttcaggagccaatccgatcaagtggaagaaagggtatcagtgatt
aaagatcaaatgaatgaaatgaagaaaaggagaagtttagagaagaaagagtaa

>gi568815595f:4879719_5083689|GENSCAN_predicted_peptide_3|142_aa
MGDFNAPLSTLDRSTRQKVNKDIQDLNSALHQVDLIDIYRTLHAKSTEYTFFSAPHCTYS
KIDHIVGSKALLSKCKRTEITTNCLSDHSAIKLELRIKKLTQNHAATWKLNNLLLNDYWV
NNEMKAQVKMFFETNENKDTIY

>gi568815595f:4879719_5083689|GENSCAN_predicted_CDS_3|429_bp
atgggagactttaacgccccactgtcaacattagatagatcaacgagacaaaaagttaac
aaggatatccaggacttaaactcagctctgcaccaagtggacctaatagacatctacaga
actctccacgccaaatcaacagaatatacattcttctcagcaccacattgcacttattcc
aaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt
acaacaaactgtctctcagaccacagtgcaatcaaattagaactcaggattaagaaactc
actcaaaaccacgcagctacatggaaactgaacaacctgctcctgaatgactactgggta
aataacgaaatgaaggcacaagtaaagatgttctttgaaaccaatgagaacaaagacaca
atatactag

>gi568815595f:4879719_5083689|GENSCAN_predicted_peptide_4|412_aa
MERIPSAQPPPACLPKAPGLEHGDLPGMYPAHMYQVYKSRRGIKRSEDSKETYKLPHRLI
EKKRRDRINECIAQLKDLLPEHLKLTTLGHLEKAVVLELTLKHVKALTNLIDQQQQKIIA
LQSGLQAGELSGRNVETGQEMFCSGFQTCAREVLQYLAKHENTRDLKSSQLVTHLHRVVS
ELLQGGTSRKPSDPAPKVMDFKEKPSSPAKGSEGPGKNCVPVIQRTFAHSSGEQSGSDTD
TDSGYGGESEKGDLRSEQPCFKSDHGRRFTMGERIGAIKQESEEPPTKKNRMQLSDDEGH
FTSSDLISSPFLGPHPHQPPFCLPFYLIPPSATAYLPMLEKCWYPTSVPVLYPGLNASAA
ALSSFMNPDKISAPLLMPQRLPSPLPAHPSVDSSVLLQALKPIPPLNLETKD

>gi568815595f:4879719_5083689|GENSCAN_predicted_CDS_4|1239_bp
atggagcggatccccagcgcgcaaccaccccccgcctgcctgcccaaagcaccgggactg
gagcacggagacctaccagggatgtaccctgcccacatgtaccaagtgtacaagtcaaga
cggggaataaagcggagcgaggacagcaaggagacctacaaattgccgcaccggctcatc
gagaaaaagagacgtgaccggattaacgagtgcatcgcccagctgaaggatctcctaccc
gaacatctcaaacttacaactttgggtcacttggaaaaagcagtggttcttgaacttacc
ttgaagcatgtgaaagcactaacaaacctaattgatcagcagcagcagaaaatcattgcc
ctgcagagtggtttacaagctggtgagctgtcagggagaaatgtcgaaacaggtcaagag
atgttctgctcaggtttccagacatgtgcccgggaggtgcttcagtatctggccaagcac
gagaacactcgggacctgaagtcttcgcagcttgtcacccacctccaccgggtggtctcg
gagctgctgcagggtggtacctccaggaagccatcagacccagctcccaaagtgatggac
ttcaaggaaaaacccagctctccggccaaaggttcggaaggtcctgggaaaaactgcgtg
ccagtcatccagcggactttcgctcactcgagtggggagcagagcggcagcgacacggac
acagacagtggctatggaggagaatcggagaagggcgacttgcgcagtgagcagccgtgc
ttcaaaagtgaccacggacgcaggttcacgatgggagaaaggatcggcgcaattaagcaa
gagtccgaagaaccccccacaaaaaagaaccggatgcagctttcggatgatgaaggccat
ttcactagcagtgacctgatcagctccccgttcctgggcccacacccacaccagcctcct
ttctgcctgcccttctacctgatcccaccttcagcgactgcctacctgcccatgctggag
aagtgctggtatcccacctcagtgccagtgctatacccaggcctcaacgcctctgccgca
gccctctctagcttcatgaacccagacaagatctcggctcccttgctcatgccccagaga
ctcccttctcccttgccagctcatccgtccgtcgactcttctgtcttgctccaagctctg
aagccaatcccccctttaaacttagaaaccaaagactaa

>gi568815595f:4879719_5083689|GENSCAN_predicted_peptide_5|125_aa
MALFATDHLAQKDDKVEPRKDMINAVNFYITVVKTQGGHLLLSQSDVVEYERSGLNRPES
KCRLFISCVALGNLLKSNRAGHNIQKVRVQEGFCEIKGELWPERKGGMVMSSHLLNTISP
EKEYE

>gi568815595f:4879719_5083689|GENSCAN_predicted_CDS_5|378_bp
atggctctctttgctactgaccacctagcccagaaggatgataaagtggagcccaggaag
gatatgatcaatgcagttaatttttatatcactgtggtcaagacacaaggcgggcacctc
cttttgagtcagtctgatgtggtggaatacgagcgctctggactcaacaggcctgagtca
aaatgtcggctcttcattagctgtgtggccttgggcaacttacttaaaagtaaccgtgcg
gggcacaacattcagaaagtccgagtccaggaaggcttttgtgagataaagggggagctg
tggccagagaggaaagggggcatggtgatgagttcacacttattaaataccatcagcccc
gagaaagagtatgaataa

>gi568815595f:4879719_5083689|GENSCAN_predicted_peptide_6|196_aa
XKDHPNLIKNAKKPDIPKKLNSPTQQLWYTHRKKDFSCAELMANKKDVPSAERMICRVAS
SGSCSPGRRRTPMCDQETKDYEAELLRFSQETAPGGAAAVGKGQQLQEEQPRFLEIELAC
TLARRWSDLSEKAKYKTREQARQGVPGTQQAAQVIKRVQIWQQSIISNYLACFKNDRVKA
SKAMDVTWNPKEENLM

>gi568815595f:4879719_5083689|GENSCAN_predicted_CDS_6|591_bp
nngaaggatcatcccaacttaatcaagaatgccaagaagccagacatccccaagaagctc
aactcccccacccagcagctgtggtacacccatcggaagaaggactttagttgcgcagag
ctcatggccaacaagaaggatgttcccagcgcggagcgcatgatatgccgtgtggccagc
agtggaagctgctctcccggaaggagaaggacgcctatgtgcgaccaggaaacaaaagat
tatgaggcggaactgctgcgtttttctcaggagactgccccaggaggagcagcagcagtg
ggaaaggggcagcagctgcaggaggagcagcctaggttcttggagatcgagctggcctgc
acgctggcccgaaggtggagtgacttgtctgagaaggccaagtacaagacccgagagcaa
gccaggcagggagtgccaggaactcagcaagccgcccaagtcatcaagagagtacagatc
tggcaacagagcatcatcagcaactacctggcctgcttcaagaacgaccgggtgaaggcc
tcgaaagccatggatgtgacctggaatcccaaggaggagaacctgatgtga