GENSCAN 1.0	Date run: 4-Nov-116	Time: 12:15:07

Sequence gi568815588f:7955656_8174020 : 218365 bp : 44.50% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   8265  10143 1879  1  1   97   97  1508 0.508 139.93
 1.02 Intr +  53423  53675  253  2  1  105   62   313 0.137  27.84
 1.03 Intr +  58076  58182  107  1  2   28  105   115 0.987   6.21
 1.04 Term +  58982  59096  115  2  1   69   49   156 0.999   7.84
 1.05 PlyA +  59379  59384    6                               1.05

 2.03 PlyA -  59417  59412    6                               1.05
 2.02 Term -  59984  59914   71  0  2   50   41    71 0.005  -3.30
 2.01 Init -  68075  67979   97  2  1  101   40   104 0.594   6.37
 2.00 Prom -  68200  68161   40                              -1.86

 3.00 Prom +  70231  70270   40                              -5.76
 3.01 Init +  71326  71353   28  0  1   26  115    34 0.470  -0.34
 3.02 Intr +  79511  79702  192  0  0   52   -4   160 0.046   2.66
 3.03 Intr +  86399  86470   72  0  0   92   66    46 0.008   2.18
 3.04 Intr +  94417  94529  113  2  2   73   90    44 0.018   3.20
 3.05 Intr +  95251  95338   88  0  1   85   68    62 0.020   3.44
 3.06 Intr +  99855 100241  387  1  0   81  113   375 0.017  34.06
 3.07 Intr + 102650 103186  537  0  0  127   91   616 0.999  58.69
 3.08 Intr + 108338 108483  146  0  2   97  119    95 0.965  13.50
 3.09 Intr + 113818 113943  126  0  0   94  101    78 0.994  10.58
 3.10 Intr + 125228 125407  180  1  0   38   89   136 0.447   8.76
 3.11 Term + 128039 128149  111  1  0   76   49    45 0.286  -2.04
 3.12 PlyA + 128489 128494    6                               1.05

 4.03 PlyA - 128647 128642    6                              -0.45
 4.02 Term - 140703 140405  299  1  2   81   41   171 0.920   7.13
 4.01 Init - 142358 142343   16  2  1   78  105    22 0.842   3.41
 4.00 Prom - 151163 151124   40                              -5.46

 5.00 Prom + 152507 152546   40                              -3.76
 5.01 Init + 169088 169223  136  1  1   56   26   108 0.059   1.71
 5.02 Intr + 186586 186690  105  2  0   42  105    90 0.150   6.29
 5.03 Term + 194374 194624  251  2  2   80   45    77 0.034  -1.53
 5.04 PlyA + 194820 194825    6                               1.05

 6.00 Prom + 198202 198241   40                              -6.26
 6.01 Sngl + 205605 206027  423  2  0   64   44   332 0.987  22.70
 6.02 PlyA + 212004 212009    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  93964  93869   96  1  0   51   37   207 0.874  10.21
S.002 Init + 100001 100241  241  1  1   97  113   354 0.931  36.84

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:7955656_8174020|GENSCAN_predicted_peptide_1|784_aa
XEEEEQVPTDGGTSAEAMQVPLEEDDELEEEEIINDENFLGKRPLDSPEAEELPAMKRPR
LLSTKGDTLDVVLLEAREPLSSINTQKIPPMLSPVHVQDSTDLAPPSPEPPMLAPVAKSQ
MPTAKPLETKSFTPKTKTKTSSPGQKTKSPKTAQSPAMVGSPIRSPKTVSKEKKSPGRSK
SPKSPKSPKVTTHIPQTPVRPETPNRTPSATLSEKISKETIQVKQIQTPPDAGKLNSENQ
PKKAVVADKTIEASIDAVIARACAEREPDPFEFSSGSESEGDIFTSPKRISGPECTTPKA
STSANNFTKSGSTPLPLSGGTSSSDNSWTMDASIDEVVRKAKLGTPSNMPPNFPYISSPS
VSPPTPEPLHKVYEEKTKLPSSVEVKKKLKKELKTKMKKKEKQRDREREKDKNKDKSKEK
DKVKEKEKDKETGRETKYPWKEFLKEEEADPYKFKIKEFEDVDPKVKLKDGLVRKEKEKH
KDKKKDREKGKKDKDKREKEKVKDKGREDKMKAPAPPLVLPPKELALPLFSPATASRVPA
MLPSLLPVLPEKLFEEKEKVKEKEKKKDKKEKKKKKEKEKEKKEKEREKEKREREKREKE
KEKHKHEKVSSFSFLALSEQSPSEKHSVISKVVPAPEAKPAPSQNRPKTPPPAPAPAPGP
MLVSPAPVPLPLLAQAAAGPALLPSPGPAASGASAKAPVRSVVTETVSTYVIRDEWGNQI
WICPGCNKPDDGSPMIGCDDCDDWYHWPCVGIMTAPPEEMQWFCPKCANKKKDKKHKKRK
HRAH

>gi568815588f:7955656_8174020|GENSCAN_predicted_CDS_1|2355_bp
naagaagaagaagagcaggtgcccactgatggaggcacatcagcagaagccatgcaggtt
cccttggaagaagatgatgaattggaggaggaagaaattattaatgatgagaatttcctg
ggcaagagaccactggatagtcctgaagctgaagaactgccagccatgaagcggcctcgg
ctattaagcactaaaggggacacgctagatgttgtgttattggaagctcgagagccactc
agctcaataaatactcaaaagatcccaccaatgctttctccagtccatgtacaggacagt
acagacttggcacctccctcacccgagccgccaatgttggctccagttgcaaaatcacaa
atgccaactgcaaaaccattagaaacaaagtcatttacacctaaaacaaagactaaaact
agctctccaggacagaagactaaatcacctaaaaccgcccagtcaccagcaatggtcgga
agtcctattcgatcaccaaaaactgtatccaaagaaaagaaatcacctggacgttccaag
agccccaagagtcccaagagccccaaggtcacgactcacattccccaaacacctgtgaga
cctgaaacgcccaacaggactccttcagctacactcagtgaaaaaatcagtaaagagact
atccaggtaaaacaaatacagacaccccctgatgctgggaaactgaacagtgagaatcag
ccgaaaaaggctgtggtagcagataaaacgattgaggcctctatcgatgctgtgattgca
cgagcctgtgctgagcgagagccagatcctttcgaattttcttctggatcggaatctgaa
ggagacatttttactagccctaagagaatttcaggcccggagtgtactactcccaaagct
tccacttccgcgaacaatttcacaaagtcaggatccactcctctgcctctttccggtgga
acctcaagttccgataactcatggacaatggatgcctccattgatgaggttgtacgtaaa
gcaaaactgggaacaccttcaaatatgccccccaactttccttatatctcttctccgtca
gtgtctcctcccactcccgaacctctccacaaggtgtatgaggagaaaaccaagctgcct
tcctccgtggaggtaaagaagaagttgaaaaaggaactaaagactaaaatgaaaaagaaa
gaaaagcagagagatagggagagggaaaaagacaagaacaaggacaaaagtaaggagaag
gataaagtgaaagagaaagagaaagacaaggaaactggcagggaaacaaagtatccctgg
aaggaatttcttaaagaggaagaggcagatccctacaagtttaaaatcaaagaatttgaa
gatgttgatcccaaagtgaaattgaaagatggacttgtgaggaaggagaaagagaagcat
aaagataagaagaaagatagagagaaaggcaagaaagataaagataagagagagaaagaa
aaagtgaaagataaaggcagagaagataagatgaaagccccagcacccccactggtgttg
cccccaaaagagttggccctgcccttgttcagccctgccacagcctccagggtcccagcc
atgctgccatctttgttgccagtgcttccggaaaaactgtttgaggagaaagagaaggtg
aaggagaaagaaaagaaaaaggacaaaaaggagaagaagaaaaagaaggaaaaagagaag
gagaagaaggagaaggaaagagagaaagagaagagagagcgagagaagagagaaaaagag
aaggagaaacacaagcatgaaaaagtaagcagtttctcatttttggccctatctgaacag
agtcctagtgagaaacacagtgtcatcagcaaggtggtccctgcccccgaggccaagccg
gcgccctcgcagaacaggccgaagaccccaccgccggcccccgcgcccgcccccggcccc
atgctcgtcagccctgcgcccgtgccgctgccgctgctcgcccaggccgccgcgggccct
gccctgctgccctccccgggtcccgccgcctccggggccagtgccaaagcccccgtgcgc
agcgtggtgactgagacggtcagcacctacgtgatccgagatgagtggggcaatcagatc
tggatctgccctgggtgtaacaagcctgacgatgggagtcccatgattgggtgtgacgac
tgcgatgactggtaccactggccctgtgttggaatcatgactgcacccccagaagagatg
cagtggttctgccccaagtgtgcgaacaagaagaaggacaaaaagcacaagaagaggaag
catcgagcccactga

>gi568815588f:7955656_8174020|GENSCAN_predicted_peptide_2|55_aa
MQLVRKYGYAVSASWTSLLPGHGKDEDDNEGEALFLSSVITDLAEITVDFRESQN

>gi568815588f:7955656_8174020|GENSCAN_predicted_CDS_2|168_bp
atgcagttggtgaggaagtatggctatgctgtgagtgcttcctggaccagcctccttcct
ggccatgggaaagatgaggatgacaatgaaggagaagccttgtttctgtccagcgtcatt
acagacctggctgaaatcacagtggatttcagagaaagccagaattaa

>gi568815588f:7955656_8174020|GENSCAN_predicted_peptide_3|659_aa
MEPPNGFTQDVATASSHPEGPSHPEVVREDPDGRIGQWRDPWSSRLLCRCVNDRDTQGNP
YQHRAGQRYPAPPPGSALNPHLRSMNARPGTKGYKAGVALYNSHTEGKASKERSRRLPKR
TLPSAQPRPLFQSAPVGQRAGARGRFSRHLGPAELAWKRRLPGSAFPSPHRKQIIQRPPT
LRRQEPPDLPGGPPSLPARGFRARREGASTAEAMEVTADQPRWVSHHHPAVLNGQHPDTH
HPGLSHSYMDAAQYPLPEEVDVLFNIDGQGNHVPPYYGNSVRATVQRYPPTHHGSQVCRP
PLLHGSLPWLDGGKALGSHHTASPWNLSPFSKTSIHHGSPGPLSVYPPASSSSLSGGHAS
PHLFTFPPTPPKDVSPDPSLSTPGSAGSARQDEKECLKYQVPLPDSMKLESSHSRGSMTA
LGGASSSTHHPITTYPPYVPEYSSGLFPPSSLLGGSPTGFGCKSRPKARSSTEGRECVNC
GATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCANCQTTTTT
LWRRNANGDPVCNACGLYYKLHNLPRQPPPTVTANSEQTLPGFPAIANPMILSQELCDFP
PLQVDHSQAGHMSWPVFDKFETQSPESAHSSITSHIHKGGEGEAAASLPAARTDHQQLR

>gi568815588f:7955656_8174020|GENSCAN_predicted_CDS_3|1980_bp
atggagccacccaacggctttacacaagacgtggccaccgcctcttctcatcccgaaggt
cccagccatcccgaggtggtcagagaagaccccgacgggcggatcgggcagtggagagac
ccgtggtcctcgcgcctgctgtgtcgctgtgtcaatgaccgtgacactcagggcaatcct
taccaacacagagcaggacaacgataccccgcacccccccctggttcagctctgaatcca
cacctgagaagcatgaacgcccgcccagggacaaagggctacaaggctggtgtcgccctt
tacaattcccacaccgaaggcaaagcaagcaaggaaagaagccggcgcctccccaagcgg
accctcccttccgcgcagcctcggccactcttccagagcgccccggtcgggcagcgcgcg
ggcgcccggggccgcttctcccggcacctcggccccgcggagctcgcctggaagcgccgg
ttgcctggctctgccttcccatccccccaccgaaagcaaatcattcaacgacccccgacc
ctccgacggcaggagccccccgacctcccaggcggaccgccctccctccccgcgcgcggg
ttccgggcccggcgagagggcgcgagcacagccgaggccatggaggtgacggcggaccag
ccgcgctgggtgagccaccaccaccccgccgtgctcaacgggcagcacccggacacgcac
cacccgggcctcagccactcctacatggacgcggcgcagtacccgctgccggaggaggtg
gatgtgctttttaacatcgacggtcaaggcaaccacgtcccgccctactacggaaactcg
gtcagggccacggtgcagaggtaccctccgacccaccacgggagccaggtgtgccgcccg
cctctgcttcatggatccctaccctggctggacggcggcaaagccctgggcagccaccac
accgcctccccctggaatctcagccccttctccaagacgtccatccaccacggctccccg
gggcccctctccgtctaccccccggcctcgtcctcctccttgtcggggggccacgccagc
ccgcacctcttcaccttcccgcccaccccgccgaaggacgtctccccggacccatcgctg
tccaccccaggctcggccggctcggcccggcaggacgagaaagagtgcctcaagtaccag
gtgcccctgcccgacagcatgaagctggagtcgtcccactcccgtggcagcatgaccgcc
ctgggtggagcctcctcgtcgacccaccaccccatcaccacctacccgccctacgtgccc
gagtacagctccggactcttcccccccagcagcctgctgggcggctcccccaccggcttc
ggatgcaagtccaggcccaaggcccggtccagcacagaaggcagggagtgtgtgaactgt
ggggcaacctcgaccccactgtggcggcgagatggcacgggacactacctgtgcaacgcc
tgcgggctctatcacaaaatgaacggacagaaccggcccctcattaagcccaagcgaagg
ctgtctgcagccaggagagcagggacgtcctgtgcgaactgtcagaccaccacaaccaca
ctctggaggaggaatgccaatggggaccctgtctgcaatgcctgtgggctctactacaag
cttcacaatcttccaagacagcctccacccacagtcacagccaactctgagcagacgctg
cctggctttcccgccatcgccaacccgatgatcctgagccaggagctctgtgactttcct
ccactccaggtagaccattcccaggccgggcacatgagctggcccgtctttgataaattt
gagacccagagtccagagtcagctcactcctccattacgtcgcacatccacaaaggaggg
gaaggagaggcagcggcatcgctgcccgccgcacggacggatcaccagcagctccgatga

>gi568815588f:7955656_8174020|GENSCAN_predicted_peptide_4|104_aa
MGPKPGSTKASGGPAQDNSHDPSTLYTALNLNPNTLTHYASHTAGLMQKSLQIVISRGSR
CGHRSCSLSLQNCNPACHSGTYGPGPINCHQATDGCSSPPDSDD

>gi568815588f:7955656_8174020|GENSCAN_predicted_CDS_4|315_bp
atggggcccaagcctggatccaccaaagcttcaggtggacctgctcaggataactcacat
gacccctcgacgctgtacacagctttaaatctgaacccaaatacactcacccattatgcg
agccacactgccggtttgatgcaaaaatccctgcagattgtcatcagtcgtggttcacgc
tgtggtcaccgcagctgttctttgtcattgcagaattgcaaccctgcatgtcactctggg
acatatggccctgggcctataaactgccaccaggccaccgacggctgctcctcaccgcct
gacagtgatgattaa

>gi568815588f:7955656_8174020|GENSCAN_predicted_peptide_5|163_aa
MNGGEFDSKVSLRLWALYGSICTKHHSECGKLMQMNYREVALVCSRRFDIAIFLVKELGP
EKCSKAMFFQNLPVSFGFESARCLSVPADVEEGTKALPGHLLRQQRCLQDGSFPLMVYDP
PSSYLGPASRLKNENERNSQTHVDPGSEAICSLRAMAKCRAFD

>gi568815588f:7955656_8174020|GENSCAN_predicted_CDS_5|492_bp
atgaatggaggggagtttgactctaaagtcagcctcaggctttgggcactgtatggctcc
atctgcaccaagcaccatagtgaatgtgggaagttaatgcagatgaattaccgagaagtg
gctctagtgtgctccagaagatttgacattgccatctttttggtcaaggagctgggacct
gagaagtgtagcaaggctatgttttttcagaaccttcccgtttcttttggatttgagtct
gctaggtgccttagcgtgcccgctgatgttgaggaggggacaaaagccctccctggccac
cttctgaggcagcagaggtgcctgcaagatggtagctttcctcttatggtttacgatcct
ccttcttcctacctggggccagcgagcaggctaaagaacgaaaatgagcggaatagtcag
acgcacgttgatccggggtctgaagcaatctgcagcctcagagctatggcaaaatgcaga
gcgtttgactga

>gi568815588f:7955656_8174020|GENSCAN_predicted_peptide_6|140_aa
MANRTLKDAHSVRGTNPQYLVGKIIRMRICESKHWKEECFGLMAELVVDNAMELMFVGGE
YGGNIKPTPFLCLILKMLQIQSEKGITAEFIENEDFKYVHMLGALYMRLMGTAIDCYKYL
EPLYNDYRKIKSQNRNGGLN

>gi568815588f:7955656_8174020|GENSCAN_predicted_CDS_6|423_bp
atggctaaccgtacactgaaggatgcgcacagcgtccgcggcaccaaccctcaatatctg
gtggggaagatcattcgaatgcgaatctgtgagtccaagcactggaaagaggagtgcttt
ggacttatggctgaacttgtagtcgataacgccatggagttaatgtttgtgggtggcgaa
tatggtggcaacataaaaccaacaccctttctgtgtttaatcttgaagatgcttcagatt
caatccgagaaaggtatcactgcagagtttatagaaaatgaagatttcaagtatgtccac
atgttgggggcactttacatgaggctgatgggcactgcaattgattgctacaagtacttg
gagcctttgtacaatgactatcgaaaaatcaagagccagaacagaaatgggggtttgaac
tga