GENSCAN 1.0 Date run: 2-Nov-116 Time: 22:16:44 Sequence gi568815585r:83779420_83981507 : 202088 bp : 35.01% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 44965 45428 464 2 2 92 53 149 0.399 6.03 1.02 PlyA + 45615 45620 6 1.05 2.02 PlyA - 46848 46843 6 1.05 2.01 Sngl - 81066 80659 408 0 0 69 39 163 0.930 5.54 2.00 Prom - 89854 89815 40 -6.25 3.09 PlyA - 91087 91082 6 1.05 3.08 Term - 102259 99998 2262 1 0 65 47 1787 0.620 156.65 3.07 Intr - 102708 102585 124 2 1 36 86 89 0.594 3.17 3.06 Intr - 102953 102869 85 0 1 138 68 78 0.402 8.76 3.05 Intr - 127308 127004 305 2 2 42 80 234 0.230 13.31 3.04 Intr - 128362 127916 447 0 0 74 -11 353 0.478 15.54 3.03 Intr - 132087 132045 43 1 1 2 121 63 0.607 -2.62 3.02 Intr - 133715 133398 318 0 0 83 90 157 0.247 10.41 3.01 Init - 176086 176047 40 1 1 79 79 56 0.327 4.20 3.00 Prom - 179777 179738 40 -7.35 4.02 PlyA - 180015 180010 6 1.05 4.01 Sngl - 182819 182010 810 2 0 44 48 311 0.787 18.23 4.00 Prom - 183077 183038 40 -5.25 5.02 PlyA - 183134 183129 6 -3.24 5.01 Sngl - 184091 183177 915 2 0 48 43 702 0.995 57.76 5.00 Prom - 184138 184099 40 -18.07 6.00 Prom + 184140 184179 40 -18.08 6.01 Init + 184186 186287 2102 0 2 70 53 721 0.577 54.34 6.02 Term + 188262 188376 115 0 1 84 38 82 0.825 -0.14 6.03 PlyA + 188448 188453 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 52598 52645 48 1 0 60 87 70 0.810 5.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:83779420_83981507|GENSCAN_predicted_peptide_1|154_aa XVGQELKTHRMVGLKELVTQMAQTCPLSPHCGKQEGEKKGEKSCSPSGSLDLGPPQASAV TPTLGLCSSWHLQASRHHCVPRCLQWKPLAGHLVQLQPCRKPTPMPVPGAACPTTAACLV VCSGQTLRSLSYTPLVTLPQAHLWQHEIQASSMS >gi568815585r:83779420_83981507|GENSCAN_predicted_CDS_1|465_bp natgtgggacaagaactcaagacccacagaatggtgggattgaaagagctggtaacacaa atggctcaaacatgccccctctctccacattgtgggaaacaagaaggagagaagaaagga gagaagagctgcagcccttcagggagcctagacctaggacctccccaagccagtgctgtg acacccactttagggctctgcagttcctggcatctccaagcttccaggcaccactgtgtt ccccggtgcctgcagtggaagccacttgcaggacacttggtccagctgcagccttgcagg aagccgacgcccatgccagtgcctggagctgcctgccccaccacagctgcttgcctggtt gtgtgcagtggccagactctgcgctcgctctcttacacaccccttgtcactctgccccag gctcacctttggcagcatgagatccaggccagtagcatgagctga >gi568815585r:83779420_83981507|GENSCAN_predicted_peptide_2|135_aa MKESKKESKKERKKESKKERKKERKKERERERERERKKERKKERKKERERERKRERGKKK ERKRERKERKKRKKKKRKEEKRKKKKERERKRKERKEGEERERKKEGKKERKRKKKGRKE ERKRKKERKRIIRGD >gi568815585r:83779420_83981507|GENSCAN_predicted_CDS_2|408_bp atgaaagaaagtaagaaagaaagtaagaaagaaagaaagaaagaaagtaagaaagaaaga aagaaagaaagaaagaaagaaagagagagagagagagagagagagagaaagaaagaaaga aagaaagaaagaaagaaagaaagagaaagagagagaaagagagaaagaggaaagaaaaaa gaaagaaagagagaaagaaaagaaagaaagaaaagaaaaaagaaaaagagaaaggaagaa aaaagaaagaagaagaaagagagagaaagaaagaggaaagaaaggaaggaaggagaagaa agagaaagaaagaaggaaggaaagaaagaaagaaaaagaaagaagaaaggaaggaaagaa gaaagaaaaagaaagaaagaaagaaaaagaatcatcaggggtgattaa >gi568815585r:83779420_83981507|GENSCAN_predicted_peptide_3|1207_aa MLTVQMKLNTETAETKETCFICGPKTLALVTDWEGSLPLVFNHCRDASLSIHPSFKGVRP CRDACLSPSPLAASPAFLGEGQVPQPRISAPQSLISTPQPLISVPQSLISTTPLPTFLED HQGRRASGNSHSGRAGSRLGERLPALPVSGPSGAEVAGRGYAAHRGPDENLVSKPPDETR TANAGFTAEQPLLGVSPRARAFHSPSSGLANGLQLLCPWAPLPGPQAPMMPPGSFWGLSQ VEQEALASAWASCCGQPLAYHPPSPGSGVDTLGPALSTGPWRLESGAAGTCRCAWTEAVP HVPTLQNVLTCPRPHSSLGTKQNDSETALPAESPEVPRMGGTEAPPGEDSPSLMDRPFLV QIPDCTHRTLDKSASVHIAPECPGDNSPPALRPPPRNSERAEEREGEQEKKSYTGGTGQH LGGTSGQPATTARTPPAASTTEAAENPALYQRGKCGGVREEGIHPHPPKPFSSPFLASDI GALNELELCLWRAGWSLLLCDEIGDELLALKMLLWILLLETSLCFAAGNVTGDVCKEKIC SCNEIEGDLHVDCEKKGFTSLQRFTAPTSQFYHLFLHGNSLTRLFPNEFANFYNAVSLHM ENNGLHEIVPGAFLGLQLVKRLHINNNKIKSFRKQTFLGLDDLEYLQADFNLLRDIDPGA FQDLNKLEVLILNDNLISTLPANVFQYVPITHLDLRGNRLKTLPYEEVLEQIPGIAEILL EDNPWDCTCDLLSLKEWLENIPKNALIGRVVCEAPTRLQGKDLNETTEQDLCPLKNRVDS SLPAPPAQEETFAPGPLPTPFKTNGQEDHATPGSAPNGGTKIPGNWQIKIRPTAAIATGS SRNKPLANSLPCPGGCSCDHIPGSGLKMNCNNRNVSSLADLKPKLSNVQELFLRDNKIHS IRKSHFVDYKNLILLDLGNNNIATVENNTFKNLLDLRWLYMDSNYLDTLSREKFAGLQNL EYLNVEYNAIQLILPGTFNAMPKLRILILNNNLLRSLPVDVFAGVSLSKLSLHNNYFMYL PVAGVLDQLTSIIQIDLHGNPWECSCTIVPFKQWAERLGSEVLMSDLKCETPVNFFRKDF MLLSNDEICPQLYARISPTLTSHSKNSTGLAETGTHSNSYLDTSRVSISVLVPGLLLVFV TSAFTVVGMLVFILRNRKRSKRRDANSSASEINSLQTVCDSSYWHNGPYNADGAHRVYDC GSHSLSD >gi568815585r:83779420_83981507|GENSCAN_predicted_CDS_3|3624_bp atgctgactgtccagatgaagctaaacactgagacagcagagacaaaggagacttgtttt atctgtggacccaaaactctggcactggtcacggactgggaaggcagccttcccttggtg tttaatcattgcagggacgcctctctgagtattcacccaagtttcaaaggtgtcagacca tgcagggatgcctgccttagtccttcacccttagcggcaagtcccgcttttctgggggag gggcaagtaccccaacctcgtatctctgcaccccaatcccttatttccacaccccaacct cttatttctgtgccccaatcccttatttccacgactccccttcccacttttctggaggac catcagggacgccgagcttcgggtaactctcacagtggaagagcaggttcgcgccttgga gagcgtcttccggcactaccagtatctgggccctctggagcggaagtggctggccgggga tatgcagctcacagaggtccagatgaaaacctggtttcaaaaccgccagatgaaacacga acggcaaatgcaggattcacagctgaacagccccttcttggggtctctccacgcgcccgg gctttccactcaccgtcttctggccttgccaacggcctgcagctgctgtgcccttgggct cccctgccagggccccaggctccgatgatgccccctggttccttctggggtctctcacaa gtggaacaagaggccctggcctctgcgtgggcttcctgctgcgggcagcctctggcctac caccccccaagcccaggaagtggcgtggatacactgggaccagccctgtccacagggccc tggaggctggaaagcggggcagctgggacctgccgctgtgcctggactgaagccgtcccc cacgtccctaccctccagaatgtgctcacctgtccccgtcctcacagcagcctcgggaca aaacagaatgactcagagacagcacttcctgcagaaagtccggaagtgcccagaatggga ggcacggaagcccctcccggggaggactctccctcactgatggacagaccattcttggtg cagattcctgactgcacgcaccgaaccttagacaagagcgcatccgtccacatcgcccca gagtgcccaggagacaactcgccccccgccctccgcccccctccacgtaattccgaaaga gcagaagaaagagaaggagaacaggaaaagaagagctacaccgggggaactggacagcac ctcggggggacttctgggcaacccgcaaccacagcaagaactccaccagcagcctcaaca acagaagccgcggaaaaccctgctttgtatcagagaggcaagtgcgggggggttagggag gaaggaatccacccccacccccccaaacccttttcttctcctttcctggcttcggacatt ggagcactaaatgaacttgaattgtgtctgtggcgagcaggatggtcgctgttactttgt gatgagatcggggatgaattgctcgctttaaaaatgctgctttggattctgttgctggag acgtctctttgttttgccgctggaaacgttacaggggacgtttgcaaagagaagatctgt tcctgcaatgagatagaaggggacctacacgtagactgtgaaaaaaagggcttcacaagt ctgcagcgtttcactgccccgacttcccagttttaccatttatttctgcatggcaattcc ctcactcgacttttccctaatgagttcgctaacttttataatgcggttagtttgcacatg gaaaacaatggcttgcatgaaatcgttccgggggcttttctggggctgcagctggtgaaa aggctgcacatcaacaacaacaagatcaagtcttttcgaaagcagacttttctggggctg gacgatctggaatatctccaggctgattttaatttattacgagatatagacccgggggcc ttccaggacttgaacaagctggaggtgctcattttaaatgacaatctcatcagcacccta cctgccaacgtgttccagtatgtgcccatcacccacctcgacctccggggtaacaggctg aaaacgctgccctatgaggaggtcttggagcaaatccctggtattgcggagatcctgcta gaggataacccttgggactgcacctgtgatctgctctccctgaaagaatggctggaaaac attcccaagaatgccctgatcggccgagtggtctgcgaagcccccaccagactgcagggt aaagacctcaatgaaaccaccgaacaggacttgtgtcctttgaaaaaccgagtggattct agtctcccggcgccccctgcccaagaagagacctttgctcctggacccctgccaactcct ttcaagacaaatgggcaagaggatcatgccacaccagggtctgctccaaacggaggtaca aagatcccaggcaactggcagatcaaaatcagacccacagcagcgatagcgacgggtagc tccaggaacaaacccttagctaacagtttaccctgccctgggggctgcagctgcgaccac atcccagggtcgggtttaaagatgaactgcaacaacaggaacgtgagcagcttggctgat ttgaagcccaagctctctaacgtgcaggagcttttcctacgagataacaagatccacagc atccgaaaatcgcactttgtggattacaagaacctcattctgttggatctgggcaacaat aacatcgctactgtagagaacaacactttcaagaaccttttggacctcaggtggctatac atggatagcaattacctggacacgctgtcccgggagaaattcgcggggctgcaaaaccta gagtacctgaacgtggagtacaacgctatccagctcatcctcccgggcactttcaatgcc atgcccaaactgaggatcctcattctcaacaacaacctgctgaggtccctgcctgtggac gtgttcgctggggtctcgctctctaaactcagcctgcacaacaattacttcatgtacctc ccggtggcaggggtgctggaccagttaacctccatcatccagatagacctccacggaaac ccctgggagtgctcctgcacaattgtgcctttcaagcagtgggcagaacgcttgggttcc gaagtgctgatgagcgacctcaagtgtgagacgccggtgaacttctttagaaaggatttc atgctcctctccaatgacgagatctgccctcagctgtacgctaggatctcgcccacgtta acttcgcacagtaaaaacagcactgggttggcggagaccgggacgcactccaactcctac ctagacaccagcagggtgtccatctcggtgttggtcccgggactgctgctggtgtttgtc acctccgccttcaccgtggtgggcatgctcgtgtttatcctgaggaaccgaaagcggtcc aagagacgagatgccaactcctccgcgtccgagattaattccctacagacagtctgtgac tcttcctactggcacaatgggccttacaacgcagatggggcccacagagtgtatgactgt ggctctcactcgctctcagactaa >gi568815585r:83779420_83981507|GENSCAN_predicted_peptide_4|269_aa MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTR QKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCR RTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE NKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ EITKIRAELKEIETQKTLQKINESRSWFF >gi568815585r:83779420_83981507|GENSCAN_predicted_CDS_4|810_bp atggtaaagggatcaattcaacaagaggagctaactatcctaaatatttatgcacccaat acaggagcacccagattcataaagcaagtcctgagtgacctacaaagagacttagactcc cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaacgaga cagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcagacctaata gacatctacagaactctccaccccaaatcaacagaatatacatttttttcagcaccacac cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaga agaacagaaattataacaaactatctctcagaccacagtgcaatcaaactagaactcagg attaagaatctcactcaaagccgctcaactacatggaaactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag aacaaagacaccacataccagaatctctgggacgcattcaaagcagtgtgtagagggaaa ttcatagcactaaatgcctacaagagaaagcaggaaagatccaaaattgacaccctaaca tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa atcaatgaatccaggagctggtttttttga >gi568815585r:83779420_83981507|GENSCAN_predicted_peptide_5|304_aa MENDFDELREEGFRRSNYSELREDIQTKGKEVENFEKNLEECITRITNTEKCLKELMELK TKARELREECRSLRSRCDQLEERVSAMEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVK RPNLRLIGVPESDVENGTKLENTLQDIIQENFPNLARQANVQIQEIQRTPQRYSSRRATP RHIIVRFTKVEMKEKMLRAAREKGRVTLKGKPIRLTADLLAETLQARREWGPIFNILKEK NFQPRISYPAKLSFISEGEIKYFIDKPMLRYFVTTRPALKELLKEALNMERNNRYQPLQN HDKM >gi568815585r:83779420_83981507|GENSCAN_predicted_CDS_5|915_bp atggagaatgattttgacgagctgagagaagaaggcttcagacgatcaaattactctgag ctacgggaggacattcaaaccaaaggcaaagaagttgaaaactttgaaaaaaatttagaa gaatgtataactagaataaccaatacagagaagtgcttaaaggagctgatggagctgaaa accaaggctcgagaactacgtgaagaatgcagaagcctcaggagccgatgcgatcaactg gaagaaagggtatcagcaatggaagatgaaatgaatgaaatgaagcgagaagggaagttt agagaaaaaagaataaaaagaaatgagcaaagcctccaagaaatatgggactatgtgaaa agaccaaatctacgtctgattggtgtacctgaaagtgatgtggagaatggaaccaagttg gaaaacactctgcaggatatcatccaggagaacttccccaatctagcaaggcaggccaac gttcagattcaggaaatacagagaacgccacaaagatattcctcgagaagagcaactcca agacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggcagcc agagagaaaggtcgggttaccctcaaaggaaagcccatcagactaacagcggatctcttg gcagaaaccctacaagccagaagagagtgggggccaatattcaacattcttaaagaaaag aattttcaacccagaatttcatatccagccaaactaagcttcataagtgaaggagaaata aaatactttatagacaagccaatgctgagatattttgtcaccaccaggcctgccctaaaa gagctcctgaaggaagcgctaaacatggaaaggaacaaccggtaccagccgctgcaaaat catgacaaaatgtaa >gi568815585r:83779420_83981507|GENSCAN_predicted_peptide_6|738_aa MDTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPIKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKEGVLPNSFYEASIILRPKPGRDTTKKENFRPISLMNIDAKILNKILA NRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKI QQPFMLKTLNKLHIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKV SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTSDVKDLFKENYKPL LKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVINRFNAIPIKLPMTFFTELEKTTLKFI WNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTACYWYQNRDIDQWNRTEPSEIM PRIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLN VRPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIR VNRQPTTWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAA KKHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNSSGAENKGSYFENEHSQDYE QPAFLMQRHLQEHLYFSM >gi568815585r:83779420_83981507|GENSCAN_predicted_CDS_6|2217_bp atggatacattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggctctgaaattgtggcaataatcaatagtttaccaatcaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg gtaccattccttctgaaactattccaatcaatagaaaaagagggagtcctccctaactca ttttatgaggccagcatcattctgagaccaaagccgggcagagacacaaccaaaaaagag aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacaga gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaatt caacaacccttcatgctaaaaactctcaataaattacatattgatgggacgtatttcaaa ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc aacatagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtatt caattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtttatcta gaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaa acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac ctaggaatccaacttacaagtgatgtgaaggacctcttcaaggagaactacaaaccactg ctcaaggaaataaaagaggacacaaacaaatggaagaacattccatgctcatgggtagga agaatcaatatcgtgaaaatggccatactgcccaaggtaattaacagattcaatgccatc cccatcaagctaccaatgactttcttcacagaattggaaaaaactacgttaaagttcata tggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctgga ggcatcacactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatgc tactggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataatg ccacgtatctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaag gattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaa ctggatcccttccttacaccttatacaaaaatcaattcaagatggattaaagatttaaat gttagacctaaaaccataaaaaccctagaagaaaacctaggcattaccattcaggacata ggcgtgggcaaggacttcatgtccaaaacaccaaaagcaatggcaacaaaagccaaaatt gacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcaga gtgaacaggcaacctacaacatgggagaaaattttcgcaacctactcatctgacaaaggg ctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaacccc atcaaaaagtgggcgaaggacatgaacagacacttctcaaaagaagacatttatgcagcc aaaaaacacatgaagaaatgctcatcatcactggccatcagagaaatgcaaatcaaaacc actatgagatatcatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaac agctctggtgctgaaaacaaaggtagctactttgaaaatgaacatagtcaagattatgaa caaccagcctttttaatgcagagacacctacaagagcatctctacttctctatgtaa