GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:54:59 Sequence gi568815594r:176583929_176892311 : 308383 bp : 36.69% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12709 12871 163 0 1 89 28 167 0.090 9.43 1.02 Intr + 16294 16361 68 2 2 96 77 10 0.056 -1.59 1.03 Intr + 22974 23150 177 2 0 56 37 185 0.311 9.39 1.04 Term + 24484 24555 72 0 0 81 49 79 0.881 0.23 1.05 PlyA + 25016 25021 6 1.05 2.03 PlyA - 26310 26305 6 1.05 2.02 Term - 41714 41170 545 0 2 57 48 280 0.803 14.34 2.01 Init - 46239 46125 115 0 1 78 60 94 0.791 6.02 2.00 Prom - 46841 46802 40 -7.95 3.04 PlyA - 46938 46933 6 -0.45 3.03 Term - 47619 47475 145 2 1 100 32 181 0.194 10.10 3.02 Intr - 50972 50825 148 0 1 49 64 92 0.060 1.27 3.01 Init - 74785 74695 91 1 1 79 52 103 0.608 6.50 3.00 Prom - 91122 91083 40 -6.05 4.05 PlyA - 92546 92541 6 1.05 4.04 Term - 100112 99998 115 1 1 108 44 109 0.650 5.56 4.03 Intr - 103592 103259 334 0 1 21 110 208 0.082 10.01 4.02 Intr - 104043 103893 151 0 1 48 103 77 0.070 4.01 4.01 Init - 104920 104813 108 1 0 76 69 69 0.060 4.07 4.00 Prom - 105210 105171 40 -7.95 5.00 Prom + 106653 106692 40 -8.45 5.01 Init + 109092 109803 712 2 1 88 47 606 0.676 51.60 5.02 Intr + 111388 111635 248 2 2 46 41 205 0.134 7.86 5.03 Term + 111928 112863 936 0 0 -27 36 332 0.124 7.62 5.04 PlyA + 112971 112976 6 1.05 6.00 Prom + 113735 113774 40 -3.65 6.01 Init + 129749 129937 189 1 0 21 89 160 0.335 8.46 6.02 Intr + 143105 143191 87 1 0 29 85 90 0.000 2.05 6.03 Intr + 177841 177999 159 0 0 76 116 29 0.373 3.76 6.04 Term + 183886 184146 261 0 0 55 44 247 0.767 11.54 6.05 PlyA + 185120 185125 6 1.05 7.03 PlyA - 185638 185633 6 1.05 7.02 Term - 207870 207685 186 0 0 64 50 68 0.555 -2.89 7.01 Init - 208383 208237 147 0 0 92 90 243 0.985 23.04 7.00 Prom - 224103 224064 40 -6.25 8.00 Prom + 242863 242902 40 -3.65 8.01 Init + 244735 244900 166 0 1 52 101 115 0.779 9.04 8.02 Intr + 250993 251100 108 2 0 90 89 16 0.427 1.24 8.03 Term + 256487 256623 137 0 2 70 40 120 0.408 2.60 8.04 PlyA + 258230 258235 6 1.05 9.02 PlyA - 258411 258406 6 1.05 9.01 Sngl - 286962 286630 333 0 0 71 44 193 0.919 8.97 9.00 Prom - 307261 307222 40 -2.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 103999 103893 107 2 2 95 103 73 0.870 8.51 S.002 Term - 140979 140828 152 1 2 85 39 96 0.864 1.49 S.003 Intr - 145818 145605 214 0 1 88 106 144 0.996 13.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:176583929_176892311|GENSCAN_predicted_peptide_1|159_aa AVKEDVELLSFTFGDMKNDPEIWRKYRGHQQSSRKPSGSSKTQESSGVWLNAQCDWVNSK ALSSNSKVLSSAWLILLKRKRYRHSEGYVKMEREAGFMQPPATEQQRQPLEVEKAPPSHG AFPCRRLGFRFPASGTSTTAAAAKCYRKVAVKIHSIGQC >gi568815594r:176583929_176892311|GENSCAN_predicted_CDS_1|480_bp gcagtcaaagaggatgtggagctactatcattcacttttggagatatgaaaaatgacccg gagatttggagaaagtatagagggcaccagcagagctctcgcaagcctagtggcagcagc aaaacacaagaatccagtggtgtttggttaaatgcccagtgtgattgggttaattcaaaa gccttatcttcaaactctaaagttctttcttctgcttggttgattctattgaagaggaaa agatacagacacagcgaaggctatgtgaagatggagagagaggctggatttatgcagcca ccagccacagaacaacagcgacagccactagaagttgaaaaggctccaccttctcacgga gcatttccttgccgacgccttggtttcaggtttccggcttctggaactagcaccacagca gcagcagccaaatgttataggaaagttgcagtaaagattcattcgattggacaatgttga >gi568815594r:176583929_176892311|GENSCAN_predicted_peptide_2|219_aa MECYTVVKKSEIMSFAATWMQLDAIILSELTQKQKTKYTLEPFSPPLRCGSPFLGWPRPE LAPSACREVWRERREQEPGLCAVLVDQLEFRVGVGLAGPHSEQPAGPVAPGNEGLSTRAS GCGGCTGSPNSASPPALRSISHRALAAFLRGRARDLQPAMPEPPTPSMDSFAAGASLMSA APCSKAPSPINHPRAEGCGVTARDWPASSTCIPGAGSRG >gi568815594r:176583929_176892311|GENSCAN_predicted_CDS_2|660_bp atggaatgctacacagtcgtaaaaaagagtgaaatcatgtcctttgcagcaacatggatg cagctggatgccattatcctaagtgaactaacacagaaacagaaaaccaaatacacactt gagcccttcagcccgccgctgcgctgtgggagcccctttctgggctggccaaggccggag ctggctccctcagcttgcagggaggtgtggagggagaggcgcgagcaggaaccggggctg tgcgcggtgcttgtggaccagctggagttccgggtgggcgtgggcttggcgggcccgcac tcggagcagccggccggccctgtcgccccgggcaatgaggggcttagcacccgggccagc ggctgtggagggtgtaccgggtcccccaacagtgccagcccaccggcgctgcgctcgatt tctcaccgggccttagctgcctttctgcggggcagggctcgggacctgcagcccgccatg cctgagcctcccaccccctccatggactcctttgcggccggagcctccctgatgagcgcc gccccctgctccaaggcgcccagtcccatcaaccacccaagggctgaggggtgcggagtc acagcgcgggactggccggccagctccacctgcatccctggtgcgggatccagggggtga >gi568815594r:176583929_176892311|GENSCAN_predicted_peptide_3|127_aa MTRVEQRAQDEYKSCGEELSGCGSKPYQGTVSYIVAHKGMTSRIWVWKLCPTALLVQAFF LVGAQQLLICNMTLGHGALSSQVLITLGKPLNLQGELEAVMGSHEDLTPESHEGLTPPSQ QLTSEHP >gi568815594r:176583929_176892311|GENSCAN_predicted_CDS_3|384_bp atgactcgggtggaacaaagagcccaggatgaatataagagctgcggagaagaactctca ggttgtgggtctaagccttatcaaggaactgtgagctacatcgtcgcccataagggtatg actagtaggatttgggtctggaagctctgcccaacagctctactggtccaggcatttttc cttgttggtgctcagcaacttctcatctgcaacatgactttgggccatggtgccctgagc tcacaggtcctgataacattaggaaagcccctgaatcttcagggagagcttgaagctgtg atgggatcacatgaagatttgactccagagtcacatgaaggtttgactccaccttcacag cagctgacctctgagcacccataa >gi568815594r:176583929_176892311|GENSCAN_predicted_peptide_4|235_aa MADARCGGENEQAELVIYVHPREGSCHVGTNREGSQVPVTPLLNLVCFFQRCQAANKTCP TNYMWNNHICRCLAQEDFMFSSDAGDDSTDGFHDICGPNKELDEETCQCVCRAGLRPASC GPHKELDRNSCQCVCKNKLFPSQCGANREFDENTCQCVCKRTCPRNQPLNPGKCACECTE SPQKCLLKGKKFHHQTCSCYRRPCTNRQKACEPGFSYSEEVCRCVPSYWKRPQMS >gi568815594r:176583929_176892311|GENSCAN_predicted_CDS_4|708_bp atggctgatgccaggtgtggaggagaaaatgagcaagctgagcttgtgatatatgttcac cccagagagggctcatgtcacgtgggcaccaacagagaggggtctcaggtaccagttaca ccattgctaaacctcgtctgtttcttccaaaggtgtcaggcagcgaacaagacctgcccc accaattacatgtggaataatcacatctgcagatgcctggctcaggaagattttatgttt tcctcggatgctggagatgactcaacagatggattccatgacatctgtggaccaaacaag gagctggatgaagagacctgtcagtgtgtctgcagagcggggcttcggcctgccagctgt ggaccccacaaagaactagacagaaactcatgccagtgtgtctgtaaaaacaaactcttc cccagccaatgtggggccaaccgagaatttgatgaaaacacatgccagtgtgtatgtaaa agaacctgccccagaaatcaacccctaaatcctggaaaatgtgcctgtgaatgtacagaa agtccacagaaatgcttgttaaaaggaaagaagttccaccaccaaacatgcagctgttac agacggccatgtacgaaccgccagaaggcttgtgagccaggattttcatatagtgaagaa gtgtgtcgttgtgtcccttcatattggaaaagaccacaaatgagctaa >gi568815594r:176583929_176892311|GENSCAN_predicted_peptide_5|631_aa MGKKQNRKTGKSKKQSASPPPKKHSSLPAMEQSWMENNFDELREEGFRRSNYSELQEDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCNQLEERVSA MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLCLIGVPESDGENGTKLENTLQD IIQENFPNLARQANVQIQEIQRMPQRYSSRRATPRHITVRFTKVEMKEKMLRAAREKEIQ TTIREYYKHLYANKLENLEEMNKFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTK KSPGPDGFTAQFYQRYKEELHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDG TYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKE IKGIQLGKEEVKLSLFADNMIVYLENPIVSAKNLLKLISNFSKVSRYKINVQKSQALLYT NNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPC SWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRACITKSTLSQK NKAGGITLPDFKLYHKATVTKTAWYWFKTEI >gi568815594r:176583929_176892311|GENSCAN_predicted_CDS_5|1896_bp atggggaaaaaacagaacagaaaaactggaaagtctaaaaagcagagtgcctctcctcct ccaaagaaacacagttccttaccagcaatggaacaaagctggatggagaataactttgac gagctgcgagaagaaggcttcagacgatcaaattactctgagctacaggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcaatcaactggaagaaagagtatcagca atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctatgtctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataactgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaagaaatacaa actaccatcagagaatactacaaacacctctatgcaaataaactagaaaatctagaagaa atgaataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggatctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagacggattcacagcccaattctaccagaggtacaaggaggaactg catataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaaaaagcc tttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtattgatggg acatatttcaaaataataagagctatctatgacaaacccacagccaatatcatactgaat gggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctctca ccactactattcaacatagtgttggaagttctggccagggcaattaggcaggagaaggaa ataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagacaacatg attgtatatctagaaaaccccattgtctcagccaaaaatctccttaagctgataagcaac ttcagcaaagtctccagatacaaaatcaatgtacaaaaatcacaagcactcttatacacc aacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaag agaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaac tacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgc tcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttacaga ttcaatgccatccccatcaagctaccaatgactttcttcacagaactggaaaaaactact ttaaagttcatatggaaccaaaaaagagcctgcatcaccaagtcaaccctaagccaaaag aacaaagctggaggcatcaccctacctgacttcaaactataccacaaggctacagtaacc aaaacagcatggtactggttcaaaacagagatatag >gi568815594r:176583929_176892311|GENSCAN_predicted_peptide_6|231_aa MSTLNEKRKCLRIETWGTGTSMRWEEAKELVNMSGKKHNNDGASGGETVKPTGARERGES FKQRPEVHTISNNTFMVIASTYACSTASLLYHDTEIIKRKCFQADGRDQRRDEGMEYKMK QQRVLEFRVGRSHQPGRKSGPRQRKLRIHSREVKIRTSETGKQNAALIQVEDDGGSGQRC SKWLDSGYIFEDTAKRIGFPDESECRVGKKEVKDDTMNFDLTNSEQFVATD >gi568815594r:176583929_176892311|GENSCAN_predicted_CDS_6|696_bp atgagcactctaaatgaaaagagaaaatgtctaagaatagaaacctggggcacaggaaca tctatgaggtgggaagaggcaaaggaactagtgaatatgtctgggaagaagcataacaat gatggagcatcaggaggggagactgtcaagccaacgggtgccagagaaagaggggagagc ttcaaacagaggccagaggtgcacacaatttcaaacaacactttcatggttatcgccagc acatacgcgtgttccacagccagcctcctctaccacgacacagaaataataaaaaggaag tgctttcaggcagatggaagggatcagagaagagacgaaggcatggaatacaaaatgaaa cagcagagagtgctagaattcagagttggaagaagccatcagcctggaagaaaatctggc ccaagacagagaaagttgagaatacactctagggaggtgaagataagaaccagtgagact ggtaaacagaacgctgccctgatccaagtggaagacgatggtggctcaggtcagagatgt agcaagtggttagactctggttatatttttgaagacacagccaagaggattggattccct gatgagtctgaatgtagggtaggaaagaaagaagtcaaggatgataccatgaattttgac ttgaccaactcggagcaatttgttgccactgactga >gi568815594r:176583929_176892311|GENSCAN_predicted_peptide_7|110_aa MHLLGFFSVACSLLAAALLPGPREAPAAAAAFESGLDLSDAEPDAGEATLIHGTPNRFWD TGDQSTQPYFPGVFIVCRAPGAVSDFRKKVSGTLVTRLQSFQSSLEVGQT >gi568815594r:176583929_176892311|GENSCAN_predicted_CDS_7|333_bp atgcacttgctgggcttcttctctgtggcgtgttctctgctcgccgctgcgctgctcccg ggtcctcgcgaggcgcccgccgccgccgccgccttcgagtccggactcgacctctcggac gcggagcccgacgcgggcgaggccacgctgattcatgggactccaaacagattctgggac actggtgatcagtcaacccagccttactttcctggagtgttcatagtctgcagagcacca ggcgctgtgagcgactttagaaaaaaagtgtcagggactttagtaaccaggctccagagc tttcagagttcacttgaagttggtcagacttga >gi568815594r:176583929_176892311|GENSCAN_predicted_peptide_8|136_aa MVWRFLKKQKIELLYDPVIPLLGIYPKELKSIYQQDVRSLVFIAASFTIAKIQEQSSIAA NMESKENQNFRCENVLHLFHLHLTLHHKSRTAHPEPESSLCPEYPCSIYCLSVSHLVTAL TVRSTVGSIIAVLVLK >gi568815594r:176583929_176892311|GENSCAN_predicted_CDS_8|411_bp atggtatggaggttcctcaaaaagcaaaaaatagaactactctatgacccagtaatcccg cttctgggtatctatccaaaggagctgaaatcaatatatcaacaggatgtccgttctctc gtgtttattgcagcctcattcacaatagccaagatacaggaacaatcttcaattgcagcc aacatggaatccaaagagaaccagaactttcgttgtgagaatgttttacacttattccat ttacacttaacacttcatcataaaagtagaacagcccacccagaaccagaatcatccctt tgtccagagtacccatgctctatatactgcctgtctgttagtcacttagtgactgctttg actgtcagatcaactgtcggcagtatcatcgcagtgcttgtgttaaagtaa >gi568815594r:176583929_176892311|GENSCAN_predicted_peptide_9|110_aa MGKDFMTKTPKAIATKAKIDKWDLIKLESFCRAKETIISMNRQHREWEKIFAIYPSDKGL ISGLYRELKQIYKKTNNPIKKWVKDMNRHFLKEDIYVANKHEKKLFITGH >gi568815594r:176583929_176892311|GENSCAN_predicted_CDS_9|333_bp atgggcaaagacttcatgactaaaacaccaaaagcaattgcaacaaaagccaaaattgac aaatgggatctaattaaactagagagcttctgcagagcaaaagaaactatcatcagcatg aacaggcaacatagagaatgggagaaaatttttgcaatctatccatctgacaaaggtcta atatccggactctacagggaacttaaacaaatttacaagaaaacaaacaaccccatcaaa aagtgggtgaaggacatgaacagacatttcttaaaagaagacatttatgtggccaataaa catgaaaaaaagctcttcatcactggtcattag