GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:49:09

Sequence gi568815593f:147964045_148233888 : 269844 bp : 35.97% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    707    702    6                               1.05
 1.03 Term -  18225  18011  215  1  2   95   42    82 0.481   0.71
 1.02 Intr -  21789  21680  110  2  2   74   66    73 0.365   2.71
 1.01 Init -  32957  32875   83  2  2   43   48   119 0.470   3.89
 1.00 Prom -  33016  32977   40                              -3.05

 2.00 Prom +  37049  37088   40                              -6.45
 2.01 Init +  41267  41321   55  1  1   33   99    44 0.237   1.50
 2.02 Intr +  41826  41932  107  1  2   49   60   104 0.768   2.71
 2.03 Term +  42085  42165   81  0  0  131   55    76 0.814   5.31
 2.04 PlyA +  42506  42511    6                               1.05

 3.00 Prom +  43712  43751   40                              -6.75
 3.01 Init +  45076  45171   96  0  0   96   72    25 0.619   2.23
 3.02 Intr +  47563  47680  118  0  1   89   99    71 0.797   7.42
 3.03 Intr +  58382  60483 2102  0  2   34   53   755 0.001  52.97
 3.04 Intr +  70102  70206  105  0  0   76   84    51 0.005   2.99
 3.05 Term +  92019  92120  102  2  0   56   52   128 0.304   3.30
 3.06 PlyA +  93336  93341    6                               1.05

 4.00 Prom +  96256  96295   40                              -4.55
 4.01 Init + 107811 107869   59  2  2   65   61    77 0.539   3.53
 4.02 Intr + 108104 108176   73  2  1  118   78    45 0.561   4.99
 4.03 Intr + 122361 122488  128  2  2  108   66    57 0.921   4.06
 4.04 Intr + 124498 124561   64  1  1   87   94    52 0.873   3.50
 4.05 Intr + 125450 125577  128  1  2  104   81   123 0.974  11.76
 4.06 Intr + 127121 127184   64  2  1   96   93   115 0.996  10.60
 4.07 Intr + 130310 130437  128  1  2   49   59   109 0.929   2.66
 4.08 Intr + 135190 135271   82  1  1   90   73    59 0.952   3.32
 4.09 Intr + 136410 136537  128  2  2   89   70   104 0.989   7.26
 4.10 Intr + 137311 137392   82  1  1   63  105    58 0.913   3.72
 4.11 Intr + 137737 137864  128  0  2   79   70    89 0.854   4.76
 4.12 Intr + 140908 140956   49  1  1   44   85    61 0.827  -0.84
 4.13 Intr + 142643 142663   21  1  0  103  115    12 0.810   2.22
 4.14 Intr + 142993 143120  128  0  2   67   62   162 0.982  10.06
 4.15 Intr + 144709 144793   85  1  1  100   68   123 0.999  10.40
 4.16 Intr + 147724 147851  128  0  2   85   70   151 0.976  11.56
 4.17 Intr + 148824 148890   67  0  1   82   62    59 0.977   0.69
 4.18 Intr + 150318 150445  128  2  2  125   70   126 0.979  13.06
 4.19 Intr + 152326 152422   97  1  1   78   94   109 0.986   9.59
 4.20 Intr + 154393 154520  128  0  2  120   87   179 0.999  19.56
 4.21 Intr + 154942 155014   73  1  1  103  113    30 0.520   5.49
 4.22 Intr + 155965 156092  128  0  2   81   74   109 0.536   7.36
 4.23 Intr + 156251 156347   97  2  1  104   75    41 0.842   3.49
 4.24 Intr + 159789 159920  132  2  0   67   18   121 0.374   2.92
 4.25 Intr + 161480 161569   90  1  0   77   79    52 0.647   2.47
 4.26 Intr + 161679 161806  128  2  2  122   70    69 0.996   7.06
 4.27 Intr + 162939 163035   97  0  1   97   95    86 0.999   9.29
 4.28 Intr + 163836 163853   18  2  0  126  101    26 0.923   3.59
 4.29 Intr + 167215 167345  131  0  2   94   95    63 0.995   6.17
 4.30 Intr + 169753 169843   91  1  1   82   84    81 0.961   6.18
 4.31 Term + 175458 175571  114  2  0   75   42    67 0.342  -1.61
 4.32 PlyA + 175628 175633    6                               1.05

 5.02 PlyA - 176723 176718    6                               1.05
 5.01 Sngl - 190530 190060  471  0  0   66   40   221 0.502  10.97
 5.00 Prom - 194036 193997   40                              -7.15

 6.04 PlyA - 195059 195054    6                               1.05
 6.03 Term - 198224 198091  134  0  2   78   38   109 0.192   2.17
 6.02 Intr - 257716 257594  123  2  0   69   82    75 0.256   4.64
 6.01 Init - 266744 266552  193  2  1   68   67   184 0.505  11.48

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  58450  60615 2166  0  0   44   42   763 0.827  61.11
S.002 Sngl + 267463 267756  294  0  0   88   54   228 0.915  14.85

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:147964045_148233888|GENSCAN_predicted_peptide_1|135_aa
MNTSVEEDTSGWSSRASWQKSTQTATSSSVLVIHPLLSDVVPSALAHRILCAMGIFQSAP
VSTLGPECNLYSSSDPHLRLETGLYYHNTNRGCPHSGPKPGDQEQARSKLKGAAEGTLNL
CFPRPTLMMDGGQVI

>gi568815593f:147964045_148233888|GENSCAN_predicted_CDS_1|408_bp
atgaacacatccgtagaagaagatacaagcggctggtcgtcgagagccagctggcagaag
agcacgcagacagccaccagtagctctgtgctcgtcatccacccattgctttctgatgtt
gtgccatctgctttagctcaccgcattctctgtgccatgggaatcttccaatcagctcct
gtgagtactctcgggccagaatgcaatctctattcctcaagtgatcctcatttgagattg
gagactggattgtattatcacaatactaacagaggctgccctcattcaggccctaagcca
ggagatcaagagcaagcaagatccaaactcaaaggagctgctgaaggaacactaaatctc
tgctttcccaggcccaccctcatgatggatggaggccaggttatttga

>gi568815593f:147964045_148233888|GENSCAN_predicted_peptide_2|80_aa
MDLRGPEADDMEVKRHSTDASSKLEGVNFPNLKLVGLGLGSGEANTEAQHAGNRDIKLQA
VMQLEPLTMAPSGRNAYIGL

>gi568815593f:147964045_148233888|GENSCAN_predicted_CDS_2|243_bp
atggatctgaggggtccagaggcagatgatatggaggttaaaaggcacagcacagatgca
tcctccaaactggaaggagttaatttcccaaaccttaaactggttggtttaggattgggc
tcaggggaagcgaacacagaagcccaacatgccggcaacagggacatcaaactccaagca
gtcatgcaactggagcctctgacgatggctccttctggcaggaacgcttacataggcctc
tga

>gi568815593f:147964045_148233888|GENSCAN_predicted_peptide_3|840_aa
MGETTPMIQSPPTWSLPRHVGIMGITIRDEIWPCASREAGLTRSSRLDLSDVREILLTLP
VISLELPKLNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSMRQKVNKDTQEL
NSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCKRTQIITNYLS
DHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNENKDTTYQNLW
DAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQEITKIRAELK
EIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDAIKNDKGDITTDPTEIQ
TTIREYYKYLYTNKLENLEEMDKFLDTYTLQRLNQEEVESLNRPITGSEIVAIINSLPTK
KSPGPDGFTAESYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKE
NFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINR
AKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKL
EAFPLKTGTRQGCPLSPLLFNIVLEVLARGIRQEKEIKGIQLGKEEVKLSLFADDMIVYL
ENPIVSAQNVLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKY
LRIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPRNKLNSVTL
LSSDPQPPGPQTGIGPWPARNWAAQQRPCVEDTPVAEGGHGLGYGFRGCKPEALTASTWC

>gi568815593f:147964045_148233888|GENSCAN_predicted_CDS_3|2523_bp
atgggggaaaccacccccatgatccaatcacctcccacctggtccctccctcgacatgtg
gggattatggggattacaattcgagatgagatttggccctgtgcttccagagaagctggt
cttaccagaagctccaggctggacctgagtgatgtgagggaaattctactgactttgcca
gtgattagcttagaattacctaaactcaacacaggagcacccagattcataaagcaagtc
ctgagtgacctacaaagagacttagactcccacacattaataatgggagactttaacacc
ccactgtcaacattagacagatcaatgagacagaaagtcaacaaggatacccaggaattg
aactcagctctgcaccaagcagacctaatagacatctacagaactctccaccccaaatca
acagaatatacatttttttcagcaccacaccacacctattccaaaattgaccacatagtt
ggaagtaaagctctcctcagcaaatgtaaaagaacacaaattataacaaactatctctca
gaccacagtgcaatcaaactagaactcaggattaagaatctcactcaaaaccgctcaaca
acatggaaactgaacaacctgctcctgaatgactactgggtacataacgaaatgaaggca
gaaataaagatgttctttgaaaccaacgagaacaaagacacaacataccagaatctctgg
gacgcattcaaagcagtgtgtagagggaaatttatagcactaaatgcccacaagagaaag
caggaaagatccaaaattgacaccctaacatcacaattaaaagaactagaaaagcaagag
caaacacattcaaaagctagcagaaggcaagaaataactaaaatcagagcagaactgaag
gaaatagagacacaaaaaacccttcaaaaaattaatgaatccaggagctggttttttgaa
aggatcaacaaaattgatagaccgctagcaagactaataaagaaaaaaagagagaagaat
caaatagacgcaataaaaaatgataaaggggatatcaccaccgatcccacagaaatacaa
actaccatcagagaatactacaaatacctctacacaaataaactagaaaatctagaagaa
atggataaattcctcgacacatacaccctccaaagactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggatctgaaattgtggcaataatcaatagcttaccaaccaaa
aagagtccaggaccagatggattcacagccgaatcctaccagaggtacaaggaggaactg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca
ttttatgaggccagcatcattctgataccaaagccgggcagagacacaaccaaaaaagag
aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca
aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct
gggatgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacaga
gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt
caacaacccttcatgctaaaaactctcaataaattaggtattgatgggacatatttcaaa
ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg
gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc
aacatagtgttggaagttctggccaggggaattaggcaggagaaggaaataaagggtatt
caattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatcta
gaaaaccccattgtctcagcccaaaatgtccttaagctgataagcaacttcagcaaagtc
tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaa
acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac
ctaagaatccagcttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg
ctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtagga
agaatcaatatcgtgaaaatggccatactgcccaggaataaactaaatagtgtaacactt
ttgagcagtgatccccaacccccggggccacagactggtattggcccatggcctgctagg
aactgggctgcacagcagaggccctgcgttgaagacactccagtggctgaagggggccat
ggcttgggctatggcttccgagggtgcaagcctgaagccttgacagcttccacatggtgt
tga

>gi568815593f:147964045_148233888|GENSCAN_predicted_peptide_4|997_aa
MVQIALRVGDHLAVGSIPGREKEAKSQKRARHLARAPKATAPTELNCDDFKKGERDGDFI
CPDYYEAVCGTDGKTYDNRCALCAENAKTGSQIGVKSEGECKSSNPEQDVCSAFRPFVRD
GRLGCTRENDPVLGPDGKTHGNKCAMCAELFLKEAENAKREGETRIRRNAEKDFCKEYEK
QVRNGRLFCTRESDPVRGPDGRMHGNKCALCAEIFQAENEEKKKAEARARNKRESGKATS
YAELCSEYRKLVRNGKLACTRENDPIQGPDGKVHGNTCSMCEVFFQAEEEEKKKKEGKSR
NKRQSKSTASFEELCSEYRKSRKNGRLFCTRENDPIQGPDGKMHGNTCSMCEAFFQQEER
ARAKAKREAAKTPKDKSQEICSEFRDQVRNGTLICTREHNPVRGPDGKMHGNKCAMCASV
FKLEEEEKKNDKEEKGKVEAEKVKREAVQELCSEYRHYVRNGRLPCTRENDPIEGLDGKI
HGNTCSMCEAFFQQEAKEKERAEPRAKVKREAEKETCDEFRRLLQNGKLFCTRENDPVRG
PDGKTHGNKCAMCKAVFQKENEERKRKEEEDQRNAAGHGSSGGGGGNTQDECAEYREQMK
NGRLSCTRESDPVRDADGKSYNNQCTMCKAKLEREAERKNEYSRSRSNGTGSESGKDTCD
EFRSQMKNGKLICTRESDPVRGPDGKTHGNKCTMCKEKLEREAAEKKKKEDEDRSNTGER
SNTGERSNDKEDLCREFRSMQRNGKLICTRENNPVRGPYGKMHINKCAMCQSILYDQCRQ
VQNEAEDAKFRQPGRSLASVARMSTDECSEFRNYIRNNELICPRENDPVHGADGKFYTNK
CYMCRAVFLTEALERAKLQEKPSHVRASQEEDSPDSFSSLLMVTSNDSEMCKDYRVLPRI
GYLCPKDLKPVCGDDGQTYNNPCMLCHENLIRQTNTHIRSTGKCEESSTPGTTAASMPPS
ASEMPKDVEKRARLHTLDENINYFKLCGKQYVDISKN

>gi568815593f:147964045_148233888|GENSCAN_predicted_CDS_4|2994_bp
atggtgcagatagctttgagagttggtgaccacctggctgttggaagtattccaggaagg
gaaaaagaagcaaaatcacagaagagggccaggcatttagcaagagctcccaaggctact
gccccaacagagctgaattgtgatgattttaaaaaaggagaaagagatggggattttatc
tgtcctgattattatgaagctgtttgtggcacagatgggaaaacatatgacaacagatgt
gcactgtgtgctgagaatgcgaaaaccgggtcccaaattggtgtaaaaagtgaaggggaa
tgtaagagcagtaatccagagcaggatgtatgcagtgcttttcggccctttgttagagat
ggaagacttggatgcacaagggaaaatgatcctgttcttggtcctgatgggaagacgcat
ggcaataagtgtgcaatgtgtgctgagctgtttttaaaagaagctgaaaatgccaagcga
gagggtgaaactagaattcgacgaaatgctgaaaaggatttttgcaaggaatatgaaaaa
caagtgagaaatggaaggcttttttgtacacgggagagtgatccagtccgtggccctgac
ggcaggatgcatggcaacaaatgtgccctgtgtgctgaaattttccaagcagaaaatgaa
gaaaagaaaaaggctgaagcacgagctagaaacaaaagagaatctggaaaagcaacctca
tatgcagagctttgcagtgaatatcgaaagcttgtgaggaacggaaaacttgcttgcacc
agagagaacgatcctatccagggcccagatgggaaagtgcatggcaacacctgctccatg
tgtgaggtcttcttccaagcagaagaagaagaaaagaaaaagaaggaaggtaaatcaaga
aacaaaagacaatctaagagtacagcttcctttgaggagttgtgtagtgaataccgcaaa
tccaggaaaaacggacggcttttttgcaccagagagaatgaccccatccagggcccagat
ggaaaaatgcatggcaacacctgctccatgtgtgaggccttctttcaacaagaagaaaga
gcaagagcaaaggctaaaagagaagctgcaaagacccctaaagacaaatcacaggaaatc
tgcagtgaatttcgggaccaagtgaggaatggaacacttatatgcaccagggagcataat
cctgtccgtggcccagatggcaaaatgcatggaaacaagtgtgccatgtgtgccagtgtg
ttcaaacttgaagaagaagagaagaaaaatgataaagaagaaaaagggaaagtcgaggct
gaaaaagttaagagagaagcagttcaggagctgtgcagtgaatatcgtcattatgtgagg
aatggacgactcccctgtaccagagagaatgatcctattgagggtctagatgggaaaatc
cacggcaacacctgctccatgtgtgaagccttcttccagcaagaagcaaaagaaaaagaa
agagctgaacccagagcaaaagtcaaaagagaagctgaaaaggagacatgcgatgaattt
cggagacttttgcaaaatggaaaacttttctgcacaagagaaaatgatcctgtgcgtggc
ccagatggcaagacccatggcaacaagtgtgccatgtgtaaggcagtcttccagaaagaa
aatgaggaaagaaagaggaaagaagaggaagatcagagaaatgctgcaggacatggttcc
agtggtggtggaggaggaaacactcaggacgaatgtgctgagtatcgggaacaaatgaaa
aatggaagactcagctgtactcgggagagtgatcctgtacgtgatgctgatggcaaatcg
tacaacaatcagtgtaccatgtgtaaagcaaaattggaaagagaagcagagagaaaaaat
gagtattctcgctccagatcaaatgggactggatcagaatcagggaaggatacatgtgat
gagtttagaagccaaatgaaaaatggaaaactcatctgcactcgagaaagtgaccctgtc
cggggtccagatggcaagacacatggcaataagtgtactatgtgtaaggaaaaactggaa
agggaagcagctgaaaaaaaaaagaaagaggatgaagacaggagcaatacaggagaaagg
agcaatacaggagaaaggagcaatgacaaagaggatctgtgtcgtgaatttcgaagcatg
cagagaaatggaaagcttatctgcaccagagaaaataaccctgttcgaggcccatatggc
aagatgcacatcaataaatgtgctatgtgtcagagcatcttgtacgaccagtgcagacag
gttcagaatgaagcggaggatgcaaaatttagacaacctgggcgttccttggcctctgtt
gccaggatgagtacagatgagtgcagtgaatttcgaaactatataaggaacaatgaactc
atctgccctagagagaatgacccagtgcacggtgctgatggaaagttctatacaaacaag
tgctacatgtgcagagctgtctttctaacagaagctttggaaagggcaaagcttcaagaa
aagccatcccatgttagagcttctcaagaggaagacagcccagactctttcagttctctg
ctgatggtcaccagtaatgattctgagatgtgcaaagactaccgagtattgcccaggata
ggttatctttgtccaaaggatttaaagcctgtctgtggtgacgatggccaaacctacaac
aatccttgcatgctctgtcatgaaaacctgatacgccaaacaaatacacacatccgcagt
acagggaagtgtgaggagagcagcaccccaggaaccaccgcagccagcatgcccccgtct
gcttcagaaatgcctaaggatgtggagaaaagagcacgtttacatactttagatgaaaat
ataaattattttaaactctgtggaaaacagtacgtagacatctcaaaaaactaa

>gi568815593f:147964045_148233888|GENSCAN_predicted_peptide_5|156_aa
MVLGDTNKAMWGMAVDPGETAVAEGIRQAGACLQRLLCWSGAVCQCKSYDEGRQGTKACT
ANRYSKAGTPREASRLIGAQVRLALSDEQDLPAEFRSGSSPRAKVSYRRKLSLGGWASLD
KLHYRCSHTKPSGLYTKWSASPNTSLSSSSCQLKCP

>gi568815593f:147964045_148233888|GENSCAN_predicted_CDS_5|471_bp
atggtgctaggggatacgaataaggcaatgtggggtatggctgtggaccctggggaaact
gcagtggcagaagggatcaggcaggctggtgcatgtctacagagactgctctgctggtca
ggtgcagtctgccagtgcaagagctatgatgagggtcgccagggcaccaaagcctgcact
gcaaacagatacagcaaggctgggaccccaagagaggccagcagactaataggtgctcag
gtcagactggccctgtctgatgagcaagacctccctgcagagtttaggtctggcagttcc
cccagggctaaagtctcctacagaagaaagttgagcctagggggatgggcatccctggac
aagctccactacagatgctcccacaccaaaccctctgggctctacaccaaatggagtgct
tctcctaatacttcactaagcagctcttcctgccaactcaagtgtccatag

>gi568815593f:147964045_148233888|GENSCAN_predicted_peptide_6|149_aa
MPLPASARARCADPLTCAHCLALPSEMNLVPQMEMQKSPVFCVNHAGSCRPELFLFGHLG
SAKIVPSSAGGVAVAGQLSFTKAFGALGAAGTQHLSSQRQRCLKEVSLSVSVSLCYSPLR
RLSTKAEIVLDITVSIEPGTRSDNSGGKY

>gi568815593f:147964045_148233888|GENSCAN_predicted_CDS_6|450_bp
atgcctctccctgcttcggctcgtgcacggtgcgcggacccactgacctgcgcccactgt
ctggcacttcctagtgagatgaacctggtacctcagatggaaatgcagaaatcacccgtc
ttctgcgtcaatcacgctgggagctgcagaccggagctgttcctattcggccatcttggc
tccgccaaaatagttccttcgtctgcaggaggagtggcagtggccgggcagctcagcttc
accaaggcttttggtgcgctgggagctgcaggcacccagcacttgtcctcccaaaggcaa
agatgcctaaaagaagtctccctgtctgtttctgtgtctctatgttactcacccctaaga
aggctatccacaaaggcagaaattgtcctggacatcactgtatccatagaacctggtaca
agatctgacaacagtggtggcaaatactaa