GENSCAN 1.0 Date run: 4-Nov-116 Time: 13:53:33 Sequence gi568815591r:151367122_151619511 : 252390 bp : 47.16% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 740 847 108 1 0 53 116 63 0.907 6.08 1.02 Intr + 1614 1766 153 2 0 101 65 67 0.868 5.97 1.03 Intr + 6976 7122 147 0 0 59 64 187 0.916 13.83 1.04 Intr + 8727 8822 96 2 0 61 72 74 0.803 3.31 1.05 Intr + 9513 9690 178 2 1 94 96 203 0.999 21.19 1.06 Term + 9926 10104 179 0 2 84 36 200 0.956 12.25 1.07 PlyA + 11301 11306 6 1.05 2.19 PlyA - 11353 11348 6 -1.75 2.18 Term - 13046 12557 490 1 1 61 43 196 0.612 6.53 2.17 Intr - 13454 13156 299 2 2 37 91 240 0.607 14.87 2.16 Intr - 14625 14497 129 0 0 108 86 207 0.999 23.39 2.15 Intr - 14860 14757 104 2 2 51 96 206 0.998 17.59 2.14 Intr - 18102 17967 136 0 1 132 95 234 0.136 28.64 2.13 Intr - 24985 24845 141 1 0 47 88 75 0.501 3.95 2.12 Intr - 29075 28655 421 1 1 147 78 601 0.891 58.95 2.11 Intr - 33120 32979 142 1 1 112 76 222 0.992 22.81 2.10 Intr - 34469 34373 97 2 1 78 94 37 0.404 2.88 2.09 Intr - 36089 35974 116 0 2 66 23 85 0.209 -0.03 2.08 Intr - 41932 41836 97 1 1 107 50 69 0.906 4.58 2.07 Intr - 42266 42124 143 0 2 104 72 17 0.970 1.77 2.06 Intr - 42592 42306 287 0 2 105 69 248 0.555 21.59 2.05 Intr - 44887 44655 233 1 2 43 82 59 0.053 -2.63 2.04 Intr - 65164 65047 118 0 1 112 30 219 0.242 18.97 2.03 Intr - 69204 69059 146 0 2 -6 75 218 0.974 10.18 2.02 Intr - 71123 70875 249 2 0 114 101 401 0.992 41.63 2.01 Init - 72796 72776 21 1 0 114 113 7 0.938 5.57 2.00 Prom - 88586 88547 40 -2.96 3.04 PlyA - 88596 88591 6 1.05 3.03 Term - 92734 92609 126 1 0 48 39 134 0.512 2.78 3.02 Intr - 93097 92955 143 2 2 76 37 78 0.511 1.57 3.01 Init - 93847 93664 184 1 1 52 8 203 0.634 6.44 3.00 Prom - 95622 95583 40 -1.86 4.08 PlyA - 98914 98909 6 1.05 4.07 Term - 100090 99998 93 1 0 127 37 90 0.791 5.63 4.06 Intr - 103531 103450 82 0 1 71 92 68 0.706 5.14 4.05 Intr - 104484 104428 57 2 0 81 98 22 0.426 0.50 4.04 Intr - 110294 110212 83 2 2 110 116 22 0.651 5.64 4.03 Intr - 117683 117616 68 0 2 110 108 30 0.899 5.82 4.02 Intr - 123893 123822 72 0 0 85 98 79 0.159 7.98 4.01 Init - 152390 152339 52 2 1 104 92 79 0.453 11.44 4.00 Prom - 164074 164035 40 -3.86 5.00 Prom + 166907 166946 40 -4.26 5.01 Init + 172264 172466 203 0 2 107 86 95 0.943 9.55 5.02 Intr + 175680 175784 105 0 0 57 37 113 0.619 2.43 5.03 Term + 186354 186465 112 0 1 103 54 77 0.353 3.83 5.04 PlyA + 188773 188778 6 1.05 6.14 PlyA - 188910 188905 6 1.05 6.13 Term - 189790 189606 185 2 2 56 33 101 0.633 -0.89 6.12 Intr - 192702 192601 102 1 0 107 78 8 0.696 1.85 6.11 Intr - 193496 193403 94 2 1 159 98 116 0.971 19.24 6.10 Intr - 197103 196957 147 0 0 91 78 135 0.719 13.23 6.09 Intr - 198262 198225 38 2 2 99 87 16 0.994 0.58 6.08 Intr - 198764 198599 166 2 1 57 72 211 0.984 15.93 6.07 Intr - 201721 201595 127 0 1 84 78 88 0.978 8.08 6.06 Intr - 203104 203050 55 2 1 62 92 37 0.646 -0.46 6.05 Intr - 209331 209250 82 0 1 112 87 41 0.484 5.61 6.04 Intr - 228333 228224 110 1 2 107 108 55 0.216 9.40 6.03 Intr - 237746 237625 122 1 2 16 41 120 0.001 0.24 6.02 Intr - 242611 242556 56 1 2 68 81 51 0.049 0.18 6.01 Intr - 245698 245565 134 2 2 97 105 31 0.693 6.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 18166 17967 200 0 2 90 95 287 0.863 28.59 S.002 Term + 136045 136342 298 1 1 35 42 237 0.851 8.54 S.003 Term - 161869 161760 110 2 2 77 48 98 0.944 3.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:151367122_151619511|GENSCAN_predicted_peptide_1|286_aa GNCGKEKVLFLRLYLLQGIRNYHSGNDVEAYEYLNKARQLFKELYIDPSKVDNLLQLGFT AQEARLGLRACDGNVDHAATHITNRREELAQIRKEEKEKKRRRLENIRFLKGMGYSTHAA QQVLHAASGNLDEALKILLSNPQMWWLNDSNPETDNRQESPSQENIDRLVYMGFDALVAE AALRVFRGNVQLAAQTLAHNGGSLPPELPLSPEDSLSPPATSPSDSAGTSSASTDEDMET EAVNEILEDIPEHEEDYLDSTLEDEEIIIAEYLSYVENRKSATKKN >gi568815591r:151367122_151619511|GENSCAN_predicted_CDS_1|861_bp ggaaattgtgggaaagagaaggtactgtttctaagactctacttacttcaagggatccga aactatcacagtggaaatgatgtagaggcttatgagtatcttaacaaggcacgtcagctc tttaaagagctatatattgatccatcaaaagtggacaatttgttgcagttggggtttact gcccaggaagcccggcttggcctgagggcgtgtgatgggaacgtggatcatgcggccact catattaccaaccgcagagaggaactggcccaaataaggaaggaggaaaaagagaagaaa agacgccgcctcgagaacatcaggtttctgaaagggatgggctactccacgcacgcggcc cagcaggtactccacgcagccagcgggaacttggatgaggccctgaagattctgctcagc aatcctcagatgtggtggttaaatgattccaatcctgaaaccgacaaccgtcaagaaagt ccttcccaggaaaacattgaccgattggtgtacatgggttttgatgcactcgtggccgaa gctgcgctgagagtgttcagaggcaacgtccagctggccgcccagacccttgctcacaac ggaggaagcctgcctcccgagctgccgctgtcgccagaagactctttgtccccgccagcc acgtccccttctgactccgcaggaacctctagtgcctcaacagacgaagacatggagaca gaggccgtcaatgagatactggaagacattccagagcatgaggaagactatcttgactca actctggaagatgaagaaattattattgcagagtacctatcctatgtagaaaataggaag tcagcaacaaagaaaaactaa >gi568815591r:151367122_151619511|GENSCAN_predicted_peptide_2|1122_aa MAQRSGKITLYEGKHFTGQKLEVFGDCDNFQDRGFMNRVNSIHVESGAWVCFNHPDFRGQ QFILEHGDYPDFFRWNSHSDHMGSCRPVGMHGEHFRLEIFEGCNFTGQCLEFLEDSPFLQ SRGWVKNCVNTIKVYGDGAWVLYEEPNYHGRMYVVESGDFRSFSDWEAHSARVQSLHKIW GQFPRGESVIYDLQEKREDRLAEDICTHSSTVQTQPPEMAGSSPPGGHFVPNRRRRGGGG GEAKNKKSINLERGSRSRAPRSPRWPRAAVVRSRRADPGAPLPSGAQLLLVPRPASGMGG GGSALRVCADHRGGINWLSLSPDGQRLLTGSEDGTARLWSTADGQCCALLQAHRPQTADP SSCARRPLSRSPAPAHPGTLAEERLSGPSIWLLSARQGRVPCGLFVANCALLLAFPVDDS HSNSRRLLRASDEKHPGKKIPQGSKKQNLASLLTKSYTESTQVKRCVAIACKENSEHVPF PLRLPLSSVCATSREAGIVPIPGHESYVTFCQLEDEAAFTCSADCTIRRWDVLTGQCLQV YRGHTSIVNRILVANNQLFSSSYDRTARVWSVDKGQMSREFRGHRNCVLTLAYSAPWDLP STPCAEEAAAGGLLVTGSTDGTAKVWQVASGCCHQTLRGHTGAVLCLVLDTPGHTAFTGS TDATIRAWDILSGEQLRVFREHRGSVICLELGSTLPCPERLKGLAEGRQHTLLQKEQLAL KAAKRLMVWGSAQVVTKLVNRLVYSGSADRTVKCWLADTGECVRTFTAHRRNVSALKYHA GTLFTGSGDACARAFDAQSGELRRVFRGHTFIINCIQVHGQVLYTASHDGALRLWDVRGL RGAPRPPPPMRSLSRLFSNKSLDGATFCIHVPRLARVPGEKLQVTVSAVESSSSRCREKH LLQEQPLQVEQPLQVEQPFQVEQHALHSLNAEDTSSETAGAARDHLLGDGRVRPRARMNR KMGLLGGKQIGAGEPAETAPCTVLRVRLKHTQTSGVARSCCVPEPLGPVAATRGIPTRED LSIVLKGRFRWRRPENRLECCLRAVLPAGPEPAQTFTLTPACRRHCCLVMLRSGLSSPSL TRQLQSPPYLPTAENGACIKPSRETQWGMDPKHGRCTHFAFL >gi568815591r:151367122_151619511|GENSCAN_predicted_CDS_2|3369_bp atggcgcagcgctcggggaagatcactctctatgaaggcaagcacttcacagggcagaag ctggaggtcttcggggactgtgacaacttccaggaccggggctttatgaaccgagtgaac tccatccacgtggagagcggagcctgggtctgcttcaatcaccccgacttccggggccag cagttcatcttggagcacggcgactaccccgacttcttccgctggaacagccacagtgac cacatgggctcctgtcggcctgtaggaatgcacggagaacatttccgcctagaaatcttc gagggttgcaacttcacgggccagtgcctggagttcctggaggacagccccttcctccag agcaggggctgggtcaagaactgtgtgaacaccatcaaggtgtacggggacggagcgtgg gtcctgtatgaggagcccaactaccacggccgcatgtacgtggtggagagtggcgacttc cgcagcttctcggactgggaggcccacagcgcacgagtgcagtcgctccacaagatctgg gggcaatttcccagaggagagtctgtaatctacgacctgcaagagaaaagggaagacagg ctggctgaggacatctgcacccacagctctacagtccagactcaacccccagagatggct ggctcatccccaccagggggccactttgtgccaaaccggagaaggaggggtgggggaggt ggagaagccaagaacaaaaaatctatcaacctggaaagaggaagtaggagtcgggctccc cgttccccgcgatggccgcgggccgcagtcgtacggagccgccgcgcggaccccggggcg cccctccctagcggggcgcagctccttcttgttccccgccctgccagcgggatggggggc ggcgggtcggccctgagggtctgcgccgaccaccgcgggggcatcaactggctgagcctg agccccgacgggcagcgcctgctgacgggcagcgaggacggcacggcccggctctggagc accgcggacggccagtgctgcgcgctcctgcaagctcaccggccccagaccgcggacccc agctcctgcgcccgcagacccctttctagaagtccagcccccgctcatcccggcacacta gcggaagagcgcttgtctgggcccagcatctggctgttgagcgcacggcagggcagggtg ccttgtggcctctttgtggccaattgtgctcttctgctggcatttcctgttgacgactct cacagcaacagtagacgcttgttaagggccagcgatgaaaaacatccaggaaaaaaaatt ccacaaggctccaaaaagcaaaatttggcttccctgctcaccaagtcctacactgaatcc acacaagtgaagcgatgtgtagccatcgcatgtaaggaaaatagcgaacatgtgccattc cctctgcggctgcctctgtcctcagtctgcgctacatcccgtgaggcaggcattgtcccc atcccaggacatgaaagctatgtgaccttctgccagctggaggatgaggctgccttcaca tgcagcgccgactgcaccatcaggaggtgggacgtgctgaccgggcagtgtctgcaggtg taccgaggacacacgtccatcgtgaacaggatcctggttgccaacaaccagctcttcagc agctcctatgaccggacagctcgggtctggagtgtggacaaggggcagatgtcccgggag ttccggggccaccgcaactgcgtgctgaccctagcctactctgccccgtgggacctcccc agcactccctgcgcggaggaggccgcggccggggggcttctggtgaccggcagcacagat ggcacagccaaggtgtggcaggtggccagcggctgctgccaccagacgctgcggggccac acgggtgcagtgctgtgcctagtgctagacacgcccggccacacggccttcacaggcagc accgacgccaccatccgtgcctgggacatcctgagtggggagcagctgcgggtgttccgg gagcaccggggctccgtcatctgtctggagctgggcagcaccttgccctgccctgagcgt ctgaaggggctggcagaaggaaggcagcacacacttttgcagaaagagcagctggcactg aaggcagccaaacgtctgatggtttggggctctgctcaggtagtcactaagctggtgaac cgactcgtgtactctggcagcgcggacaggaccgtcaagtgctggctggcagacacaggg gagtgtgtgcgcacgttcacggcccacagacgcaacgtgagcgccctcaagtaccacgcg ggcaccttgttcacgggcagcggggacgcttgcgcccgggccttcgacgcgcagtctgga gagctgcggagggtgttccggggccacacattcatcatcaactgcatccaggtgcacggc caggtgctctacaccgcctcgcacgacggcgccctgcgcctctgggacgtgcgcgggctc cgaggtgccccgcggccccctccgcccatgcgcagcctctcgcggctcttcagcaacaag tcccttgatggtgccactttctgcatccatgtgccccggttggcccgggtgccaggagag aagctgcaggtgacagtgtctgccgtggagagctcatcatctcgctgcagagagaagcac ctgctgcaggagcagccccttcaagtggagcagccccttcaagtggagcagccctttcag gtagagcagcatgcgcttcatagtcttaatgcagaagatacttcatcagagacggccggg gccgccagggatcacctgctaggtgatgggcgggtccgccctcgggcgcggatgaacagg aaaatgggcctgttgggaggcaagcagatcggtgcaggagaacccgcagaaacggcacca tgcactgttctccgggtgagacttaagcacacacagacctcgggggtggccaggtcttgc tgcgtcccggagccattggggcctgtggctgccacaaggggcatacctaccagagaggac ctttccatcgtgctgaagggacggttccggtggcgcaggccagagaaccgccttgagtgc tgcctgcgggcagtgctgccggccggccctgagcctgcacagacgttcacactgactcct gcttgccgaagacactgctgcttggtgatgctgcgctctggcctgagcagcccctcactg acacgtcagctccagtccccaccttatctccccacagcagagaatggagcatgcatcaag ccatccagagaaacccagtggggcatggacccaaaacacgggcgctgcactcactttgca tttttatag >gi568815591r:151367122_151619511|GENSCAN_predicted_peptide_3|150_aa MIVYLQQQSVLWVAPESCSAAILFLGYTSKAIRNTTTYPSKRVYQIREKIFHSSLAQGIK NNQQGIQVHQHRTLWLPYGARLTRVICAELYPKRLKQEDSRELQLEQRKSLIAEAPSGAG HLHVFQVPGTLLLLLLRAQGLHFEKHWLGQ >gi568815591r:151367122_151619511|GENSCAN_predicted_CDS_3|453_bp atgattgtgtatctccagcagcagtcggtgctgtgggtggctccggaatcctgcagtgct gccatcttattcctgggctacacatccaaagcaatacggaacaccacgacttaccccagc aaacgagtttaccaaatccgagaaaaaatatttcattccagtctggcccaaggcatcaag aacaatcagcaaggaattcaagtacatcagcacaggactctgtggctgccttacggggcg cggttgacgagggtgatatgtgctgaactttaccctaaaaggctcaagcaggaggacagc agggagttacaactggagcagcggaagtccctgattgcagaggccccgtcaggtgcagga catctgcatgtcttccaagttccagggacgctgctgctgctgctgctgcgagctcagggg ctgcattttgagaagcactggcttggacaataa >gi568815591r:151367122_151619511|GENSCAN_predicted_peptide_4|168_aa MPQSKSRKIAILGYRSVGKSSLTIQFVEGQFVDSYDPTIENTFTKLITVNGQEYHLQLVD TAGQDEYSIFPQTYSIDINGYILVYSVTSIKSFEVIKVIHGKLLDMVGKVQVISYEEGKA LAESWNAAFLESSAKENQTAVDVFRRIILEAEKMDGAASQGKSSCSVM >gi568815591r:151367122_151619511|GENSCAN_predicted_CDS_4|507_bp atgccgcagtccaagtcccggaagatcgcgatcctgggctaccggtctgtggggaaatcc tcattgacgattcaatttgttgaaggccaatttgtggactcctacgatccaaccatagaa aacacttttacaaagttgatcacagtaaatggacaagaatatcatcttcaacttgtagac acagccgggcaagatgaatattctatctttcctcagacatactccatagatattaatggc tatattcttgtgtattctgttacatcaatcaaaagttttgaagtgattaaagttatccat ggcaaattgttggatatggtggggaaagtacaggtgatcagttatgaagaagggaaagct ttggcagaatcttggaatgcagcttttttggaatcttctgctaaagaaaatcagactgct gtggatgtttttcgaaggataattttggaggcagaaaaaatggacggggcagcttcacaa ggcaagtcttcatgctcggtgatgtga >gi568815591r:151367122_151619511|GENSCAN_predicted_peptide_5|139_aa MDAGITEEWSDGERPPPTAVLGVMAERAGEQGCIPNRVPVLSTFVCRRLAVQLPDRSLFL FSTTLSLRARVQEMYSEEAQMDSSLLWDQDTLKEFKVFKEEHSSGKSATSAGPGSGHFFP IGPKGTGEDGDSRKGRKLI >gi568815591r:151367122_151619511|GENSCAN_predicted_CDS_5|420_bp atggatgcggggatcacggaggagtggagtgatggggagcgaccaccgcccactgctgtt cttggggtgatggcagaaagagcaggagagcagggctgtatccccaacagggttcctgtt ctaagcacgtttgtgtgcaggaggctggctgtgcagctgcctgaccgttctctgtttctt ttcagtacaaccctgtcactaagggctagagtccaggagatgtactcagaagaggcacag atggattctagtctcctgtgggatcaggacacattgaaagaattcaaggtcttcaaagag gaacacagctcaggcaagtcggccacaagtgcagggcccggctccggccacttcttcccc attggcccaaagggcacaggtgaggacggggacagcaggaaaggaaggaagctgatctga >gi568815591r:151367122_151619511|GENSCAN_predicted_peptide_6|472_aa XLGLVTASGYCTISWGSPCTCAFGNGPIDGHSLSIILEMGLCTMKTKQEFQDWTKRSVEL EGGNASSELQSFQGHHNKEGLRNCRSQEEPEEMLPTIMWNPGRDPVEDSESGVYMRFMRS HKCYDIVPTSSKLVVFDTTLQVKKAFFALVANGVRAAPLWESKKQSFVELYLQETFKPLV NISPDASLFDAVYSLIKNKIHRLPVIDPISGNALYILTHKRILKFLQLFMSDMPKPAFMK QNLDELGIGTYHNIAFIHPDTPIIKALNIFVERRISALPVVDESGKVVDIYSKFDVINLA AEKTYNNLDITVTQALQHRSQYFEGVVKCNKLEILETIVDRIVRAEVHRLVVVNEADSIV GIISLSDILQALILTPAAPVKYEQPSLCQGHMDPMWEQTACPLNSQNPILNEFTGMEQVM WHKVSARYVQITVPYVRIQQYVTAAAGAHACETTPSLNVEVFEPFTKSVCFL >gi568815591r:151367122_151619511|GENSCAN_predicted_CDS_6|1419_bp ngtctagggctggtaacagcttcaggttattgcactatttcttggggttccccatgcacc tgtgcctttgggaatggtcccatagacggtcactcgctgagcatcatcttagaaatgggg ctgtgcactatgaagactaaacaagagttccaggactggaccaaaagaagtgtggaactt gagggtgggaatgcctcatcagaacttcaaagcttccaaggccatcacaacaaggagggt ctgagaaactgtcgcagccaggaggagcccgaggagatgctcccgaccatcatgtggaat cctggacgggatccagtagaagactcagaaagtggtgtttacatgcgattcatgaggtca cacaagtgttatgacatcgttccaaccagttcaaagcttgttgtctttgatactacatta caagttaaaaaggccttctttgctttggtagccaacggtgtccgagcagcgccactgtgg gagagtaaaaaacaaagttttgtagagctttatttacaagaaacatttaagcctttagtg aatatatctccagatgcaagcctcttcgatgctgtatactccttgatcaaaaataaaatc cacagattgcccgttattgaccctatcagtgggaatgcactttatatacttacccacaaa agaatcctcaagttcctccagctttttatgtctgatatgccaaagcctgccttcatgaag cagaacctggatgagcttggaataggaacgtaccacaacattgccttcatacatccagac actcccatcatcaaagccttgaacatatttgtggaaagacgaatatcagctctgcctgtt gtggatgagtcaggaaaagttgtagatatttattccaaatttgatgtaattaatcttgct gctgagaaaacatacaataacctagatatcacggtgacccaggcccttcagcaccgttca cagtattttgaaggtgttgtgaagtgcaataagctggaaatactggagaccatcgtggac agaatagtaagagctgaggtccatcggctggtggtggtaaatgaagcagatagtattgtg ggtattatttccctgtcggacattctgcaagccctgatcctcacaccagcagccccagtc aaatatgaacaacccagcctttgtcagggccacatggacccgatgtgggagcaaacagca tgccctttgaactctcagaatccaatattaaacgaattcactggtatggaacaggtgatg tggcataaggtgagtgcacggtatgttcagatcacagtgccttatgtccgaatacagcaa tatgtcaccgccgcagccggggcgcacgcgtgtgaaacaacaccgagcttgaatgtggaa gtctttgaaccttttaccaaatcagtttgttttctttag