GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:15:15

Sequence gi568815578f:37803351_38044250 : 240900 bp : 46.96% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.00  Prom +  30898  30937   40                              -2.16
1.01  Init +  36724  36849  126  0  0   56   98   129 0.780   8.87
1.02  Intr +  38989  39069   81  0  0  108   60    31 0.357   2.13
1.03  Intr +  56549  56686  138  1  0   84   91   302 0.536  30.76
1.04  Intr +  56922  56994   73  2  1   79   96    44 0.957   3.28
1.05  Intr +  58746  58884  139  1  1   62   44    89 0.498   1.42
1.06  Intr +  61871  62025  155  2  2   68   21   134 0.861   4.42
1.07  Intr +  70859  70934   76  0  1   46  110    26 0.012  -0.83
1.08  Intr +  71642  71756  115  2  1    7   65   113 0.017   1.35
1.09  Intr +  77614  77769  156  0  0   54   89    42 0.155   1.11
1.10  Intr +  77842  78005  164  0  2   87   66   136 0.711  10.07
1.11  Term +  79725  79815   91  0  1   99   32    74 0.801   0.09
1.12  PlyA +  80571  80576    6                               1.05

2.00  Prom +  94655  94694   40                              -4.96
2.01  Init + 100001 100121  121  1  1   96  113   250 0.999  26.65
2.02  Intr + 104769 104950  182  1  2    8   72   166 0.730   6.59
2.03  Intr + 111310 111425  116  0  2   52   77    73 0.844   1.85
2.04  Intr + 119479 119533   55  1  1   54   68    62 0.010  -0.22
2.05  Intr + 121303 121465  163  0  1   24   64    52 0.000  -4.05
2.06  Intr + 128285 128454  170  0  2   97   95   211 0.742  22.37
2.07  Intr + 133409 133474   66  1  0  107   73     9 0.164   0.40
2.08  Term + 140631 140903  273  2  0  103   38   542 0.986  46.17
2.09  PlyA + 141980 141985    6                               1.05

3.00  Prom + 155435 155474   40                              -2.86
3.01  Init + 167748 167767   20  2  2  113   61    19 0.754   1.23
3.02  Intr + 168738 168784   47  0  2  122   95    14 0.575   3.55
3.03  Intr + 169188 169316  129  1  0   61   84    27 0.375   0.27
3.04  Intr + 174067 174176  110  2  2   53   94    64 0.615   3.50
3.05  Term + 175567 175686  120  0  0   39   40   102 0.472  -1.03
3.06  PlyA + 178679 178684    6                               1.05

4.09  PlyA - 178798 178793    6                               1.05
4.08  Term - 180289 180106  184  0  1  126   46   271 0.973  23.72
4.07  Intr - 193112 193025   88  0  1  128   94    69 0.811  10.53
4.06  Intr - 193603 193399  205  1  1   54   80   160 0.980  10.57
4.05  Intr - 195978 195838  141  0  0  109   79    32 0.915   4.95
4.04  Intr - 197169 197055  115  2  1   17   70   120 0.781   3.55
4.03  Intr - 199426 199231  196  2  1   69   70   151 0.582   9.87
4.02  Intr - 203047 202847  201  2  0   53   88   150 0.878  10.76
4.01  Init - 210466 208165 2302  1  1   77   56   970 0.967  80.24
4.00  Prom - 214712 214673   40                              -5.36

5.00  Prom + 217852 217891   40                              -4.16
5.01  Init + 230598 230748  151  2  1  102   92   283 0.521  30.30
5.02  Intr + 237085 237214  130  2  1  105   91    29 0.056   4.65

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  68575  68663   89  2  2  121   44   203 0.999  17.02
S.002 Intr - 122215 122144   72  2  0   76   97   106 0.951   9.68
S.003 Init + 127446 127668  223  2  1   69    1   134 0.838   1.52
S.004 Intr - 182124 182080   45  0  0   88  107    55 0.871   6.01

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:37803351_38044250|GENSCAN_predicted_peptide_1|437_aa
MFSFLFPNPEHVCSILASLLRNLRGQQRTRLLNKFTENDSEKVDRLMELHFKYLGAMQVA
DKKIEGEKHDMVRRGEIIDNDTEEEFYLRRLDAGLFVLQHICYIMAEICNANVPQIRQRV
HQILNMRGSSIKIVRHIIKEKKRVKGSDGHSPTHAFRGDTLGASSHSQCQIRVNYCSEPH
FASSLRHTSLTSEVTDHSAKQNGASAGMQRYKGNEDGTVNSQLIRLDMKRSPKIQGSLAG
TYRMPSGIVIPDLCKLRLRAQRGRGVRFQGPRDQGDSVRRKQPPPLPDFNDPRPRKAFQG
EDLMTGTGFQQLQAEVSLSWGKIQRYTDRTSPGWPLGPTCCLTSSGLGTWTRVLCELLHS
LSLSLLICDMGGPLALETCHKLMHAKSLINYKVLIQAHVVITDIPEIPKKQLSEAATLLP
IKQGKFYRNRIGGTPDT

>gi568815578f:37803351_38044250|GENSCAN_predicted_CDS_1|1314_bp
atgttctcttttctctttcccaacccagagcatgtctgttcgatcctggcttccctcctg
cggaacctgagagggcagcagcggacccggcttctgaataaattcactgaaaatgacagt
gagaaggttgacagactaatggagttgcattttaaatatctgggtgcaatgcaggtggcg
gacaagaagattgaaggggaaaaacacgacatggtccggcgaggagagatcatcgacaat
gacaccgaggaggagttctacctccggcgcctggatgcggggctctttgttctccagcac
atctgctacatcatggccgagatctgcaatgccaatgtcccccagattcgccagagggtt
caccagatcctaaacatgcgaggaagctccatcaaaattgtcaggcatatcatcaaggag
aaaaagagagtaaaggggtctgatggacacagccccactcatgccttccggggtgacacc
ctgggggcttcttctcattctcagtgtcagattcgcgtgaattactgctcagagcctcat
tttgcctcctccctaaggcatacgtccctcacaagtgaggtgactgaccatagcgcaaag
cagaatggagccagtgctgggatgcagaggtacaaggggaatgaagacggcacagttaac
tcacagttaatccggctggacatgaaaaggagcccaaagatccagggaagcttagctggc
acatatcgaatgcccagtggcattgttatccccgatttatgcaagctgaggctgagggct
cagagaggcaggggagtacggttccagggcccgagggaccaaggagacagcgtaaggaga
aagcagccaccaccactccctgatttcaacgacccaaggccccgcaaagcttttcagggc
gaggacctgatgacaggcactggctttcagcagctgcaagcagaagtctctctgtcatgg
gggaagatccaaagatacacggaccgaacttccccaggctggcctctgggccccacctgc
tgcctgacctcctccgggctgggcacgtggacccgtgtgctgtgtgagttgcttcactcc
ctgagcctcagtctcctcatctgtgatatggggggacccctcgctttggagacctgccac
aagctgatgcacgcaaagagcttgatcaactataaagtgctgatccaggctcatgtcgtc
attactgatattcctgaaataccgaaaaaacagctgtcagaggcagccaccctgttaccg
attaaacaaggcaaattctatcggaacagaatcggaggcactccggacacctaa

>gi568815578f:37803351_38044250|GENSCAN_predicted_peptide_2|381_aa
MGAPLAVALGALHYLALFLQLGGATRPAGHAPWDNHVSGHAVFPKSFTPIGFDTLPISSS
CSLFPKQEKPQAASRTQGIWLLAMVTARMVGKVACGKQVGRRSQKHMTCQSRIARPAAGI
GLPFNTPTLQRGKLRLREAWSHGSTRGNEHREELEDRWRLSVHHHNQQQQQQHQNTHQSV
YFPVSGAGAVSSLRVRRGRKTLLLIELPAFSGALFTETPHDMTARTGEDVEMACSFRGSG
SPSYSLEIQWWYVRSHRDWTDKQAWASNQLAVVEGIYENALETVKCHGLGYVVKVVGSNI
SHKLRLSRVKPTDEGTYECRVIDFSDGKARHHKVKAYLRVQPGENSVLHLPEAPPAAPAP
PPPKPGKELRKRSVDQEACSL

>gi568815578f:37803351_38044250|GENSCAN_predicted_CDS_2|1146_bp
atgggggccccgctcgccgtagcgctgggcgccctccactacctggcacttttcctgcaa
ctcggcggcgccacgcggcccgccggccacgcgccctgggacaaccacgtctccggccac
gcggttttccccaagtccttcacaccaattggttttgacaccttgcccatcagcagctcc
tgcagccttttccccaaacaagagaagccgcaggcggccagcaggacgcagggaatttgg
ctccttgccatggtgacagcaaggatggtgggcaaagtcgcctgcggcaaacaagtgggc
aggagaagccagaagcacatgacctgtcagagccgcatcgcacggcccgcagcaggcatc
gggctcccattcaacacgcccaccttgcagaggggaaaactgaggcttagagaggcgtgg
agccatgggagcaccagaggaaatgagcacagagaagagctagaggaccggtggagatta
tcagtacatcatcataatcagcagcagcagcagcagcatcaaaatacacatcaatctgtg
tattttccagtttctggtgctggtgcagtatcctcactcagagttaggagagggaggaag
accctgttgctgattgagctgcctgctttcagtggggccctgttcacagagacaccccat
gacatgacagcacggacgggcgaggacgtggagatggcctgctccttccgcggcagcggc
tccccctcctactcgctggagatccagtggtggtatgtacggagccaccgggactggacc
gacaagcaggcgtgggcctcgaaccagttagctgttgttgaaggcatatatgaaaacgct
ttggaaactgtaaagtgccatggattaggctatgtggtcaaggtggtgggcagcaacatc
tcccacaagctgcgcctgtcccgggtgaagcccacggacgaaggcacctacgagtgccgc
gtcatcgacttcagcgacggcaaggcccggcaccacaaggtcaaggcctacctgcgggtg
cagccaggggagaactccgtcctgcatctgcccgaagcccctcccgccgcgcccgccccg
ccgccccccaagccaggcaaggagctgaggaagcgctcggtggaccaggaggcctgcagc
ctctag

>gi568815578f:37803351_38044250|GENSCAN_predicted_peptide_3|141_aa
MVYRYTQARSGPPVAHCPSSLAYFPEISKAGAHKLEIPWVRTLVFAASGTLGTRPFSSGW
EWPGKESTITQNYFLKFGLYSLLWRKGSTGTWLPIKPDAQDWSKTLFQQQQQQQQQQQQQ
QQDCKKKKVKEDATETFMYGL

>gi568815578f:37803351_38044250|GENSCAN_predicted_CDS_3|426_bp
atggtctacagatacactcaggcaagatcaggtcctcctgttgcacattgtcccagctcc
ctggcctactttccagaaatcagcaaagcaggtgctcacaagctggaaataccctgggtt
aggaccctagtgtttgctgccagtgggaccttaggaaccaggcccttcagctctggctgg
gagtggcctggcaaggagtccaccatcactcagaattattttctgaaatttggcctctat
tccttactctggaggaaaggcagtacgggcacctggcttccaatcaagcccgacgcacag
gactggagtaagaccctgtttcaacaacaacagcaacaacaacaacaacaacaacaacaa
caacaagattgtaagaaaaaaaaagtcaaagaagatgccacagagacctttatgtatggc
ctgtaa

>gi568815578f:37803351_38044250|GENSCAN_predicted_peptide_4|1143_aa
MAVFDTPEEAFGVLRPVCVQLTKTQTVENVEHLQTRLQAVSDSALQELQQYILFPLRFTL
KTPGPKRERLIQSVVECLTFVLSSTCVKEQELLQELFSELSACLYSPSSQKPAAVSEELK
LAVIQGLSTLMHSAYGDIILTFYEPSILPRLGFAVSLLLGLAEQEKSKQIKIAALKCLQV
LLLQCDCQDHPRSLDELEQKQLGDLFASFLPGISTALTRLITGDFKQGHSIVVSSLKIFY
KTVSFIMADEQLKRISKVQAKPAVEHRVAELMVYREADWVKKTGDKLTILIKKIIECVSV
HPHWKVRLELVELVEDLLLKCSQSLVECAGPLLKALVGLVNDESPEIQAQCNKVLRHFAD
QKVVVGNKALADILSESLHSLATSLPRLMNSQDDQGKFSTLSLLLGYLKLLGPKINFVLN
SVAHLQRLSKALIQVLELDVADIKIVEERRWNSDDLNASPKTSATQPWNRIQRRYFRFFT
DERIFMLLRQVCQLLGYYGNLYLLVDHFMELYHQSVVYRKQAAMILNELVTGAAGLEVED
LHEKHIKTNPEELREIVTSILEEYTSQENWYLVTCLETEEMGEELMMEHPGLQAITSGEH
TCQVTSFLAFSKPSPTICSMNSNIWQICIQLEGIGQFAYALGKDFCLLLMSALYPVLEKA
GDQTLLISQVATSTMMDVCRACGYDSLQHLINQNSDYLVNGISLNLRHLALHPHTPKVLE
VMLRNSDANLLPLVADVVQDVLATLDQFYDKRAASFVSVLHALMAALAQWFPDTGNLGHL
QEQSLGEEGSHLNQRPAALEKSTTTAEDIEQFLLNYLKEKDVADGNVSDFDNEEEEQSVP
PKVDENDTRPDVEPPLPLQIQIAMDVMERCIHLLSDKNLQIRLKVSAQLLLCIPRVAGLR
LGVHVGGKPALAAHAGGKPALAVVLTAACDTDAGSLFSVLDVLDLCVVVLQSHKNQLLPL
AHQAWPSLVHRLTRDAPLAVLRAFKVLRTLGSKCGDFLRSRFCKDVLPKLAGSLVTQAPI
SARAGPVYSHTLAFKLQLAVLQGLGPLCERLDLGEGDLNKVADACLIYLSVKQPVKLQEA
ARSVFLHLMKVDPDSTWFLLNELYCPVQFTPPHPSLHPVQLHGASGQQNPYTTNVLQLLK
ELQ

>gi568815578f:37803351_38044250|GENSCAN_predicted_CDS_4|3432_bp
atggcagtttttgatactcctgaggaggcctttggtgtcttacgtccagtctgtgttcag
ctcacaaagacccagacagtggagaatgtggagcatctgcagacacgactacaagctgtg
agtgacagtgcccttcaggaacttcagcagtacatcctcttccctctgcgatttaccctg
aagaccccaggtcccaaaagagagcgtttgatccaaagtgtggtggaatgcctcacattt
gtcctttcttcaacatgtgtgaaagaacaggagcttctccaggaactcttttcagaactc
tctgcttgtctgtattcacccagctcccaaaaacctgcggctgtgtccgaggagttgaaa
ttggctgtgatccagggacttagcacattaatgcactcagcttatggggacatcattctg
actttttatgagccctccattctgccacgtttaggatttgctgtatctttactgttaggc
cttgcagaacaggagaaatcaaagcaaattaaaattgctgccttaaaatgtttacaggtt
ctactcttgcagtgtgattgtcaggaccatccaaggtcattggatgaacttgaacaaaag
cagctgggggatttgtttgcctcttttttacctggaatctcaactgcactgaccaggctt
atcacaggagactttaaacaaggtcacagcattgtcgtatcttccctaaagatcttttac
aagacagtgagcttcattatggctgatgaacagctcaaaagaatctcaaaggtccaagca
aaacctgcagttgagcacagagtagcagagctgatggtttacagggaagcagattgggta
aaaaagactggcgacaagttgactatccttattaaaaagataattgagtgtgtttctgtt
cacccacactggaaggtgagactggaactggtagaacttgtggaggaccttcttttgaag
tgcagtcaatcattggtcgaatgtgctggtccccttctgaaggccttagtgggactagta
aatgatgagagtcctgaaatccaagcccagtgcaataaagttctgagacattttgcagat
caaaaagtagtggtgggcaacaaagccctcgctgacatcttgtcagaaagcctgcattcc
cttgccacatctcttcctcgcctaatgaactcccaagatgaccagggcaaattctctact
ctttccttgttacttggttatctgaaactcttgggcccaaaaataaactttgtcctcaac
tctgtggcccatctccagcggctttccaaagcactcatccaagttctagagctagacgtg
gctgacatcaagattgttgaggaacggcgttggaactctgatgatctgaatgcttctcca
aagacctcagccacacagccttggaaccgcatccagaggagatatttccgcttcttcact
gatgagagaatcttcatgctcttgaggcaggtttgtcagctacttggttattatgggaat
ctttatttgcttgtggatcactttatggaactttaccatcaatctgtggtttaccggaag
caagctgccatgatccttaatgaactggttacaggggctgctgggctggaggttgaggat
cttcacgaaaaacatattaaaacaaacccagaagaactgagagagattgtgacatctata
cttgaagaatacacaagtcaagaaaattggtatttggttacctgtcttgaaactgaggaa
atgggagaggagctgatgatggagcacccaggcctccaagccatcacgtctggtgaacac
acctgccaagttacatcttttctagccttctcaaagccaagtcccactatttgctccatg
aacagtaacatctggcaaatatgcattcagttggaaggaattggccagtttgcatatgca
ctaggaaaagacttctgtttgctcttgatgtcagccctttatccagtactggagaaggct
ggagaccaaaccctactcattagtcaggtggctaccagcaccatgatggacgtttgccgt
gcttgtggctacgactccctgcagcacctgatcaatcaaaattcagactatttagtgaat
gggatctctttaaatctgcgtcatctggctctgcatcctcataccccaaaggtcctggaa
gtcatgctgcggaactcagatgctaacctgcttcctttggtggcagatgtggttcaagat
gtcttggccaccctggaccaattttacgataagagagctgcttcctttgtcagcgttctg
catgctctgatggcagcattagcccagtggttcccagacacaggtaatcttgggcacctc
caagagcaaagtttaggagaagagggaagtcatttgaaccaaagaccagcagctcttgag
aagagcaccaccacagctgaagacatcgaacagtttttgctgaactacctcaaagagaag
gatgtggcagatggaaatgtctcggattttgataatgaagaagaggaacagtcagtccct
cccaaagtggatgagaatgacacccgtccagatgtggagccaccactgccattgcagatc
caaatagccatggacgtgatggaacgctgcatccacttgttgtcagataaaaatctgcaa
atccgcctgaaggtcagtgcgcagctgcttctctgcattcccagagtagcaggactaagg
ttgggcgtccatgtgggagggaagcctgcgctggccgcccacgcgggagggaagcctgcg
ctggccgtcgtactgaccgctgcctgtgacactgatgcaggcagtctcttttcagtcttg
gatgtgctggatctgtgtgtggttgttcttcagtcccacaaaaaccagctgcttcccttg
gctcatcaggcctggccctcgctcgttcaccgactcacacgggacgcccccctggcagtg
cttagagccttcaaggttttacgtaccctgggaagcaagtgtggtgactttcttcgcagc
cggttctgcaaagatgtcctgccaaagctggctggctccctagtcacccaggcccccatc
agtgccagggctggaccagtttactcgcacacgctggccttcaagttgcagctggctgtc
ttacagggcctgggccccctctgtgagagactggacctaggtgagggtgacctgaataaa
gtggctgatgcctgcttgatttacctcagtgtcaaacagcccgtgaaattacaagaggct
gccaggagcgtcttcctccacttgatgaaggtggacccagactccacctggttcctcctg
aacgagctttactgccccgtgcagttcacacctccccaccccagcctccaccctgtgcag
ctgcacggggccagcgggcagcagaacccctacacgaccaacgtgctccagctgctcaag
gagctgcagtga

>gi568815578f:37803351_38044250|GENSCAN_predicted_peptide_5|94_aa
MSSFSESALEKKLSELSNSQQSVQTLSLWLIHHRKHAGPIVSVWHRELRKAKSNRKLTFL
YLANDVIQNSKRKGPEFTREFESVLVDAFSHVAS

>gi568815578f:37803351_38044250|GENSCAN_predicted_CDS_5|282_bp
atgtcctccttctctgagtcggcgctggagaagaagctctcggagctgagcaactctcag
cagagcgtgcagaccctgtccctttggctcatccaccaccgcaagcacgcgggacccatc
gtctccgtgtggcaccgcgagctccgcaaagccaaatcaaatagaaagcttacttttctg
tatttagcgaatgatgtcatccaaaacagtaaaaggaaaggacctgaattcactagagaa
tttgaatctgtccttgtggatgctttttctcatgttgccagn