GENSCAN 1.0 Date run: 5-Nov-116 Time: 06:57:08 Sequence gi568815596r:106706681_106943977 : 237297 bp : 40.40% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 46 41 6 1.05 1.07 Term - 7470 7327 144 0 0 68 43 133 0.926 3.73 1.06 Intr - 7664 7499 166 1 1 68 87 86 0.294 5.54 1.05 Intr - 23980 23791 190 2 1 87 85 159 0.643 13.22 1.04 Intr - 28377 28299 79 0 1 111 87 27 0.421 3.11 1.03 Intr - 38469 38331 139 2 1 56 62 92 0.047 2.95 1.02 Intr - 62617 62405 213 0 0 4 92 145 0.452 3.41 1.01 Init - 63687 63626 62 0 2 73 103 45 0.562 5.27 1.00 Prom - 64838 64799 40 -5.75 2.03 PlyA - 64986 64981 6 1.05 2.02 Term - 85347 85089 259 2 1 59 38 729 0.991 59.04 2.01 Init - 90993 90941 53 0 2 54 68 52 0.021 0.49 2.00 Prom - 93066 93027 40 -7.05 3.10 PlyA - 93683 93678 6 1.05 3.09 Term - 94716 94574 143 1 2 78 48 121 0.543 4.21 3.08 Intr - 96861 96699 163 0 1 46 77 73 0.433 0.63 3.07 Intr - 100269 100064 206 1 2 55 32 132 0.065 2.30 3.06 Intr - 111524 111447 78 0 0 69 94 63 0.038 3.60 3.05 Intr - 123560 123386 175 2 1 99 74 85 0.939 6.79 3.04 Intr - 125986 125885 102 1 0 91 119 39 0.963 6.75 3.03 Intr - 127502 127369 134 0 2 45 110 76 0.481 4.94 3.02 Intr - 136002 135855 148 0 1 93 19 170 0.164 9.49 3.01 Init - 137114 136332 783 2 0 105 18 504 0.731 37.84 3.00 Prom - 139630 139591 40 -8.05 4.00 Prom + 141659 141698 40 -6.15 4.01 Init + 150288 150352 65 2 2 68 115 54 0.630 6.77 4.02 Intr + 151103 151203 101 2 2 116 71 31 0.525 3.03 4.03 Term + 152856 153052 197 1 2 51 49 110 0.436 -0.01 4.04 PlyA + 154912 154917 6 1.05 5.00 Prom + 157251 157290 40 -8.05 5.01 Init + 159774 159938 165 2 0 62 78 134 0.411 9.48 5.02 Intr + 162744 162944 201 2 0 125 -22 108 0.319 2.06 5.03 Intr + 165059 165158 100 1 1 24 93 96 0.394 2.46 5.04 Intr + 166469 166587 119 0 2 132 93 19 0.308 6.06 5.05 Intr + 168591 168639 49 2 1 94 109 9 0.281 0.93 5.06 Intr + 176256 176430 175 1 1 70 40 144 0.046 5.88 5.07 Term + 178804 179005 202 1 1 19 41 164 0.145 0.68 5.08 PlyA + 179174 179179 6 1.05 6.06 PlyA - 179518 179513 6 1.05 6.05 Term - 179781 179608 174 0 0 91 47 137 0.754 6.68 6.04 Intr - 180383 180296 88 1 1 75 91 7 0.656 -1.25 6.03 Intr - 180962 180721 242 2 2 8 75 232 0.345 9.33 6.02 Intr - 199270 199167 104 2 2 80 77 93 0.326 6.37 6.01 Init - 199977 199773 205 0 1 55 63 81 0.163 1.36 6.00 Prom - 200861 200822 40 -5.85 7.00 Prom + 204946 204985 40 -6.15 7.01 Sngl + 207468 207884 417 2 0 56 38 284 0.746 16.25 7.02 PlyA + 208037 208042 6 1.05 8.00 Prom + 208709 208748 40 -5.15 8.01 Sngl + 209916 210221 306 2 0 68 42 153 0.374 4.32 8.02 PlyA + 211561 211566 6 1.05 9.02 PlyA - 212571 212566 6 1.05 9.01 Term - 235085 234416 670 1 1 80 54 588 0.915 46.87 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:106706681_106943977|GENSCAN_predicted_peptide_1|330_aa MLLGLCKFKPSGKMDDRGMECETQVALQPPTTPADSPGQEERCKVNCRGRSTQTCDENSP ADPTGHSGMGECESLKDQSRVGLQPCIASEKRSQEWTLLPFKSSDYRIIEYYMYLQDLRI VEKSDQHKAGVARSIPKKVPNFIHSINSVLKHDGAPSATEWGKEKHFVAPNLPACLQVGR AQSEPMGYKWKLLRDGQCQLTRWLAFSSSDQVQELDARAREVILKTTRSQMLPSGSLKPE LKERPLLSPQHVPVGPKLDVPTCCRALALLLLCAVCPRQLTLKAGSAIGGDGRHQREQAS RRSLMYLFSACVSLQKSPLKATAAVRQSSS >gi568815596r:106706681_106943977|GENSCAN_predicted_CDS_1|993_bp atgcttctgggcttatgtaaatttaaaccaagtgggaagatggatgacagagggatggag tgtgagactcaagtagctctccagcctccaacaacacctgctgacagtccaggacaggaa gaaaggtgcaaagtcaactgcagagggaggagcacacaaacatgtgatgaaaactcacct gcagaccctactggacacagtgggatgggggagtgtgagtctcttaaggatcagagcagg gtaggactccagccctgtattgcctctgaaaaaagatcccaagaatggacgctattgcct ttcaagagttcagactacagaataattgagtactacatgtacctacaagatttgagaata gttgagaagagtgatcagcacaaggctggggttgcaaggagcatccccaagaaggtccct aacttcatacacagtataaactcagtgttgaagcatgatggggcaccatctgctactgag tggggaaaggaaaaacattttgttgcaccaaatctcccagcctgccttcaggtaggaaga gcacagtctgagccaatgggatataagtggaagttgctacgggatggtcagtgtcagctg acacgctggttggccttctcttcttctgaccaggttcaggaacttgatgctagagctaga gaagtcatcttgaaaactacaaggtcccagatgttgccctctgggtctctgaagccagag cttaaggagagacccctcttgtcccctcagcatgtgccagtgggacctaagctggatgtc cccacctgctgcagggccctggcccttcttctcctctgtgctgtgtgccccagacagctg accctaaaggctggttctgccattggaggtgatggcagacatcagagggagcaagcaagt agacgaagcctgatgtatttgttctccgcatgtgtgtctctccagaaatcccccctgaag gccacagctgctgtcaggcagtcctcttcatag >gi568815596r:106706681_106943977|GENSCAN_predicted_peptide_2|103_aa MGREQQDLVDTFGPSLGWKFPVTRGKKKKKKKEEERRRRKKKKEEERRRRKKKKEEERRR KEEEEEEEEEEEEEEEEEEEEEEEEKKKKKKKNKKKKKKKKKP >gi568815596r:106706681_106943977|GENSCAN_predicted_CDS_2|312_bp atggggagagagcagcaggacctcgtggacactttcgggcctagcctgggatggaagttc ccagtgaccaggggaaagaagaagaagaagaagaaagaagaagaaagaagaagaagaaag aagaagaaagaagaagaaagaagaagaagaaagaagaagaaagaagaagaaagaagaaga aaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaaaagaagaagaagaagaagaagaacaagaagaagaagaagaagaag aagaaaccttga >gi568815596r:106706681_106943977|GENSCAN_predicted_peptide_3|643_aa MGAAHEPSPPGGLDARQALPRAHPAGSFHAGPGDLQKWAQSQDGFEHKEFFSSQVGRKSQ SAFYPEDDDYFFAAGQPGWHSHTQGTLGFPSPGEPGPREGAFPAAQVQRRRVKKRHRRQR RSHVLEEGDDGDRLYSSMSRAFLYRLWKGNVSSKMLNPRLQKAMKDYLTANKHGVRFRGK REAGLSRAQLLCQLRSRARVRTLDGTEAPFSALGWRRLVPAVPLSQLHPRGLRSCAVVMS AGAILNSSLGEEIGRSQRGDQKPQSAWICPQAVAPESPVNPSCSGNMGREQEEEARKQKL PNQGVSSKEQENKSKLVTYVPLDSHDAVLRFNSAPTRGYEKDVGNKTTIRIINSQILTNP SHHFIDSSLYKDVILVAWDPAPYSANLNLWYKKPDYNLFTPYIQHRQRNPNQPFYILHPK FIWQLWDIIQENTKEKIQPNPPSSGFIGLYNQQMMKPASLCPSLQASEFPLAPGILIMMS MCREVHVYEYIPSVRQTELCHYHELYYDAACTLGAYHPLLYEKLLVQRLNMGTQGDLHRK GKPVTPGRRVTFNRPGVLMEYRCSSQGADLSHYVLIELLLQRTLCSGPRDGQLKVAWING HMPENESGEIKWKRHIFRLTNLDVDLITQDFDECMALGDSSSA >gi568815596r:106706681_106943977|GENSCAN_predicted_CDS_3|1932_bp atgggcgccgcacatgagccctccccgcctgggggcctggacgcacgccaggcgctgccc cgcgcccacccagccggttcctttcatgcggggcctggagacctgcagaaatgggcccag tcccaagatgggtttgaacataaagagtttttttcatcccaggtggggagaaaatctcaa agtgctttctacccggaggatgacgactacttttttgctgctggtcagccagggtggcac agccacactcaggggacattgggattcccttcccccggggagccaggcccacgggagggg gcttttccggctgcacaggtccagaggaggcgggtgaagaagaggcaccggaggcagaga aggagccacgtgttggaggagggcgacgacggcgacaggctgtactcctccatgtccagg gccttcctgtaccggctctggaaggggaacgtctcttccaaaatgctgaacccgcgcctg cagaaggcgatgaaggattacctgaccgccaacaagcacggggtgcgcttccgcgggaag cgggaggccgggctgagcagggcacagctgctgtgccagctgcggagccgcgcgcgcgtg cggacgctggacggcaccgaggcgcccttttctgcgctgggctggcggcgcctggtgccc gccgtgcccctgagccagctgcacccccgcggcctgcgcagctgcgctgtcgtcatgtct gcaggcgcaatcctcaactcttccttgggcgaggaaataggtaggtctcagcggggggac cagaaaccgcagtctgcatggatatgcccccaggctgtggcccctgaaagcccggtgaac ccatcctgcagtgggaatatgggaagggaacaagaggaggaggcaagaaagcagaagctg cccaatcagggcgtcagctccaaagagcaggagaacaaaagtaagcttgtcacttatgtg cccttagattctcatgatgcggttttgagatttaactctgctcctacacgtggttatgag aaagatgttgggaataaaaccaccatacgcatcattaattcgcagattctgaccaacccc agccatcacttcattgacagttcactgtataaagacgtcattttggtggcctgggaccct gccccatattccgcaaatcttaacctgtggtacaaaaaaccggattacaacctgttcact ccatatattcagcatcgtcagagaaacccaaatcagccattttacattcttcatcctaaa tttatatggcagctctgggatattatccaggagaacactaaagagaagattcaaccaaac ccaccatcttctggtttcattgggctctacaatcagcaaatgatgaagccagccagcttg tgtccttcccttcaggctagtgagttccccctggccccaggaatcctcataatgatgtcc atgtgcagagaggtgcacgtgtatgaatatatcccatccgtgcggcagacggagctgtgc cactaccacgagctgtactacgacgcagcctgcaccctcggggcgtaccacccactactc tatgagaagctcctggtgcagcgcctgaacatgggcacgcagggggatttgcatcgcaag ggcaagccagtcacaccaggcagacgggtaacctttaacagacctggtgttctcatggaa taccgatgctcgtcccagggagctgacctttctcactatgtccttatagagctgctctta cagagaactttgtgctcaggaccaagggatggtcaattaaaagttgcatggattaatggc cacatgccggagaatgaaagtggcgaaatcaagtggaaaaggcatatatttaggctaaca aatcttgatgttgacttaataacccaagattttgatgaatgcatggctttgggagactct tcctcagcttga >gi568815596r:106706681_106943977|GENSCAN_predicted_peptide_4|120_aa MILVYTKITLADAEVPVKVIDRVASLGRCEQHGRRRLWGDTQEQCLAADKNNLHTAWPVT QLPRSKTLVFFLIPFSTYNHIQSQPPANPDNPNSKTHLDSDNALPSVLPPCGPSRLSPEL >gi568815596r:106706681_106943977|GENSCAN_predicted_CDS_4|363_bp atgatcctggtatacacaaaaatcaccttggcagatgcagaggtgccagtgaaagtcata gacagagtggcctctttgggcaggtgtgagcagcacggccgaaggaggttgtggggtgac actcaagaacaatgcctcgctgctgacaagaataacctacacacagcatggccagtcact cagttgcccaggtctaaaactctggtgtttttcttgattcctttttccacttacaaccac atccaatcccaaccaccagcaaacccagataaccctaactccaaaacacatctagactct gacaacgcattgccttctgtactgcccccatgtggacctagtcgcctctcacctgaacta tga >gi568815596r:106706681_106943977|GENSCAN_predicted_peptide_5|336_aa MPHEHRTLVRSQCLGVQQDSKRVQGEYGTSTTSGGSHSSQRPCFLSEPELASNTKELQNI DRPLNSGITAPILGSPQFGNGAKRLQLPPSQGGHGLTVYKVQTPPPAAAFALWAEAGPSK LSVFRGSNTHGTAITYESNAFFWNPPEGPAWGCFTDVPSPVAISCRERWQNEVVAMHLIH TGRGQHSSFLGFKLKDQAMPPTSASRVTGTTEQFSQPTSRPWVLWDNQEPAGGKVKIIVT TAGTLLSPGLGPMLELDQITNSCKALGLPSYSALWKAALAGAIEFNFTQSPNTEPANPNE MDVTLPTQKNASNIHSVTPRKVNPKAEFAACQLHKN >gi568815596r:106706681_106943977|GENSCAN_predicted_CDS_5|1011_bp atgcctcatgagcacaggactctggtgaggtcacaatgcttgggagttcagcaggactca aaacgagtgcaaggtgagtatggtacctctacgacaagtggaggatcccacagctcccag aggccatgctttctgtcagaaccagaactagcttcaaacacaaaagaacttcaaaatatc gaccgccctttaaactcagggataactgctcccatcctaggaagcccgcaatttggaaat ggggcaaagaggctccagctcccacccagtcagggagggcatgggctgactgtttacaaa gtgcagacgccccctcctgctgctgcttttgctctctgggcagaagcagggccttcaaag ctgtcagtcttcaggggcagtaacacgcatggaactgccatcacctatgaaagcaacgcc ttcttctggaatcctcctgaaggacctgcctggggctgttttacagatgtccccagccct gtggccatctcctgcagggagcgatggcagaatgaggttgtggccatgcatctgatacac acaggtaggggacagcacagcagcttcctgggctttaagctaaaggatcaagcaatgccc ccaacctcagcctccagagtaactgggaccacagaacaattctcccaaccaaccagtagg ccctgggtactgtgggataatcaggaacctgcaggtggcaaagtgaagataatagtcaca actgctgggaccctgctttcaccagggctaggaccaatgctggagctggaccagatcacc aactcctgcaaagccttaggactgcccagctattcagctctttggaaggcagcacttgcg ggagccattgagtttaacttcactcagtcccccaacacagaaccggcaaatccgaatgag atggacgtcacactccctactcaaaagaatgcatcaaatatccactccgtaacaccgcgg aaagtaaacccaaaagcggagtttgccgcctgccaactacataagaattaa >gi568815596r:106706681_106943977|GENSCAN_predicted_peptide_6|270_aa MLLKKTGTGKVLTKGSGDAEAFLLNNAPIIQRNKNHGGRMQPKEAFFKIQVFYADYQNVA VEHYRYHLGTENRASTPQLLHILSYAYRAGTHSISARSSSNAQDQLPPSRENARAAVTSG HSARDQTDTRWAPPRGPARRLQLPSSQPPTGSGGCFNHLSRRLLPQELSVPLGVGDRQCG GAGSLCAFLHSCRRPGPECPGGVQARCQAGTVQLPPARSSAGPGEVIPQAAWDGSAGQLR DPTVAGGGLEPAAGRLPPRTRLLQPPSSAA >gi568815596r:106706681_106943977|GENSCAN_predicted_CDS_6|813_bp atgctcttgaagaagactgggacagggaaagttctcacaaaggggagtggagatgctgaa gcattcctcctaaacaatgctcccatcatacagagaaacaaaaatcatggaggaaggatg cagccaaaggaggcattttttaaaatccaggtattttatgctgactaccaaaatgtggca gtggagcattacaggtaccatctaggaacagaaaatcgtgcatccacaccccagctcctc cacattctgagctatgcgtaccgtgctggtacccacagcatctcagcaagatcgagctcc aatgcccaggaccagctgccgccgtctcgagaaaacgcaagggccgcggtgacaagtgga cattccgcacgcgaccagaccgacactcgatgggcacctccccgcggccccgcccgcagg ctccagctgcccagctcgcagccgcccactggctccggaggctgcttcaaccacttgtcc cgccggcttctaccccaagaactttccgtccctcttggggttggggaccggcagtgtgga ggggcgggcagcctgtgcgcattcctgcattcctgccgccgcccgggacccgagtgcccc ggaggtgtccaggcgcggtgccaggcgggtactgtgcagctgccgccggcgcggagcagc gctggccccggagaggtgatcccgcaggccgcctgggacgggagcgcggggcagctccga gaccccacggtggcaggaggaggactcgagcctgcggcaggacgactcccgccgcgcact cgccttctgcagcccccaagttccgcggcgtga >gi568815596r:106706681_106943977|GENSCAN_predicted_peptide_7|138_aa MEKAKWKPLKQPLPREIVNQKQYHIPGGSVEISATIKDLKDAGVVISTTSPCNSPIWPMQ KTDGSWRMTVDYCKLNQVVTPIAAATPDVVTLFEQINTFPGAGYVAIDLANPFFFIPVHK IHHKQLALGVYQLASPVS >gi568815596r:106706681_106943977|GENSCAN_predicted_CDS_7|417_bp atggaaaaggccaagtggaagccactaaaacaacctctgcctagagaaatagtaaaccaa aagcaatatcacattcctggagggtctgtggagattagtgccaccatcaaggacttgaaa gatgcaggggtggtgatttccaccacatccccatgtaattctcctatttggcctatgcag aagacagatggatcttggagaatgacagtggattattgtaagcttaaccaggtggtgact ccaattgcagctgctacgccagatgtggttacattgtttgagcaaattaacacatttcct ggtgccggatatgtagctattgatctggcaaatccctttttcttcatccctgtccataag atccaccataagcagttagccttaggggtttatcagctcgctagccctgtgtcataa >gi568815596r:106706681_106943977|GENSCAN_predicted_peptide_8|101_aa MYHPVSLISKIHRSRNQGTEMGVAPLAITPSNPLAKFLPSLLMTLFSADLENLFPKEEIL PPGNTKMIPLNWKLTASQPLWTPYTSQLAGKEESYCAHRRD >gi568815596r:106706681_106943977|GENSCAN_predicted_CDS_8|306_bp atgtatcatcctgtttctctcatatccaagattcacaggtctagaaatcaagggacagaa atgggagtagcaccactcgccattacccctagtaacccactagcaaaatttttgccttct ctcctcatgactttattctctgctgatctagagaacttatttccaaaggaagaaatcctt ccaccaggaaacacaaaaatgattccactgaactggaagttgactgccagccagccactt tggactccttatacctctcaattagcaggcaaagaagaaagttactgcgctcacaggagg gactga >gi568815596r:106706681_106943977|GENSCAN_predicted_peptide_9|223_aa XIFWPVSHRQQWVNCRLLRVKCQRLRAHKELLPVQRWLWRQGRCVPTTSLVAVSETAGAG AAALALSRLCRCLQVSASARPFSSRTLSPQPKCWAMAASHQPIKGILKNKTSRTSSMVAL SEQPCRTVHEELSKKCQKCDEMNILSTYHPADKDYGLMKIDEPSPPYHGITGDDENACSD TETTETMVSDILAKKLAAAEGLEPKYWVQEQESSGEEDLLAEE >gi568815596r:106706681_106943977|GENSCAN_predicted_CDS_9|672_bp nnaatcttttggccagtgtctcaccgtcagcagtgggtcaactgcaggctgctaagggtc aagtgccagcggctaagggcacataaggagctgctaccagttcagcgttggctctggcgt cagggtcgttgtgtcccgacaacctctctggtagccgtttctgagacagcaggtgcaggt gcggccgctttagccctgagcaggctctgccgctgcttgcaggtctctgctagcgcccga cccttctcttcacggaccttgagcccacagcccaagtgctgggcaatggcagcctcgcac cagcccatcaaggggatcctgaagaacaagacctccaggacttcctctatggtggcgttg tcggaacagccctgcagaactgtccacgaggagctgagtaaaaaatgccagaagtgtgat gaaatgaacatcctgtcaacatatcatccagcagacaaagactatggtttaatgaaaata gatgagccaagccctccttaccatggtataacgggtgatgatgaaaatgcatgtagtgat acagaaaccactgaaaccatggtatcagatatcttagctaagaaattagctgctgctgaa ggcttagagccaaaatattgggttcaggaacaagaaagcagtggagaggaggacctctta gctgaagaatga