GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:29:31 Sequence gi568815589r:72801211_73053000 : 251790 bp : 37.27% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 2108 2416 309 1 0 71 48 175 0.979 7.45 1.02 PlyA + 2689 2694 6 1.05 2.08 PlyA - 3316 3311 6 1.05 2.07 Term - 7900 7702 199 0 1 19 42 208 0.227 5.19 2.06 Intr - 15732 15606 127 1 1 58 42 122 0.351 3.42 2.05 Intr - 26938 26860 79 1 1 108 30 76 0.114 2.01 2.04 Intr - 27639 27543 97 2 1 54 89 56 0.062 1.39 2.03 Intr - 35637 35534 104 0 2 99 37 87 0.051 2.75 2.02 Intr - 35917 35808 110 2 2 8 48 154 0.489 2.48 2.01 Init - 36670 36538 133 1 1 43 89 62 0.884 2.15 2.00 Prom - 41859 41820 40 -4.85 3.00 Prom + 44875 44914 40 -5.25 3.01 Init + 49409 49529 121 1 1 68 74 90 0.927 6.00 3.02 Term + 69498 69619 122 1 2 20 53 182 0.030 5.56 3.03 PlyA + 69779 69784 6 1.05 4.00 Prom + 70088 70127 40 -5.75 4.01 Sngl + 79030 79224 195 0 0 65 42 202 0.930 8.21 4.02 PlyA + 79422 79427 6 1.05 5.15 PlyA - 80333 80328 6 1.05 5.14 Term - 88768 88649 120 1 0 66 48 99 0.369 1.19 5.13 Intr - 91088 90981 108 2 0 92 84 48 0.044 4.36 5.12 Intr - 104822 104804 19 1 1 112 103 -14 0.107 -1.80 5.11 Intr - 108549 108392 158 0 2 59 103 93 0.924 5.79 5.10 Intr - 110912 110748 165 2 0 34 84 206 0.999 14.04 5.09 Intr - 115894 115710 185 2 2 88 111 98 0.997 10.69 5.08 Intr - 117612 117510 103 0 1 109 74 139 0.997 13.43 5.07 Intr - 122922 122809 114 0 0 100 75 124 0.847 12.02 5.06 Intr - 124402 124274 129 1 0 43 107 55 0.708 2.87 5.05 Intr - 125967 125906 62 1 2 108 69 -21 0.646 -4.47 5.04 Intr - 127811 127682 130 2 1 39 111 110 0.900 7.75 5.03 Intr - 129809 129669 141 2 0 60 101 133 0.993 11.43 5.02 Intr - 139042 138938 105 1 0 86 113 95 0.993 11.29 5.01 Init - 147428 147381 48 2 0 47 121 37 0.636 3.90 5.00 Prom - 148634 148595 40 -3.85 6.03 PlyA - 150957 150952 6 1.05 6.02 Term - 157758 157337 422 1 2 62 53 203 0.832 8.67 6.01 Init - 158420 158285 136 2 1 59 25 120 0.621 3.15 6.00 Prom - 160537 160498 40 -8.55 7.00 Prom + 160832 160871 40 -4.55 7.01 Init + 179804 179942 139 1 1 49 54 140 0.961 7.05 7.02 Intr + 180360 180441 82 1 1 123 45 66 0.766 3.58 7.03 Intr + 181873 182128 256 1 1 54 75 96 0.662 1.42 7.04 Term + 186721 186897 177 0 0 54 43 117 0.525 0.50 7.05 PlyA + 187343 187348 6 1.05 8.00 Prom + 209475 209514 40 -5.35 8.01 Init + 210449 210694 246 1 0 31 27 204 0.539 6.24 8.02 Intr + 217661 217750 90 1 0 70 100 20 0.300 0.67 8.03 Intr + 234692 234767 76 1 1 -21 110 102 0.429 -0.33 8.04 Intr + 237434 237588 155 0 2 81 31 136 0.619 6.07 8.05 Intr + 237616 237795 180 0 0 114 99 -6 0.486 2.24 8.06 Intr + 238123 238252 130 0 1 21 59 126 0.462 2.35 8.07 Intr + 241572 241705 134 1 2 72 18 75 0.193 -1.66 8.08 Intr + 242162 242248 87 1 0 29 116 76 0.573 3.75 8.09 Intr + 244133 244256 124 1 1 52 84 143 0.517 9.54 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 35637 35530 108 0 0 99 54 87 0.842 3.93 S.002 Init + 38577 38697 121 2 1 68 74 95 0.840 6.50 S.003 Term + 49625 49719 95 0 2 85 49 80 0.839 0.81 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:72801211_73053000|GENSCAN_predicted_peptide_1|102_aa MGKDFKTKTSKAIATKTKIDKWELIKLKSFCTAKETLIRVNRQPTEWEKIFAIYPFDKDL ISRIYKELKQIYEKKTYNPIKKWVKDMNRHFSKKRFTWPTTI >gi568815589r:72801211_73053000|GENSCAN_predicted_CDS_1|309_bp atgggcaaagatttcaagacgaaaacatcaaaagcaattgcaacaaaaacaaaaattgac aaatgggaactaattaaactaaagagcttctgtacagcaaaagaaactctcatcagagtg aacagacaacctacagaatgggagaaaatttttgcaatctatccatttgacaaagatcta atatccagaatctacaaagaacttaaacaaatttacgagaaaaaaacatacaaccccatt aaaaagtgggtgaaggacatgaacagacacttctcaaagaagagattcacgtggccaaca accatatga >gi568815589r:72801211_73053000|GENSCAN_predicted_peptide_2|282_aa MEVNLKKTTRGIKFRTGRETVRLFYGCLLVIKKNEKLRDRIIEITPSNCRQATIHTDSPS AKKIKRGEMQTPEPCQCIETQLNSSLQGGKNCIEFDADLYGFTTDNMIIGRQMSRLKSVL TLGLYHLGQVAFSPSEIPTDSDPCLVLQVISSPSQKASQYKEVSTGFTVKGSASGWKSEV TDFQDTWSPDESDVVIQFLNSKTISTVSTHLNLSSLQGAESGKENRNETKPAEYLIAKNG KATQEHPVSVAITSLSFTEEHYLTAFGQSEEALSMRVMETTL >gi568815589r:72801211_73053000|GENSCAN_predicted_CDS_2|849_bp atggaagtcaatctgaaaaagacaaccagaggaataaagtttagaactggcagggaaaca gtgcgattattctatggctgccttttggtgatcaaaaagaatgagaaactcagagacagg ataatagaaatcactccttccaactgccggcaggccaccatccacacggacagcccatcc gccaaaaaaatcaagcggggagagatgcaaactccagaaccatgccagtgtatagaaacc cagctgaattcttccctccaaggaggcaagaattgtatcgagtttgatgcagacctgtat ggattcaccactgataacatgatcattggaagacaaatgagcagactgaaaagtgtgctg accctgggtctataccacctcggacaagtggctttcagcccatctgaaattcccactgat tctgatccctgtttggtgctgcaagttattagctctccctcacagaaagcctcgcagtat aaagaagtgagcactggatttactgtgaaaggatctgcttcaggctggaaatctgaagtg acagatttccaagacacatggtccccagatgaatcggatgtcgtgattcagtttttaaac tcaaaaacgatttctactgtatcgactcatcttaacctgtcaagtttacaaggagctgaa tcaggtaaggaaaatagaaatgaaaccaagccagctgaatacctcatagcaaagaatggg aaagcaacacaagagcatcctgtatctgtagccattacctcactgtcgttcaccgaagag cattacctaactgcttttggccagtccgaggaggcactttcaatgagggtgatggagacg accttataa >gi568815589r:72801211_73053000|GENSCAN_predicted_peptide_3|80_aa MTLKEHAAFKHLFNKAHLAPPLIHSTLSGYSTCFREHRVGGNYAKWLLGTATVLLAEGVR GGDGITRVQNHGEENYLGQS >gi568815589r:72801211_73053000|GENSCAN_predicted_CDS_3|243_bp atgactcttaaggagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccattcaaccctgagtggatacagcacatgtttcagagagcacagggttggg ggtaactatgctaaatggctgctgggaacagcgacagtccttctggctgaaggcgtacgt ggaggtgatggaatcaccagagtccagaatcatggagaagagaactacctgggccagtcc tga >gi568815589r:72801211_73053000|GENSCAN_predicted_peptide_4|64_aa MTVALEEGKDWDDASTSQGTQKLASEPPEARRGPGTNSPSQPSGGSNPDLELPASRTESL QDCL >gi568815589r:72801211_73053000|GENSCAN_predicted_CDS_4|195_bp atgactgtggccttagaagaaggtaaagattgggatgatgcttccacaagccaaggaaca caaaagcttgccagtgaaccaccagaagctaggaggggtcctggaaccaattctccctca cagccctctggaggatctaaccctgatcttgaacttccagcctccaggactgagagcctc caggactgtctctaa >gi568815589r:72801211_73053000|GENSCAN_predicted_peptide_5|528_aa MSRRAFDEDFWEGALKIFINNEWHDSVSGKKFPVFNPATEEELCQVEEGDKEDVDKAVKA ARQAFQIGSPWRTMDASERGRLLYKLADLIERDRLLLATMESMNGGKLYSNAYLNDLAGC IKTLRYCAGWADKIQGRTIPIDGNFFTYTRHEPIGVCGQIIPWNFPLVMLIWKIGPALSC GNTVVVKPAEQTPLTALHVASLIKEAGFPPGVVNIVPGYGPTAGAAISSHMDIDKVAFTG STEVGKLIKEAAGKSNLKRVTLELGGKSPCIVLADADLDNAVEFAHHGVFYHQGQCCIAA SRIFVEESIYDEFVRRSVERAKKYILGNPLTPGVTQGPQIDKEQYDKILDLIESGKKEGA KLECGGGPWGNKGYFVQPTVFSNVTDEMRIAKEEIFGPVQQIMKFKSLDDVIKRANNTFY GLSAGVFTKDIDKAITISSALQAGTVWVNCYGVLLTWAKTTPALSSTPGPFLLNERSWNN PEELHPLLHAASQCPHSSHCAFQVKAYQGSSYGKEQQFSGGEEEAAEK >gi568815589r:72801211_73053000|GENSCAN_predicted_CDS_5|1587_bp atgagtagaagagcttttgatgaggatttttgggaaggtgctttaaagatcttcataaac aatgaatggcatgattcagtgagtggcaagaaatttcctgtctttaatcctgcaactgag gaggagctctgccaggtagaagaaggagataaggaggatgttgacaaggcagtgaaggcc gcaagacaggcttttcagattggatccccgtggcgtactatggatgcttccgagaggggg cgactattatacaagttggctgatttaatcgaaagagatcgtctgctgctggcgacaatg gagtcaatgaatggtggaaaactctattccaatgcatatctgaatgatttagcaggctgc atcaaaacattgcgctactgtgcaggttgggctgacaagatccagggccgtacaatacca attgatggaaatttttttacatatacaagacatgaacctattggtgtatgtggccaaatc attccttggaatttcccgttggttatgctcatttggaagatagggcctgcactgagctgt ggaaacacagtggttgtcaaaccagcagagcaaactcctctcactgctctccacgtggca tctttaataaaagaggcagggtttcctcctggagtagtgaatattgttcctggttatggg cctacagcaggggcagccatttcttctcacatggatatagacaaagtagccttcacagga tcaacagaggttggcaagttgatcaaagaagctgccgggaaaagcaatctgaagagggtg accctggagcttggaggaaagagcccttgcattgtgttagctgatgccgacttggacaat gctgttgaatttgcacaccatggggtattctaccaccagggccagtgttgtatagccgca tccaggatttttgtggaagaatcaatttatgatgagtttgttcgaaggagtgttgagcgg gctaagaagtatatccttggaaatcctctgaccccaggagtcactcaaggccctcagatt gacaaggaacaatatgataaaatacttgacctcattgagagtgggaagaaagaaggggcc aaactggaatgtggaggaggcccgtgggggaataaaggctactttgtccagcccacagtg ttctctaatgttacagatgagatgcgcattgccaaagaggagatttttggaccagtgcag caaatcatgaagtttaaatctttagatgacgtgatcaaaagagcaaacaatactttctat ggcttatcagcaggagtgtttaccaaagacattgataaagccataacaatctcctctgct ctgcaggcaggaacagtgtgggtgaattgctatggcgtgctgctgacttgggctaagact accccagccctcagcagcactccaggccccttcctgctgaatgaaaggtcatggaacaat ccagaagagctgcatcccttactgcatgctgctagccagtgccctcacagctctcactgt gcttttcaagtcaaagcataccagggaagcagctatggcaaagaacagcagttcagtggt ggagaagaggaggctgcagaaaagtaa >gi568815589r:72801211_73053000|GENSCAN_predicted_peptide_6|185_aa MRESLEPLGNWLSDCDQNADRNMDSEDQADEVSDGNKEFIGNWSKALAQKALDTALATAS EDETIRLGSFHAMLSLQVFRLVPRQEPDAGAELPHRTSTREVLSGNGVRAPNRVLTRALP SGAVGMGPLPSRPRMGAPPAACTLSLEMLQAFTPALANSHVGCAQQSHGDRATRGLRTTF LTPEC >gi568815589r:72801211_73053000|GENSCAN_predicted_CDS_6|558_bp atgagagaaagtttggaacctcttggaaactggttaagtgattgtgaccaaaacgctgat agaaatatggacagtgaagaccaggctgatgaggtctcagatggaaataaggaatttatt ggaaactggagcaaagccttggctcaaaaggctttagacactgctctggccactgcttca gaggatgaaaccataaggcttggcagcttccatgcaatgttaagtctgcaggtattcaga ctggtgcccaggcaggaacctgatgcaggggcagagctcccacataggacttctactaga gaagtgctaagtggaaatggggttagagccccaaacagagtcctcaccagggcattgcct agtggagctgtaggaatggggccactgccctctaggcccagaatgggagcgccgccagca gcctgtactctgagcctggaaatgctgcaggcattcactccagcccttgcaaacagccat gtgggttgtgcccagcaaagccatggggacagggctaccagaggcctcaggaccacattt ctcacaccagaatgctga >gi568815589r:72801211_73053000|GENSCAN_predicted_peptide_7|217_aa MLTLSNKIQNRERFITGSRVPHGDVHFERSDSKLVLSSFHFPMGAGDLASGEMEITILKP SPFSCLLDKQAKPYSNLENSHKKIKILDQVERILIPVKSQCLSSSHLKVNNVGLDKEKCQ ALKTSVFLTGTQCFRADGPAFQALCYMIPCAHSLRLLIKNSPLSLKFTDSSWNYQTVQHI YSAMNYITEISITMPPYSVDVINRNITRHLLPLEEIA >gi568815589r:72801211_73053000|GENSCAN_predicted_CDS_7|654_bp atgctgactctctcaaacaaaatccagaaccgcgaaagatttatcactggaagcagagtt ccccacggggacgtccattttgaacgaagcgactccaaacttgttctctcgagtttccac tttcccatgggtgctggtgatctggcctctggagaaatggagatcaccatcttaaaaccc tcgcccttcagttgcctcctggacaagcaggcaaagccatactccaatttggaaaattct cacaaaaaaattaaaatactggatcaagtggaaaggatcctgattccagtcaagagtcag tgtctcagttccagccatctgaaagtcaacaatgtaggcctggacaaagagaagtgccag gctctgaagacctctgtgttcctaactggaacacagtgctttagagcagatgggcctgcc tttcaagctctgtgctacatgattccatgtgcacattctctcaggctgctcattaagaat tctcctttgtcactgaaattcactgactcctcatggaattaccagacggttcaacatatt tatagtgccatgaattacattactgaaataagtattacaatgccaccctattcggtggac gtcataaacagaaacataacaagacatctgcttcctctggaggaaattgcatga >gi568815589r:72801211_73053000|GENSCAN_predicted_peptide_8|408_aa MQTLTPQLSGPQIRHSEGDKTYPTPSYEGPHIEMRGSPPYAETRNPCPYSEDGSPWLEST NWEVGVPSGSPYQTEEQEEDEKAFQHYVKQKWQSGHSCLLSDLGGKIFSLPPRKERESKQ IVQDESDPIAKDDTGKAEAFAPNAFPLTPLTVKSPSFPDPILATKASLPLFPAPSALDQA PNSFPHRSKEGVFSPGLSLQAKFCPCLLLLLWKRLFVFLRTHTALPKSFSVFWFLHQIPH TDPQDFWAQQPTLREPHIPCPLPEFILKTPGTECASVLSKNVAEKVGKASEPENLLQLGD HPYLTLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQIINAPLEFYQALNGFIALHVQDH LATYGKDHIRDITDALINVCHNKYAATKTDTLNDSEIISTVSDLFGAX >gi568815589r:72801211_73053000|GENSCAN_predicted_CDS_8|1224_bp atgcaaacacttacacctcaactcagtggtccccaaatcagacacagtgaaggagataaa acgtatcctactccatcctatgaaggtccacacattgaaatgagaggatcccctccatat gcagagactagaaacccatgcccttattcagaggatggaagcccatggctagaatcaact aactgggaagtgggtgtcccatctggatcaccatatcagactgaagagcaagaagaggat gagaaagccttccagcactatgttaaacagaagtggcaaagtgggcattcttgtcttctt tctgatcttggaggaaaaatcttcagtcttccaccaaggaaggagagagaatcaaagcag atagtacaagatgaaagtgacccaatagcaaaagatgacactggaaaggcagaagccttt gctccaaatgccttccctctgactccactcacagtgaaatctccgagcttcccagacccg atcctggccacaaaagcctctctacccctctttcctgctcccagtgcccttgatcaggct ccaaactccttcccacacaggtcaaaggaaggggtgttctcaccaggcctttccctccag gcaaaattctgcccctgcctgctgctgctgctatggaagagactgtttgtgtttcttagg acacacacagctttgccaaaatccttctctgtcttttggttcctgcaccagattcctcac actgacccacaagatttctgggcacagcagcccacccttagggagcctcacatcccctgt cctctgcctgagtttatcctcaaaacaccaggaacagaatgtgcttcagtgctttccaag aatgttgctgagaaagttggcaaagcatctgagcctgagaatcttctccagcttggagat catccttaccttacattaatggagatgaggaagaaatatggagatgtctttctcctcaaa cttggcatggtgcctgtcttggtggtaaatggaatggaaatggtgaaacaaattataaat gctcctctggagttttaccaggccctgaatgggtttattgcactacacgtacaagatcat cttgctacctatggtaaggatcatatccgagacattactgatgctctgattaatgtatgc cacaacaaatatgctgctaccaaaacagacaccttgaatgacagtgaaatcataagcacc gtgagcgacctctttggagctgnn