GENSCAN 1.0	Date run: 5-Nov-116	Time: 05:41:38

Sequence gi568815590r:133138881_133384311 : 245431 bp : 47.04% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5343   5545  203  2  2   53  113    87 0.426   6.05
 1.02 Intr +  12183  12290  108  0  0  102  101    41 0.710   6.20
 1.03 Intr +  16308  16369   62  0  2  129   69   -17 0.039  -1.12
 1.04 Intr +  35434  35502   69  2  0  108   83    18 0.377   2.45
 1.05 Intr +  45534  45701  168  1  0   74  109    47 0.606   5.32
 1.06 Intr +  52118  52333  216  0  0   21  113   127 0.031   6.98
 1.07 Intr +  58184  58244   61  0  1   34   66    88 0.315  -1.01
 1.08 Intr +  61378  61491  114  1  0   88   43    87 0.807   3.76
 1.09 Intr +  64654  64740   87  1  0   71   99    65 0.860   4.99
 1.10 Intr +  66592  66687   96  1  0   79   86    22 0.354   0.22
 1.11 Intr +  71274  71335   62  0  2   92   94     1 0.552  -0.52
 1.12 Intr +  72975  73045   71  1  2  120   62    34 0.609   2.90
 1.13 Intr +  73984  74263  280  0  1  145  113   139 0.919  19.15
 1.14 Intr +  81701  81961  261  0  0   79   72   281 0.498  23.06
 1.15 Intr +  86510  86703  194  0  2   48   98   126 0.487   8.81
 1.16 Term +  88531  88830  300  0  0  115   48   360 0.999  29.92
 1.17 PlyA +  89498  89503    6                               1.05

 2.19 PlyA -  90116  90111    6                               1.05
 2.18 Term - 100239  99998  242  1  2  127   48   325 0.909  28.59
 2.17 Intr - 103194 103143   52  0  1  117   79    81 0.848   8.58
 2.16 Intr - 105510 105475   36  0  0   93  109    37 0.949   4.76
 2.15 Intr - 106111 105985  127  0  1  104   46    21 0.437   0.18
 2.14 Intr - 109891 109835   57  0  0   94   73   102 0.967   7.30
 2.13 Intr - 111663 111560  104  0  2   74   72   208 0.982  16.77
 2.12 Intr - 115715 115659   57  2  0  109  113    19 0.977   5.58
 2.11 Intr - 117983 117897   87  2  0   75  100    75 0.970   7.57
 2.10 Intr - 119546 119486   61  1  1   93   68    57 0.985   2.94
 2.09 Intr - 120350 120288   63  1  0  117   86    48 0.961   5.33
 2.08 Intr - 123287 123167  121  0  1   94   76   198 0.998  18.75
 2.07 Intr - 125772 125667  106  0  1  112  103   134 0.962  17.09
 2.06 Intr - 128242 128198   45  1  0   83   89    49 0.721   3.11
 2.05 Intr - 145449 145369   81  0  0  121  105    80 0.991  12.83
 2.04 Intr - 146016 145877  140  1  2  109   91    12 0.725   3.78
 2.03 Intr - 153202 153110   93  2  0  101   64    16 0.455   0.44
 2.02 Intr - 162978 162891   88  0  1   71   28   206 0.224  12.44
 2.01 Init - 163116 163108    9  0  0   88   91     6 0.457   1.12
 2.00 Prom - 168608 168569   40                              -5.46

 3.00 Prom + 168977 169016   40                              -3.76
 3.01 Sngl + 187279 187524  246  0  0   36   42   262 0.671  11.48
 3.02 PlyA + 187551 187556    6                               1.05

 4.09 PlyA - 188146 188141    6                               1.05
 4.08 Term - 195860 195808   53  0  2   59   47    67 0.035  -2.71
 4.07 Intr - 200361 200260  102  1  0   78   50    70 0.050   2.35
 4.06 Intr - 200741 200709   33  0  0   98   72    27 0.034   0.29
 4.05 Intr - 208321 208122  200  0  2   78   37   118 0.142   4.69
 4.04 Intr - 219139 219094   46  2  1   88   85    29 0.071   0.07
 4.03 Intr - 219276 219189   88  0  1  115   40    28 0.102   0.24
 4.02 Intr - 238788 238676  113  1  2  127   95     4 0.323   5.10
 4.01 Init - 243744 243600  145  0  1   70  113    25 0.621   3.48

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 159001 158863  139  1  1  100   85    89 0.904  10.13

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:133138881_133384311|GENSCAN_predicted_peptide_1|783_aa
MHLDSQVHVNLARGDFRAQSMPVSKVSPGFCRRTLVIQVRDHRPASCSFQTSPETSMSPL
EGNKLRPRKLGTRFSTLLQNITGAIYHLEHRLAISYTPVSLSGWIWSLLVGWPGNNSFAQ
AAQGDAALSPHGQNVQIIKSINSAAVREMWSCRISERTVAFPGSPSCHAMQQGFPCIFSR
GHPDAKALEEAWETSPVAKTGATDLALIKPPKGCGRGISGAPDGPASLGPAPPRGGRILW
AARSMPVPLTSRHEVVPALDAGSSDSSSRQHRPGHGERNDDDDDDGGGGGGGGACRESDL
SKATSFSMGTGAQGLSPPGKELRRLEDDPHYPIGDDTNNCSLRLLEDVDITIILQMMKQA
QSRMMQAPYDSVFAKIIFSICRTGPGTPAQVAHGGRTERCKARALPFPSSHSSERGPGLF
RDRAKGTDGIQGGNQMGQQALSPAPTTMDFTPAPLEDTSSRPQFCKWPCECPPSPPRCPL
GVSLITDGCECCKMCAQQLGDNCTEAAICDPHRGLYCDYSGDRPRYAIGVCAQVVGVGCV
LDGVRYNNGQSFQPNCKYNCTCIDGAVGCTPLCLRVRPPRLWCPHPRRVSIPGHCCEQWV
CEDDAKRPRKTAPRDTGAFDAVGEVEAWHRNCIAYTSPWSPCSTSCGLGVSTRISNVNAQ
CWPEQESRLCNLRPCDVDIHTLIKAGKKCLAVYQPEASMNFTLAGCISTRSYQPKYCGVC
MDNRCCIPYKSKTIDVSFQCPDGLGFSRQVLWINACFCNLSCRNPNDIFADLESYPDFSE
IAN

>gi568815590r:133138881_133384311|GENSCAN_predicted_CDS_1|2352_bp
atgcacctggacagccaagtacatgtaaatttggccagaggagacttcagggcccagagc
atgcccgtcagcaaagtctcccctggcttctgcagaagaactctggtcatccaggtgagg
gaccacaggccagccagctgttccttccagacgtctccagaaacctctatgagtcccttg
gaaggtaacaaactcagaccaaggaagctgggaactcgcttctcaactctcctacagaat
atcactggtgccatttaccacttagaacatcgacttgcaatcagttacacacctgtctcc
ctctctggatggatttggtccttgcttgttgggtggccaggaaataacagctttgcacag
gctgctcaaggtgatgcagcgttgagcccacatgggcagaatgttcagattattaagtca
atcaactcagcagctgttagagagatgtggtcttgccgcatctccgagaggacagtggcc
tttccaggatctcccagctgtcatgcaatgcagcaagggtttccttgtatatttagcaga
ggccatcctgacgccaaagctcttgaggaggcttgggagacctcacctgtggccaagact
ggggctacagatcttgctttaataaaaccccccaagggctgcggaagaggcatatctggt
gctcctgatgggccggccagtctgggcccagctcccccgagaggtggtcggatcctctgg
gctgctcggtcgatgcctgtgccactgacgtccaggcatgaggtggttcctgccctggac
gctggcagcagtgacagcagcagccgccagcaccgtcctggccacggagaaagaaatgat
gatgatgatgatgatggtggtggtggtggtggtggtggtgcctgtagagaaagtgacttg
tccaaggccaccagcttcagcatggggactggagcccagggcctctcccctccagggaag
gagctcaggagactggaagacgatccccactaccccatcggggatgacactaataactgc
tcattaaggctgctggaggatgttgacatcaccatcattttacaaatgatgaaacaagct
cagagcagaatgatgcaggctccatatgacagtgtgtttgcaaagatcattttctccatc
tgcagaacaggcccaggcaccccagctcaggtagctcatggtgggcgcacagagagatgc
aaggcccgggccctgccctttccaagttcacattccagtgagcgaggcccaggtctcttt
cgagatagagcaaagggcacagatgggattcaggggggcaaccagatgggacagcaggcc
ctctctccagcccctacgaccatggactttaccccagctccactggaggacacctcctca
cgcccccaattctgcaagtggccatgtgagtgcccgccatccccaccccgctgcccgctg
ggggtcagcctcatcacagatggctgtgagtgctgtaagatgtgcgctcagcagcttggg
gacaactgcacggaggctgccatctgtgacccccaccggggcctctactgtgactacagc
ggggaccgcccgaggtacgcaataggagtgtgtgcacaggtggtcggtgtgggctgcgtc
ctggatggggtgcgctacaacaacggccagtccttccagcctaactgcaagtacaactgc
acgtgcatcgacggcgcggtgggctgcacaccactgtgcctccgagtgcgccccccgcgt
ctctggtgcccccacccgcggcgcgtgagcatacctggccactgctgtgagcagtgggta
tgtgaggacgacgccaagaggccacgcaagaccgcaccccgtgacacaggagccttcgat
gctgtgggtgaggtggaggcatggcacaggaactgcatagcctacacaagcccctggagc
ccttgctccaccagctgcggcctgggggtctccactcggatctccaatgttaacgcccag
tgctggcctgagcaagagagccgcctctgcaacttgcggccatgcgatgtggacatccat
acactcattaaggcagggaagaagtgtctggctgtgtaccagccagaggcatccatgaac
ttcacacttgcgggctgcatcagcacacgctcctatcaacccaagtactgtggagtttgc
atggacaataggtgctgcatcccctacaagtctaagactatcgacgtgtccttccagtgt
cctgatgggcttggcttctcccgccaggtcctatggattaatgcctgcttctgtaacctg
agctgtaggaatcccaatgacatctttgctgacttggaatcctaccctgacttctcagaa
attgccaactag

>gi568815590r:133138881_133384311|GENSCAN_predicted_peptide_2|522_aa
MKQVCKMPGGMRQDDDDDDDDDDDGDDDSEDEDGRLLSAKQITRAAKTPLRISRKISVVL
QKRAGSRESCTVGCVCVFTRMRVPVCLCVCITNLLLGLTVRLKLALCLYVAGDSRDMSRE
MQDVDLAEVKPLVEKGEKHDGGLGLAGFGVLQEQDIETLHGSVHVTLCGTPKGNRPVILT
YHDIGMNHKTCYNPLFNYEDMQEITQHFAVCHVDAPGQQDGAASFPAGYMYPSMDQLAEM
LPGVLQQFGLKSIIGMGTGAGAYILTRFALNNPEMVEGLVLINVNPCAEGWMDWAASKIS
GWTQALPDMVVSHLFGKEEMQSNVEVVHTYRQHIVNDMNPGNLHLFINAYNSRRDLEIER
PMPGTHTVTLHLPLLPCRQRCPFPGWLHSLTACNGLLRFGDFTGTLNVHHLGRMADCGGL
PQISQPAKLAEAFKYFVQGMGYMPSASMTRLMRSRTASGSSVTSLDGTRSRSHTSEGTRS
RSHTSEGTRSRSHTSEGAHLDITPNSGAAGNSAGPKSMEVSC

>gi568815590r:133138881_133384311|GENSCAN_predicted_CDS_2|1569_bp
atgaagcaggtgtgcaagatgccaggagggatgagacaagatgatgatgatgatgatgat
gatgatgatgatggtgatgatgacagtgaggatgaggacggtcgtcttctgtctgccaaa
caaattactcgtgctgccaaaactccactgagaatctccaggaagatctcggttgtattg
cagaaaagagcaggcagccgagagtcctgcaccgtggggtgtgtatgtgtgttcacgcgc
atgcgtgtgcctgtgtgtctgtgcgtctgcatcaccaaccttctgttagggctaactgtg
aggctgaagcttgcactatgcctatacgtggcaggtgacagcagggacatgtctcgggag
atgcaggatgtagacctcgctgaggtgaagcctttggtggagaaaggggagaagcacgat
ggtggcctggggctggctggcttcggggttctccaagagcaggacatcgagactttacat
ggctctgttcacgtcacgctgtgtgggactcccaagggaaaccggcctgtcatcctcacc
taccatgacatcggcatgaaccacaaaacctgctacaaccccctcttcaactacgaggac
atgcaggagatcacccagcactttgccgtctgccacgtggacgcccctggccagcaggac
ggcgcagcctccttccccgcagggtacatgtacccctccatggatcagctggctgaaatg
cttcctggagtccttcaacagtttgggctgaaaagcattattggcatgggaacaggagca
ggcgcctacatcctaactcgatttgctctaaacaaccctgagatggtggagggccttgtc
cttatcaacgtgaacccttgtgcggaaggctggatggactgggccgcctccaagatctca
ggatggacccaagctctgccggacatggtggtgtcccacctttttgggaaggaagaaatg
cagagtaacgtggaagtggtccacacctaccgccagcacattgtgaatgacatgaacccc
ggcaacctgcacctgttcatcaatgcctacaacagccggcgcgacctggagattgagcga
ccaatgccgggaacccacacagtcaccctgcacctccccctgctcccttgccgtcagcgc
tgcccatttccaggctggctgcacagtctcacagcctgcaatggcctcttgcggttcggt
gatttcacagggactttgaacgtccaccatttaggaaggatggcggactgtggcggcctc
ccgcagatctcccagccggccaagctcgctgaggccttcaagtacttcgtgcagggcatg
ggatacatgccctcggctagcatgacccgcctgatgcggtcccgcacagcctctggttcc
agcgtcacttctctggatggcacccgcagccgctcccacaccagcgagggcacccgaagc
cgctcccacaccagcgagggcacccgcagccgctcgcacaccagcgagggggcccacctg
gacatcacccccaactcgggtgctgctgggaacagcgccgggcccaagtccatggaggtc
tcctgctag

>gi568815590r:133138881_133384311|GENSCAN_predicted_peptide_3|81_aa
MPNIGYGSNQKTKHMLPSGFWKFLVHNVKELEVLLMCNKSYCADIAHNVASKNHKANMER
AAQLAIRVTNPNARQRGKENE

>gi568815590r:133138881_133384311|GENSCAN_predicted_CDS_3|246_bp
atgcccaacattggttatgggagcaaccaaaaaacaaagcacatgctgcccagtggcttc
tggaagttcctggtccacaatgtcaaggagctggaagtgctgctgatgtgcaataaatct
tactgtgctgacatcgctcacaatgttgcctccaagaaccacaaagccaacatggaaaga
gccgcccagctagccatcagagtcaccaaccctaatgccaggcagcgtggcaaagaaaat
gagtag

>gi568815590r:133138881_133384311|GENSCAN_predicted_peptide_4|259_aa
MHSVSYVQVCGDVNTVTQAELTAHAGLCTNTFWSIIASSPLIPLITLQGFCPSLKVNGHR
AMSRWKMQVEASSLSPLVCLVETCNKIMTFPKPSKSTDKNAQCMKTVKWDPATSHGDENE
PWKSLHGALGRWAADKDIPQARAGSENDHVRAQLETNCKWLRRRQTILEETCFKTSGFSP
PGQKYKTIKLQAGDMPPAALQAMCPDGRGPKLVGPPPLGALLLSVAAGSGERAEWTLEMP
TAGHSNSISEDSVSEYEVE

>gi568815590r:133138881_133384311|GENSCAN_predicted_CDS_4|780_bp
atgcacagcgtgtcctatgttcaggtttgtggtgatgtcaacacagtaactcaggctgaa
ctcacagcccacgctgggctctgcacaaacaccttctggagcattattgcatcatcacca
ttaatccccctgataaccctacaaggtttttgtccctctcttaaagtgaatggtcacaga
gccatgagcaggtggaagatgcaggtggaggcatcttccctgtcaccgttggtatgtctg
gttgaaacctgcaacaagatcatgaccttcccaaagccatctaagagtacagataaaaat
gcacagtgcatgaaaactgtgaaatgggatcccgccaccagccatggtgatgaaaatgag
ccctggaagtctttgcatggggcgctgggcagatgggctgctgataaagacataccacag
gccagagctggctctgaaaatgaccatgtccgagcccagctggaaaccaactgcaagtgg
ttgaggaggcggcagacgattctagaggagacatgcttcaagacttctggcttctctcca
ccaggacaaaaatacaaaacaatcaagctgcaggctggagacatgcctccagcagccttg
caggctatgtgtcctgatggcagagggcccaagctggtggggccgccaccattaggagca
ttattgctcagcgttgcagcaggaagtggggagcgtgcagagtggacactggagatgcca
actgcaggccacagcaactcaatatcagaagattcagtatcagaatatgaagtggaatga