GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:41:54 Sequence gi568815590r:23037873_23263935 : 226063 bp : 45.12% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 1063 1058 6 1.05 1.07 Term - 1182 1169 14 1 2 112 33 23 0.004 -2.44 1.06 Intr - 5371 5266 106 1 1 127 84 -17 0.011 1.59 1.05 Intr - 12175 11967 209 2 2 80 68 82 0.023 4.20 1.04 Intr - 31070 30879 192 0 0 53 97 92 0.330 6.06 1.03 Intr - 32996 32862 135 0 0 74 66 60 0.455 2.94 1.02 Intr - 34553 34247 307 2 1 82 97 158 0.933 12.12 1.01 Init - 34663 34643 21 1 0 82 100 5 0.903 0.81 1.00 Prom - 40474 40435 40 -3.86 2.00 Prom + 43753 43792 40 -7.46 2.01 Init + 44293 44403 111 0 0 59 42 89 0.224 -0.35 2.02 Intr + 46564 46673 110 0 2 60 109 92 0.356 7.58 2.03 Intr + 46842 46944 103 0 1 50 72 69 0.758 1.68 2.04 Term + 53416 53484 69 0 0 138 47 55 0.933 4.34 2.05 PlyA + 54226 54231 6 1.05 3.02 PlyA - 56178 56173 6 1.05 3.01 Sngl - 58092 57943 150 0 0 98 42 148 0.855 5.37 3.00 Prom - 58899 58860 40 -6.16 4.00 Prom + 65013 65052 40 -3.46 4.01 Init + 65130 65309 180 2 0 62 81 190 0.610 14.91 4.02 Intr + 73848 73953 106 2 1 134 55 55 0.980 6.59 4.03 Intr + 76785 76898 114 1 0 123 92 23 0.731 6.62 4.04 Intr + 77636 77744 109 0 1 37 94 103 0.854 5.14 4.05 Intr + 78769 79114 346 1 1 115 19 303 0.120 21.50 4.06 Intr + 85093 85198 106 0 1 42 57 113 0.691 3.39 4.07 Term + 90421 90569 149 2 2 82 40 140 0.818 6.66 4.08 PlyA + 93265 93270 6 1.05 5.10 PlyA - 97742 97737 6 1.05 5.09 Term - 100131 99998 134 1 2 86 48 147 0.998 8.75 5.08 Intr - 100388 100316 73 2 1 93 98 32 0.996 3.68 5.07 Intr - 106763 106578 186 2 0 81 105 156 0.993 16.49 5.06 Intr - 107217 107186 32 1 2 112 70 14 0.937 -0.15 5.05 Intr - 108049 107796 254 0 2 88 96 130 0.983 10.88 5.04 Intr - 109200 109089 112 1 1 60 90 93 0.995 6.14 5.03 Intr - 110679 110566 114 1 0 135 92 -18 0.771 3.72 5.02 Intr - 117107 117002 106 2 1 127 55 98 0.813 10.19 5.01 Init - 126063 125914 150 0 0 24 117 144 0.250 9.31 5.00 Prom - 126194 126155 40 0.24 6.02 PlyA - 132630 132625 6 1.05 6.01 Sngl - 136187 135864 324 2 0 65 43 167 0.807 5.90 6.00 Prom - 143032 142993 40 -0.26 7.13 PlyA - 145277 145272 6 1.05 7.12 Term - 154141 153822 320 2 2 80 48 394 0.974 29.64 7.11 Intr - 159332 159260 73 2 1 91 89 26 0.942 1.98 7.10 Intr - 161576 161394 183 2 0 82 105 128 0.997 13.88 7.09 Intr - 162045 162014 32 1 2 119 96 31 0.999 4.85 7.08 Intr - 162728 162633 96 0 0 105 96 12 0.928 3.68 7.07 Intr - 162888 162815 74 2 2 84 87 88 0.592 7.35 7.06 Intr - 164047 163936 112 2 1 35 76 101 0.514 3.04 7.05 Intr - 164889 164776 114 1 0 106 92 10 0.379 3.62 7.04 Intr - 168921 168900 22 0 1 92 77 16 0.117 -1.98 7.03 Intr - 169414 169179 236 2 2 75 98 136 0.111 10.71 7.02 Intr - 174340 174244 97 1 1 114 88 5 0.037 2.68 7.01 Init - 187015 186884 132 1 0 60 109 188 0.932 16.30 7.00 Prom - 189245 189206 40 -3.96 8.00 Prom + 207192 207231 40 -1.16 8.01 Init + 208824 209122 299 2 2 28 77 495 0.995 37.09 8.02 Intr + 211338 211509 172 0 1 104 98 173 0.992 19.85 8.03 Intr + 217375 217560 186 0 0 99 86 219 0.998 22.69 8.04 Intr + 218588 218721 134 1 2 76 78 118 0.999 9.04 8.05 Intr + 220161 220209 49 0 1 124 105 34 0.997 7.38 8.06 Intr + 220458 220577 120 2 0 82 21 195 0.965 12.89 8.07 Intr + 220860 220958 99 2 0 89 77 106 0.935 9.91 8.08 Intr + 221194 221254 61 0 1 89 95 52 0.972 4.31 8.09 Intr + 222272 222451 180 0 0 104 64 210 0.409 20.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 78769 79159 391 1 1 115 48 322 0.879 25.36 S.002 Term - 180110 179949 162 2 0 94 49 81 0.836 2.74 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:23037873_23263935|GENSCAN_predicted_peptide_1|327_aa MAAIIQKVVRKIHYMKMCSLWLGAGHQLPNSTGSGHGTVSTVPIPNLEARASLPCHFWMQ DREPELGLWSLSRGCEQPSVATRIETFPAHSSNWGTGWVPNRKTIGRVAAGASPTSTSPC SRAPSPIDHPRAEECGRTAREHGAGLAGSSTCNPGLKPTGLRDYKSVPYRHGTTGTERPG RFGGPEKARPRTQGGAGSQAWAPGPQDPCARCRRGPAVVEKGTCRFAHKQPSGGSPGETS PDSCWDPPEDCLIASYQPLNAALSLSSGWRLFCSQTKLRVGLPNNEMQVSAESALITQQD LAPQQRAAPQQKRSSPSEGLCPPDRTI >gi568815590r:23037873_23263935|GENSCAN_predicted_CDS_1|984_bp atggctgctattatacagaaggttgttaggaaaatccactacatgaagatgtgcagtctc tggctgggagcaggacaccagctgcccaacagcacaggctcagggcatggcactgtctcc actgtccccatccccaacctggaagctagagcctcgctgccctgccacttctggatgcag gacagagaaccggaactgggactgtggtctctcagtagaggctgtgaacagccatctgtg gccacacggattgagacctttcctgctcactcttcaaactggggcaccggctgggtgccc aacagaaagacaattggaagagtggcagccggagcctccccgacgagcacctccccctgc tccagggcgcccagtcccatcgaccacccaagggctgaggagtgcgggcgcacggcgcgg gagcacggcgcgggactggcaggcagctccacctgcaaccccgggctgaaacccacgggc ctgagagactataagagcgttccctaccgccatggaacaacggggacagaacgccccggc cgcttcgggggcccggaaaaggcacggcccaggacccagggaggcgcggggagccaggcc tgggccccgggtccccaagacccttgtgctcgttgtcgccgcggtcctgctgttgtagag aaagggacgtgtagatttgcacataaacagccctctggagggtcccctggggaaacctca cctgacagctgctgggatcctcctgaagactgtctcatcgcaagctaccagccgctgaac gcggcactgtccttatcttctggctggcgacttttttgttcccagaccaaactgagggtg gggctgcccaataatgagatgcaggtctcagctgagtctgctctgatcacccaacaagac ctagctccccagcagagagcggccccacaacaaaagaggtccagcccctcagagggattg tgtccacctgataggaccatctag >gi568815590r:23037873_23263935|GENSCAN_predicted_peptide_2|130_aa MGGEILPLWEALTTFQFSLLRPETGPGFDDWQLLKGPRETRLSSHRLLSPDDQVPGVIKE QPLSCVEKAMETARRCQKTRSPDNLSSERAENPPGEYGGTLQREKAKWVSCHSVTNLILL STAINSTFNT >gi568815590r:23037873_23263935|GENSCAN_predicted_CDS_2|393_bp atgggaggagaaatcctccctctgtgggaggcactgactacattccagttctccctccta cgcccagagactggaccagggtttgatgactggcaacttctcaaggggccgcgtgagact cgcttgtcctcccaccgcctgctctctcctgatgaccaggttccaggagttatcaaagaa cagcctctgagctgcgtggagaaagccatggagacagccaggcgctgccagaagacaagg tcaccagataatctgtcctcagagagggctgagaacccacccggggagtatggaggaacc ctacagagggagaaagcaaaatgggtctcttgccactcagttactaacctgattcttctc tccacggccatcaactcgacttttaatacatga >gi568815590r:23037873_23263935|GENSCAN_predicted_peptide_3|49_aa MAEGDGEAGTFFTRRQEKEKRVKEELPNPYKTIRSRENYHENSMRKPPP >gi568815590r:23037873_23263935|GENSCAN_predicted_CDS_3|150_bp atggcggaaggagatggggaagctggcaccttcttcacaaggcggcaggagaaagagaag agagtgaaggaggaactgccaaacccttataaaaccatcagaagtcgcgagaactatcat gagaacagcatgaggaaaccacccccatga >gi568815590r:23037873_23263935|GENSCAN_predicted_peptide_4|369_aa MQGVKERFLPLGNSGDRAPRPPDGRGRVRPRTQDGVGNHTMARIPKTLKFVVVIVAVLLP VLAYSATTARQEEVPQQTVAPQQQRHSFKGEECPAGSHRSEHTGACNPCTEGVDYTNASN NEPSCFPCTVCKSDQKHKSSCTMTRDTVCQCKEGTFRNENSPEMCRKCSRCPSGEVQVSN CTSWDDIQCVEEFGANATVETPAAEETMNTSPGTPAPAAEETMNTSPGTPAPAAEETMTT SPGTPAPAAEETMTTSPGTPAPAAEETMITSPGTPASSHYLSCTIPEDLLSDICVKQYSS ILTVKGNCQWTEPRTPCGTVAQRLCFYNFGQILPFNSWNPLMRQPSLMDNEILMARADAA IPGNALYGC >gi568815590r:23037873_23263935|GENSCAN_predicted_CDS_4|1110_bp atgcaaggggtgaaggagcgcttcctaccgttagggaactctggggacagagcgccccgg ccgcctgatggccgaggcagggtgcgacccaggacccaggacggcgtcgggaaccatacc atggcccggatccccaagaccctaaagttcgtcgtcgtcatcgtcgcggtcctgctgcca gtcctagcttactctgccaccactgcccggcaggaggaagttccccagcagacagtggcc ccacagcaacagaggcacagcttcaagggggaggagtgtccagcaggatctcatagatca gaacatactggagcctgtaacccgtgcacagagggtgtggattacaccaacgcttccaac aatgaaccttcttgcttcccatgtacagtttgtaaatcagatcaaaaacataaaagttcc tgcaccatgaccagagacacagtgtgtcagtgtaaagaaggcaccttccggaatgaaaac tccccagagatgtgccggaagtgtagcaggtgccctagtggggaagtccaagtcagtaat tgtacgtcctgggatgatatccagtgtgttgaagaatttggtgccaatgccactgtggaa accccagctgctgaagagacaatgaacaccagcccggggactcctgccccagctgctgaa gagacaatgaacaccagcccggggactcctgccccagctgctgaagagacaatgaccacc agcccggggactcctgccccagctgctgaagagacaatgaccaccagcccggggactcct gccccagctgctgaagagacaatgatcaccagcccggggactcctgcctcttctcattac ctctcatgcaccatccctgaggacctgctgtcagacatctgtgtaaagcagtacagctcc atcctcacagtcaaagggaactgtcaatggacagagcccagaaccccctgcggcactgtg gctcagaggctgtgcttctacaactttggacagatcctaccctttaactcctggaacccg ctcatgaggcagccgagcctcatggacaatgagatcctcatggccagagctgatgcagcg atccctgggaatgccttgtacggatgctag >gi568815590r:23037873_23263935|GENSCAN_predicted_peptide_5|386_aa MGLWGQSVPTASSARAGRYPGARTASGTRPWLLDPKILKFVVFIVAVLLPVRVDSATIPR QDEVPQQTVAPQQQRRSLKEEECPAGSHRSEYTGACNPCTEGVDYTIASNNLPSCLLCTV CKSGQTNKSSCTTTRDTVCQCEKGSFQDKNSPEMCRTCRTGCPRGMVKVSNCTPRSDIKC KNESAASSTGKTPAAEETVTTILGMLASPYHYLIIIVVLVIILAVVVVGFSCRKKFISYL KGICSGGGGGPERVHRVLFRRRSCPSRVPGAEDNARNETLSNRYLQPTQVSEQEIQGQEL AELTGVTVELPEEPQRLLEQAEAEGCQRRRLLVPVNDADSADISTLLDASATLEEGHAKE TIQDQLVGSEKLFYEEDEAGSATSCL >gi568815590r:23037873_23263935|GENSCAN_predicted_CDS_5|1161_bp atgggactttggggacaaagcgtcccgaccgcctcgagcgctcgagcagggcgctatcca ggagccaggacagcgtcgggaaccagaccatggctcctggaccccaagatccttaagttc gtcgtcttcatcgtcgcggttctgctgccggtccgggttgactctgccaccatcccccgg caggacgaagttccccagcagacagtggccccacagcaacagaggcgcagcctcaaggag gaggagtgtccagcaggatctcatagatcagaatatactggagcctgtaacccgtgcaca gagggtgtggattacaccattgcttccaacaatttgccttcttgcctgctatgtacagtt tgtaaatcaggtcaaacaaataaaagttcctgtaccacgaccagagacaccgtgtgtcag tgtgaaaaaggaagcttccaggataaaaactcccctgagatgtgccggacgtgtagaaca gggtgtcccagagggatggtcaaggtcagtaattgtacgccccggagtgacatcaagtgc aaaaatgaatcagctgccagttccactgggaaaaccccagcagcggaggagacagtgacc accatcctggggatgcttgcctctccctatcactaccttatcatcatagtggttttagtc atcattttagctgtggttgtggttggcttttcatgtcggaagaaattcatttcttacctc aaaggcatctgctcaggtggtggaggaggtcccgaacgtgtgcacagagtccttttccgg cggcgttcatgtccttcacgagttcctggggcggaggacaatgcccgcaacgagaccctg agtaacagatacttgcagcccacccaggtctctgagcaggaaatccaaggtcaggagctg gcagagctaacaggtgtgactgtagagttgccagaggagccacagcgtctgctggaacag gcagaagctgaagggtgtcagaggaggaggctgctggttccagtgaatgacgctgactcc gctgacatcagcaccttgctggatgcctcggcaacactggaagaaggacatgcaaaggaa acaattcaggaccaactggtgggctccgaaaagctcttttatgaagaagatgaggcaggc tctgctacgtcctgcctgtga >gi568815590r:23037873_23263935|GENSCAN_predicted_peptide_6|107_aa MRDLAPRASSCGGCTGSPSSASPLALCSISHRALAAFLRRRARDLQPAMPEASHPLHGFL CGPSLPDEHLPLLHGAPIPWTTQGLRSVSTRHGTGGQLHLQPQCGIH >gi568815590r:23037873_23263935|GENSCAN_predicted_CDS_6|324_bp atgagggacttagccccccgggccagcagctgcggagggtgtactgggtcccccagcagt gccagcccactggccctgtgctcgatttctcaccgagccttagctgccttcctgcggcgc agggctcgggacctgcagcccgccatgcctgaggcctcccacccactccatgggttcctg tgcggcccaagcctccccgacgagcacctccccctgctccatggcgccccaatcccatgg accacccaaggtctgaggagtgtgagcacacggcacgggactggcgggcagctccacctg cagccccagtgcgggatccactag >gi568815590r:23037873_23263935|GENSCAN_predicted_peptide_7|496_aa MGQHGPSARARAGRAPGPRPAREASPRLRVHKTFKFVVVGVLLQVVPSSAATIKLHDQSI GTQQWEHSPLGELCPPGLIRVLNLGHINIIELLHNLFDLVLVGFNIYNEHKCVVVVFYLL HGILSGQWKLDDSIVVKLVSPGGALQSLSILGHRKEFAPFIQGSHRSEHPGACNRCTEGV GYTNASNNLFACLPCTACKSDEEERSPCTTTRNTACQCKPGTFRNDNSAEMCRKCSRGCP RGMVKVKDCTPWSDIECVHKESGNGHNIWVILVVTLVVPLLLVAVLIVCCCIGSGCGGDP KCMDRVCFWRLGLLRGPGAEDNAHNEILSNADSLSTFVSEQQMESQEPADLTGVTVQSPG EAQCLLGPAEAEGSQRRRLLVPANGADPTETLMLFFDKFANIVPFDSWDQLMRQLDLTKN EIDVVRAGTAGPGDALYAMLMKWVNKTGRNASIHTLLDALERMEERHAREKIQDLLVDSG KFIYLEDGTGSAVSLE >gi568815590r:23037873_23263935|GENSCAN_predicted_CDS_7|1491_bp atgggacagcacggacccagtgcccgggcccgggcagggcgcgccccaggacccaggccg gcgcgggaagccagccctcggctccgggtccacaagaccttcaagtttgtcgtcgtcggg gtcctgctgcaggtcgtacctagctcagctgcaaccatcaaacttcatgatcaatcaatt ggcacacagcaatgggaacatagccctttgggagagttgtgtccaccaggcctaatcagg gtgttgaaccttggccacataaacatcatagagcttcttcacaacctgtttgatctggtg cttgttggttttaacatctacaatgaacacaagtgtgttgttgtcgtcttctatcttctt catggcatactcagtggtcagtggaagcttgatgatagcatagtggtcaagcttgtttct cctgggggtgctctccagagtctcagtatcttgggccaccggaaggaatttgcccctttc atccaaggatctcatagatcagaacatcctggagcctgtaaccggtgcacagagggtgtg ggttacaccaatgcttccaacaatttgtttgcttgcctcccatgtacagcttgtaaatca gatgaagaagagagaagtccctgcaccacgaccaggaacacagcatgtcagtgcaaacca ggaactttccggaatgacaattctgctgagatgtgccggaagtgcagcagagggtgcccc agagggatggtcaaggtcaaggattgtacgccctggagtgacatcgagtgtgtccacaaa gaatcaggcaatggacataatatatgggtgattttggttgtgactttggttgttccgttg ctgttggtggctgtgctgattgtctgttgttgcatcggctcaggttgtggaggggacccc aagtgcatggacagggtgtgtttctggcgcttgggtctcctacgagggcctggggctgag gacaatgctcacaacgagattctgagcaacgcagactcgctgtccactttcgtctctgag cagcaaatggaaagccaggagccggcagatttgacaggtgtcactgtacagtccccaggg gaggcacagtgtctgctgggaccggcagaagctgaagggtctcagaggaggaggctgctg gttccagcaaatggtgctgaccccactgagactctgatgctgttctttgacaagtttgca aacatcgtgccctttgactcctgggaccagctcatgaggcagctggacctcacgaaaaat gagatcgatgtggtcagagctggtacagcaggcccaggggatgccttgtatgcaatgctg atgaaatgggtcaacaaaactggacggaacgcctcgatccacaccctgctggatgccttg gagaggatggaagagagacatgcaagagagaagattcaggacctcttggtggactctgga aagttcatctacttagaagatggcacaggctctgccgtgtccttggagtga >gi568815590r:23037873_23263935|GENSCAN_predicted_peptide_8|434_aa MWSPEREAEAPAGGDPAGLLPPEWEEDEERMSFLFSAFKRSREVNSTDWDSKMGFWAPLV LSHSRRQGVVRLRLRDLQEAFQRKGSVPLGLATVLQDLLRRGELQRESDFMASVDSSWIS WGVGVFLLKPLKWTLSNMLGDNKVPAEEVLVAVELLKEKAEEVYRLYQNSPLSSHPVVAL SELSTLCANSCPDERTFYLVLLQLQKEKRVTVLEQNGEKIVKFARGPRAKVSPVNDVDVG VYQLMQSEQLLSRKVESLSQEAERCKEEARRACRAGKKQLALRSLKAKQRTEKRIEALHA KLDTVQGILDRIYASQTDQMVFNAYQAGVGALKLSMKDVTVEKAESLVDQIQELCDTQDE VSQTLAGGVTNGLDFDSEELEKELDILLQDTTKEPLDLPDNPRNRHFTNSVPNPRISDAE LEAELEKLSLSEGX >gi568815590r:23037873_23263935|GENSCAN_predicted_CDS_8|1302_bp atgtggtccccggagcgggaggccgaggccccagccgggggagacccggcgggccttctg ccccccgagtgggaggaggacgaggagcgcatgtccttcctgttctccgctttcaagagg agtcgcgaggtgaacagcaccgactgggacagcaagatgggcttctgggcgccgttggtg ctgagccacagccgccgccagggggtggtgcgcctgcgtctgcgggacttgcaggaggcc tttcagcgcaaggggagcgtcccgctggggctggccacggtgctgcaggacctgctgcgt cgaggggagctgcagcgggagtcagacttcatggccagtgtagacagcagctggatctcc tggggggttggggtcttcctgctgaagcctctcaagtggactctttctaacatgctggga gataataaggttccagctgaggaggtccttgtcgctgtggagctgttgaaggaaaaggct gaggaggtgtatcgtctgtatcagaactcgcccctctcctcccaccccgtggtggccctg tcagagctcagcaccctctgtgctaactcctgcccagatgagaggaccttctacttggtg ttgctgcagctgcagaaggagaagagggtcacagtcctcgagcagaacggggagaagatt gtgaagtttgcccgagggccacgtgccaaggtctctccagtcaatgacgtagatgttggg gtgtaccagctgatgcagagtgaacagcttctctcacgcaaagtggagtccttatcccag gaagcagagaggtgtaaagaagaagcccgccgggcatgccgagcaggaaagaagcagctg gcactgaggtctctcaaggccaagcaacggacagagaagcgcatcgaggccttgcatgcc aagctggacactgttcaaggcatcctggaccggatctatgcctcccagacagatcagatg gtttttaacgcctaccaggctggggtaggagcactcaaactctccatgaaggatgtcaca gtggagaaggcagagagcctcgtggatcagatccaagagctctgtgacacccaggatgaa gtttctcagactctggctggtggggtaacaaatggcttagattttgacagtgaagaactg gagaaggaattggacatcctccttcaggataccaccaaagaacctttggatctgcctgac aacccccgcaataggcattttaccaacagcgtgcctaaccctaggatctcagatgctgaa cttgaagctgaacttgagaaactgtccttatcagagggagnn