GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:43:03 Sequence gi568815596r:229937383_230159352 : 221970 bp : 42.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 5025 5064 40 -3.65 1.01 Init + 5637 5812 176 2 2 53 35 116 0.227 1.67 1.02 Intr + 7708 7783 76 1 1 76 88 73 0.652 4.70 1.03 Intr + 19787 19909 123 1 0 38 24 127 0.003 1.16 1.04 Intr + 38859 38967 109 2 1 82 116 20 0.047 3.14 1.05 Intr + 59369 59541 173 0 2 100 92 102 0.993 10.54 1.06 Intr + 66632 66706 75 1 0 121 76 32 0.934 4.09 1.07 Term + 73314 73502 189 2 0 121 48 202 0.995 15.97 1.08 PlyA + 73811 73816 6 1.05 2.03 PlyA - 73838 73833 6 1.05 2.02 Term - 84047 83948 100 1 1 43 47 104 0.404 -1.58 2.01 Init - 86394 85970 425 0 2 68 75 289 0.670 21.46 2.00 Prom - 101157 101118 40 -5.75 3.08 PlyA - 101741 101736 6 1.05 3.07 Term - 102769 102587 183 1 0 80 48 70 0.315 -1.24 3.06 Intr - 106188 106046 143 1 2 79 84 24 0.428 0.25 3.05 Intr - 109340 108363 978 0 0 77 97 751 0.034 64.87 3.04 Intr - 112522 112379 144 2 0 69 107 138 0.110 13.13 3.03 Intr - 121984 121712 273 2 0 58 71 238 0.117 15.69 3.02 Intr - 131662 131475 188 0 2 4 66 133 0.478 1.11 3.01 Init - 132413 132280 134 2 2 48 32 187 0.466 8.76 3.00 Prom - 135830 135791 40 -6.45 4.00 Prom + 138045 138084 40 -7.15 4.01 Sngl + 147329 147619 291 1 0 76 34 243 0.819 13.14 4.02 PlyA + 147730 147735 6 1.05 5.00 Prom + 152590 152629 40 -1.85 5.01 Init + 155905 156220 316 0 1 80 61 127 0.082 6.74 5.02 Intr + 167611 167820 210 2 0 12 59 137 0.017 1.16 5.03 Intr + 171537 171810 274 1 1 42 95 170 0.023 8.77 5.04 Term + 174251 174359 109 2 1 52 49 147 0.607 4.20 5.05 PlyA + 175129 175134 6 1.05 6.03 PlyA - 175701 175696 6 1.05 6.02 Term - 178284 177363 922 2 1 31 44 308 0.201 11.37 6.01 Init - 179847 179624 224 0 2 88 72 172 0.197 13.68 6.00 Prom - 182079 182040 40 -6.85 7.00 Prom + 183045 183084 40 -6.15 7.01 Init + 184457 184865 409 1 1 60 61 242 0.025 15.46 7.02 Term + 187789 187934 146 2 2 76 41 135 0.225 4.69 7.03 PlyA + 188310 188315 6 1.05 8.02 PlyA - 189811 189806 6 1.05 8.01 Sngl - 194261 194061 201 2 0 55 54 266 0.449 14.83 8.00 Prom - 195231 195192 40 -5.55 9.00 Prom + 195698 195737 40 -8.15 9.01 Init + 196761 196903 143 2 2 76 42 110 0.333 4.85 9.02 Intr + 198339 198432 94 0 1 68 111 25 0.258 1.85 9.03 Intr + 202686 202728 43 2 1 121 87 41 0.428 4.19 9.04 Term + 203599 203987 389 2 2 48 48 172 0.448 3.32 9.05 PlyA + 204866 204871 6 1.05 10.00 Prom + 206427 206466 40 -3.95 10.01 Init + 210331 210441 111 0 0 80 82 76 0.278 6.46 10.02 Intr + 216972 217166 195 2 0 64 103 62 0.040 3.99 10.03 Term + 219107 219262 156 1 0 72 50 112 0.022 2.75 10.04 PlyA + 219936 219941 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 182999 183087 89 2 2 36 97 89 0.884 3.30 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:229937383_230159352|GENSCAN_predicted_peptide_1|306_aa MQRWVHSPIQANEMLKKGALPLSLNLVVVMLTRVQQSCCQHQRQPTYRCSQCGGTGANRC EAVNNIGPQIGSRVGEGRQEDSSEIGTEKGIDAGIPQQEERKPHSKEGNVTYAERLKEAQ KSNYEVIFRWWKISLRSEYRSTKPGEAKETHEDFLENSHLQGQTALIFGARILDYVINLC KGKFDFLERLSDDLLLTIISYLDLEDIARLCQTSHRFAKNEVHSRSQHSHGALALPPATL CPSQLCMSDKLWEQIVQSTCDTITPDVRALAEDTGWRQLFFTNKLQLQRQLRKRKQKYGN LREKQP >gi568815596r:229937383_230159352|GENSCAN_predicted_CDS_1|921_bp atgcagagatgggtccactccccaatccaagccaatgagatgcttaagaaaggcgcactt cctttatccctgaacttggttgtagtgatgttaaccagggtccagcagtcatgctgtcag caccagagacaacctacctacagatgcagccaatgtggagggacaggagccaatagatgt gaagcggtaaacaatattgggccacagattgggagtagagtgggggaaggaagacaggaa gacagcagtgagataggcactgaaaagggcatagatgcaggcatcccgcagcaggaagaa aggaagccacacagtaaagagggaaatgttacatatgcagagagactgaaggaggcacag aaatcaaactatgaagtaatctttagatggtggaagatctctctaaggagtgagtatcga tcaacaaaacctggagaagcaaaagaaacccatgaagacttcctagagaattcacatctt caaggtcaaactgccttaatatttggtgcaagaatattagactatgtcatcaatttgtgc aaaggtaaatttgacttccttgaacggctctcagacgatttgctcctgactatcatttct tatctggatcttgaagatattgccaggctttgtcaaacatcacacagatttgcaaagaat gaggtccacagtcgtagccagcacagccacggggcccttgccctgcctcctgccaccctc tgccccagccagctgtgcatgtctgataaactgtgggaacagatagtccagtcgacctgc gacaccatcactcctgacgtgagggccctggcggaggacacaggctggagacagctgttc ttcaccaacaagctccagctccagcggcagctccgcaagaggaaacaaaaatatggaaac ctgagagaaaagcaaccttag >gi568815596r:229937383_230159352|GENSCAN_predicted_peptide_2|174_aa MVLRINSTQRSISESPRKRCHLPTPFSPSPTTPNQGSPMRTGLGLSASMAFPSVERSASR HKADTETPPGESPLEVCPISQGCVKGGRVTELSPGQHSLDGLGFTSSPLGPTLLVSKGTA AGLSEVETGEESFKGGRKRVERLQGKSVSRVERFTNPGIQRGQVVKYSQGGAAG >gi568815596r:229937383_230159352|GENSCAN_predicted_CDS_2|525_bp atggtgctgagaataaactctactcagaggagcatatcagaatctccaaggaagaggtgt caccttcctactccttttagcccctcccctactacccccaaccaaggaagccccatgaga acaggactaggtctgtctgcttccatggcattcccaagcgtggagcgcagtgcctccagg cacaaggcagacactgaaacaccacctggagaaagtcctctggaagtgtgccccatttcc caagggtgcgtgaaaggaggaagagtcactgagctctctccagggcagcattccttggat ggccttggattcaccagcagccctttaggacctactctgctggtcagtaaaggaacagct gcaggactttcagaggtggaaacaggagaagaatccttcaaaggtggaagaaaaagagta gagaggttacagggcaagagcgtttcacgcgttgagcgtttcactaatcctggaattcag agagggcaggtagtcaagtacagtcagggaggggctgctggatga >gi568815596r:229937383_230159352|GENSCAN_predicted_peptide_3|680_aa MERRQNDAAVLALKMEEEPPAKECGRPAEAGGSPLQPREGTQPSRTFPPVINVQRLSGTS AKAINFNRPQECACEVTAFFFPAKPELAGIGMNEHEDLKLLRLEGSPEKSWKMYTSHEDI GYDFEDGPKDKKTLKPHPNIDGGWAWMMVLSSFFVHILIMGSQMALGVLNVEWLEEFHQS RGLTAWVSSLSMGITLIVGPFIGLFINTCGCRQTAIIGGLVNSLGWVLSAYAANVHYLFI TFGVAAGLGSGMAYLPAVVMVGRYFQKRRALAQGLSTTGTGFGTFLMTVLLKYLCAEYGW RNAMLIQGAVSLNLCVCGALMRPLSPGKNPNDPGEKDVRGLPAHSTESVKSTGQQGRTEE KDGGLGNEETLCDLQAQECPDQAGHRKNMCALRILKTVSWLTMRVRKGFEDWYSGYFGTA SLFTNRMFVAFIFWALFAYSSFVIPFIHLPEIVNLYNLSEQNDVFPLTSIIAIVHIFGKV ILGVIADLPCISVWNVFLLANFTLVLSIFILPLMHTYAGLAVICALIGFSSGYFSLMPVV TEDLVGIEHLANAYGIIICANGISALLGPPFAATPKHPPVWALQALYSNTFWQEYLSPVG EGIPWVQDKALPAPFFPSLSSKDEGWDGEKGILGCSEGDSIYYYPTQKGEQDMFRQGAKR NVFEMLTNSAWLVLMSGGEM >gi568815596r:229937383_230159352|GENSCAN_predicted_CDS_3|2043_bp atggaaagaagacaaaatgacgcagcagtgctggctttgaagatggaggaagagccacca gccaaggaatgtgggcggcctgcagaagctggtggctctcccctacagccacgagaagga acacagccttcaaggacctttccgccagtgataaatgtgcagcgccttagcggaacctca gccaaagctataaacttcaaccggccacaagagtgtgcatgtgaggtgactgcatttttt ttccctgccaaaccagaattagccggtataggaatgaacgagcatgaagatttgaaattg ctccgattggaaggaagcccagaaaaatcttggaaaatgtataccagtcatgaagatatt gggtatgattttgaagatggccccaaagacaaaaagacactgaagccccacccaaacatt gatggcggatgggcttggatgatggtgctctcctctttctttgtgcacatcctcatcatg ggctcccagatggccctgggtgtcctcaacgtggaatggctggaagaattccaccagagc cgcggcctgaccgcctgggtcagctccctcagcatgggcatcaccttgatagtgggccct ttcatcggcttgttcattaacacctgtgggtgccgccagactgcgatcattggagggctc gtcaactccctgggctgggtgttgagtgcctatgctgcaaacgtgcattatctcttcatt acttttggagtcgcagctggcctgggcagcgggatggcctacctgccagcggtggtcatg gtgggcaggtatttccagaagagacgcgccctcgcccagggcctcagcaccacggggacc ggattcggtacgttcctaatgactgtgctgctgaagtacctgtgcgcagagtacggctgg aggaatgccatgttgatccaaggtgccgtttccctaaacctgtgtgtttgtggggcgctc atgaggcccctctctcctggtaaaaacccaaacgacccaggagagaaagatgtgcgtggc ctgccagcgcactccacagaatctgtgaagtcaactggacagcagggaagaacagaagag aaggatggtgggctcgggaacgaggagaccctctgcgacctgcaagcccaggagtgcccc gatcaggccgggcacaggaagaacatgtgtgccctccggattctgaagactgtcagctgg ctcaccatgagagtcaggaagggcttcgaggactggtattcgggctactttgggacagcc tctctatttacaaatcgaatgtttgtagcctttattttctgggctttgtttgcatacagc agctttgtcatccccttcattcacctcccagaaatcgtcaatttgtataacttatcggag caaaacgacgttttccctctgacgtcaattatagcaatagttcacatctttggaaaagtg atcctgggcgtcatagccgacttgccttgcattagtgtttggaatgtcttcctgttggcc aacttcacccttgtcctcagtatttttattctgccgttgatgcacacgtacgctggcctg gcggtcatctgtgcgctgatagggttttccagtggttatttctccctaatgcccgtagtg actgaagacttggttggcattgaacacctggccaatgcctacggcatcatcatctgtgct aatggcatctctgcattgctgggaccaccttttgcagccacccccaagcaccctcctgtc tgggccctccaggccctttattctaataccttctggcaggaatatctctccccagtgggt gaaggaatcccatgggtacaggacaaagccctccctgcccccttttttcccagtctttct agtaaagatgaaggatgggatggggaaaaggggatcctagggtgctctgagggtgacagt atttattactatcccacccagaagggggaacaggacatgttcagacaaggagccaagaga aatgtatttgagatgctaacaaattcagcatggcttgtgctgatgagcgggggagagatg tag >gi568815596r:229937383_230159352|GENSCAN_predicted_peptide_4|96_aa MDALHPTKAQMLACPQSLISCGTEKQEQRRQKVNTMMERPLDRFLGVWWMTEATGAGQLH GSVCRVTLQGRGCTTWEEPPNIHGIWEQKSQVSDHG >gi568815596r:229937383_230159352|GENSCAN_predicted_CDS_4|291_bp atggacgcactacaccccacaaaagcccagatgcttgcatgtccccagtctctcatttcg tgtggcacagagaaacaggaacagagaaggcaaaaggtgaataccatgatggagaggcca ctggacagatttcttggagtctggtggatgactgaggctacgggagctggacagctgcat ggcagtgtctgccgggtgacgttgcagggaagaggctgcacaacttgggaagagcctcct aacatccatggaatttgggaacaaaaatctcaagtttctgaccatggataa >gi568815596r:229937383_230159352|GENSCAN_predicted_peptide_5|302_aa MDLNVKSKTIKTLDDNLGNTILDIGTGKDFMMEMLKVISIKAKIDEWDLIKLKRFCTAKE TINRVNRQPTEWEKIFANYAFDKGLISSIYEELKFTRKKQTTPLKSLRDPGKPLMLVQES KSRRTWSLIFEDRKHPAWGKDEGLKTQQVKSFPHSSAWFILATLAADYTVTTQIEGPEML SKSLGLDSETPRAFLLLDPTVAKLVPKVQDKLPFTFPPAFLKQEEIFAVVTTAGNVLGHT CSQHVSEAKAHSVTGDYPVPKGSLFSRKLLQPFQPSAAAILISQQPSTSRQEPPSAKKVM TH >gi568815596r:229937383_230159352|GENSCAN_predicted_CDS_5|909_bp atggatttaaatgtaaaatccaaaactataaaaaccctggacgacaacctaggcaatacc atcctggacataggaacaggcaaagatttcatgatggagatgctcaaagtaatcagtata aaagcaaaaattgatgaatgggacctaattaaacttaagagattctgcacagcaaaagaa actatcaacagagtaaacaggcaacctacagaatgggagaaaatatttgcaaactatgca tttgacaaaggtctaatatccagcatttatgaggaacttaaatttacaaggaaaaaacaa acaaccccattaaaaagcctgagagaccctggcaaaccactgatgttagtccaagagtcc aaaagccgaagaacttggagtctgatatttgaggacaggaagcatccagcatggggaaaa gatgaaggcttgaagactcagcaagtcaagtcttttccacactcttctgcctggtttatt ctagccactctggcagctgattacacggtgaccacccagattgagggtccagaaatgctg tccaagagcctaggcctggactcagagaccccaagagccttcttgttgcttgaccccact gtggccaagctggtacctaaggtgcaagacaaactcccctttacttttccccctgctttt ctcaaacaggaggaaatttttgctgtagtcaccacagctgggaatgtgctgggtcacact tgtagtcagcacgtctcagaggccaaggcccacagtgttactggtgattatccagtgccc aagggttctttattcagcaggaaattgctacagccattccaaccttcagcagccgccatc ttgatcagtcagcagccatcaacatcaaggcaagaacctccttcagcaaaaaaggttatg actcactga >gi568815596r:229937383_230159352|GENSCAN_predicted_peptide_6|381_aa MGRDQSRKAENSKNHSTSSPPKDRSSSPAREQNWKENEFDELTEVGFRRSVTTNFSELKE HILTHRKKAKNIEKRSTRQKVDKDIQDLNSALDQADLIHIYRTLHPTSTEYTFFSAPHHT YSKIDHIIGSKTLLSKCKRTEITTNCLSDHSAIKLELRIKKLTQNHTTTWKLNNLLLNDY WVNNEMKAEINMFFETNENKDTTYQNLWDMFKAVCRGKFIALNAHKRKQERSKIDTLTSQ LKELEKQEQTNSKASRSQEITKIRTELKEIETQKTLQNINESRSRFFEKINKIDRLLARL IKKKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYAKKLENLEEMHKFLDKYTLPR LNQEKVESLNRPITTEAIINY >gi568815596r:229937383_230159352|GENSCAN_predicted_CDS_6|1146_bp atggggagagaccagagcagaaaagctgaaaattccaaaaaccacagcacttcttctcct ccaaaggatcgcagctcctcgccagcaagggaacaaaactggaaggagaatgagtttgac gagttgacagaagtaggcttcagaaggtcggtaacaacaaacttctccgagctaaaggag cacattttaacccatcgcaaaaaagctaaaaacattgaaaaaagatcaacaagacagaag gttgacaaggatatccaggacttgaactcagctctggaccaagcagacctaatacacatc tacagaaccctacaccccacatcaacagaatatacattcttctcagcaccacatcacact tattctaaaattgaccacataattggaagtaaaacactcctcagcaaatgtaaaagaaca gaaatcacaacaaactgtctctcagaccacagtgcaatcaaattagagctcaggattaag aaactcactcaaaaccacacaactacatggaaactgaacaacctgctcctgaatgactac tgggtaaataacgaaatgaaggcagaaataaacatgttctttgaaacaaatgagaacaaa gacacaacgtaccagaatctctgggacatgtttaaagcagtgtgtagagggaaatttata gcactaaatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaa ttaaaagaactagagaagcaagagcaaacaaattcaaaagctagcaggagtcaagaaata actaagatcagaacagaactgaaggagatagagacacaaaaaacccttcaaaatataaat gaatccaggagccggttttttgaaaagatcaacaaaattgatagactgctagcaagacta ataaagaagaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggatatc accaccgatcccacagaaatacaaactaccatcagagaatactataaacacctctacgca aagaaactagaaaatctagaagaaatgcataaattcctggacaaatacaccctcccaaga ctaaaccaggaaaaagttgaatctctgaatagaccaataacaactgaggcaataattaat tattga >gi568815596r:229937383_230159352|GENSCAN_predicted_peptide_7|184_aa MLPAGDTTTIPLKWKLRWLPRHFGLLLPLSQQAKKGVTVLAGVIDLDYQDEISLLLHNRG KEEYAWNTGDPLGHLLVLLCPVIKVNGKRQQPNPGRTTIGPNPSGMKVWVTPPGKKPQPT EVLAEGKGNTEWVVEEEGPLYAEVEALASGSSTSSWTEDAEHFFRKWSPQWSPAFSDFLQ LSTQ >gi568815596r:229937383_230159352|GENSCAN_predicted_CDS_7|555_bp atgctgccagcaggagacacaacaacgattccattaaagtggaagttaagatggctacct cgacactttgggctcctcctacctttaagtcaacaggctaagaagggagttacagtgtta gctggggtgattgacctggactatcaagatgaaatcagtctactactccacaacagaggt aaggaagaatatgcatggaatacaggagatccattagggcatctcttagtattactatgc cctgtgattaaggtcaatgggaaacgacaacagcccaatccaggaaggactacaattggc ccaaacccctcaggaatgaaggtttgggtcactccaccaggaaaaaaaccacaacccact gaggtgcttgctgaaggcaaagggaatacagaatgggtagtagaagaagaaggacccctg tatgcagaagtagaggcgctggcctcaggctcctccacttcctcttggacagaagacgct gagcattttttcaggaagtggtcaccccagtggtccccggctttctcagacttcctgcag ctctctacacagtga >gi568815596r:229937383_230159352|GENSCAN_predicted_peptide_8|66_aa MTGVLVSLQEEHEEIRTEKYTRNAYTEREHHAEREQGDSRLQAKKRSLREKQPCRSLDLG LLALEL >gi568815596r:229937383_230159352|GENSCAN_predicted_CDS_8|201_bp atgactggtgtccttgtgtccttacaagaagaacatgaggagattaggacagagaaatac accaggaatgcatacactgagagagaacaccatgcagagagggagcaaggggacagccgt ctgcaagccaagaaaaggagcctcagggaaaagcaaccctgccggagccttgatcttgga cttctggcccttgaactgtga >gi568815596r:229937383_230159352|GENSCAN_predicted_peptide_9|222_aa MTQGACTVAEPPEPSFHTDVEIKTNQMRTSKGYVFRAWILAETQRQAEMSSLTVSDHGDL IPFISEQLSHSTLDFLILLVKKILASSLRVRHDSASAPAMTKGGQSTAWAMASEGAIPRS WWLPRRVGPAGVPKSRGEVWEPLPRFQMMYGNAWMSRHKSAAGAEPSQRTSTRAVQRGNV GLEPPHRVLTGALPSGAVKRGPPSFRPQNGRSTDTLYCAPTS >gi568815596r:229937383_230159352|GENSCAN_predicted_CDS_9|669_bp atgactcaaggtgcatgtactgttgcagagccaccagagccatcttttcatactgatgta gaaataaagaccaaccagatgagaacaagcaaaggctatgtattcagagcttggattttg gcagagacccaaaggcaggcagagatgtcttctctgactgtatcagaccatggtgatctg attccttttatttctgaacaactctcccatagcacacttgactttcttattcttttggta aagaagattcttgcttcctctttacgcgtccgccatgattctgcttcagctccagccatg actaaagggggccaaagtacagcttgggccatggcttcagagggtgcaatccctaggtct tggtggcttccacgtcgtgttgggcctgcaggtgtgccaaagtcaagaggtgaggtttgg gaacctctgcctaggtttcagatgatgtatggaaatgcctggatgtccagacataagtct gctgcaggggcagagccctcacagagaacctctactagggcagtgcagaggggaaatgtg gggttggaacccccacacagagtcctcactggggcactgcctagtggagctgtgaaaaga gggccaccatccttcagacctcagaatggcagatccaccgacaccttgtactgtgcacct acaagttga >gi568815596r:229937383_230159352|GENSCAN_predicted_peptide_10|153_aa MACASMTSDIIPGSILGQEQSPDVSVSSHLQTIRGRNHCRWHGTPGSAGYGCSATLKVKV EGSSKKKKTLVPDPVLGVGVGISLGAFKQQHIEEGQNSLARQLQSLLSSPPSPRSTESGI SSLGLSTLLTASLREQRPQLQSRRDCGPRGAAG >gi568815596r:229937383_230159352|GENSCAN_predicted_CDS_10|462_bp atggcctgtgcgtctatgacctcagacattattccagggagcatcttagggcaagagcaa tcccctgatgtttcagtcagcagccatttacaaaccattagaggaaggaatcattgcaga tggcacggaacccctggttcagcgggctatgggtgctctgcaacactgaaggtgaaggta gaaggaagcagcaaaaaaaaaaaaaccctggtccctgatcctgttttgggggttggggta ggcatttccttgggagcatttaaacaacagcacatcgaggagggacaaaactccttggcc agacagctccaatccctgctgtcttccccgccttccccacgaagcacagagtcaggcatt tcttccctgggcctgtccacactcctcactgcttccttgagagaacaacgaccccagctc cagtccaggagggactgtgggcctagaggggccgctggctag