GENSCAN 1.0    Date run:  6-Nov-116    Time: 05:08:53

Sequence gi568815592f:41950542_42177241 : 226700 bp : 48.37% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2044   2125   82  0  1  101  115    22 0.779   7.33
 1.02 Term +  60433  60512   80  2  2   95   41    74 0.259   1.33
 1.03 PlyA +  61880  61885    6                               1.05

 2.00 Prom +  68556  68595   40                              -2.86
 2.01 Init +  75328  75348   21  0  0   74  103    10 0.008   0.80
 2.02 Intr +  82226  82354  129  1  0   52   75    49 0.045   0.89
 2.03 Term +  93554  93916  363  1  0   71   43   188 0.368   7.17
 2.04 PlyA +  94884  94889    6                               1.05

 3.00 Prom +  96449  96488   40                              -3.56
 3.01 Init + 100001 100045   45  1  0   86  103    69 0.925   8.60
 3.02 Intr + 100816 100972  157  0  1   63   68   160 0.999  11.08
 3.03 Intr + 104990 105088   99  0  0   71   92   104 0.995   9.18
 3.04 Intr + 105411 105473   63  1  0  106   99    53 0.992   6.89
 3.05 Intr + 106848 106972  125  1  2   52  117   118 0.932  11.40
 3.06 Intr + 115771 115918  148  0  1   87   47   152 0.999  10.81
 3.07 Intr + 117924 118066  143  1  2   76   64   142 0.937  10.67
 3.08 Intr + 126559 126698  140  0  2   48   81   230 0.415  17.56
 3.09 Intr + 142372 142492  121  1  1   57   60    27 0.212  -2.70
 3.10 Term + 142659 142736   78  2  0  118   48    67 0.644   3.46
 3.11 PlyA + 143294 143299    6                               1.05

 4.08 PlyA - 144375 144370    6                               1.05
 4.07 Term - 153337 153220  118  0  1  135   52   140 0.986  13.11
 4.06 Intr - 157042 153922 3121  2  1  145   99  1566 0.796 150.78
 4.05 Intr - 159750 159675   76  0  1  122  111    62 0.907  10.99
 4.04 Intr - 167757 167595  163  2  1   12   99    39 0.038  -2.62
 4.03 Intr - 172471 172362  110  1  2   71   81    76 0.194   4.28
 4.02 Intr - 178237 178131  107  2  2   88  105    99 0.444  11.53
 4.01 Init - 191903 191759  145  2  1   80  109   212 0.801  22.78
 4.00 Prom - 194854 194815   40                             -10.15

 5.00 Prom + 196352 196391   40                              -9.65
 5.01 Init + 196759 196761    3  0  0   37  115     0 0.292  -2.40
 5.02 Intr + 199121 199272  152  1  2  112   35   116 0.660   7.66
 5.03 Intr + 206144 206259  116  2  2   81   58    43 0.416   0.69
 5.04 Intr + 207839 207981  143  0  2  119   80    44 0.484   6.77
 5.05 Term + 212379 212708  330  2  0  129   47   373 0.998  31.96
 5.06 PlyA + 219079 219084    6                               1.05

 6.00 Prom + 220954 220993   40                              -4.96
 6.01 Init + 223073 223273  201  1  0   69  105   399 0.863  38.58

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:41950542_42177241|GENSCAN_predicted_peptide_1|53_aa
MDSDSQKKPSSKVTVSALGSEAGALSNETLQEFIQAVNRSFHKDPMKAFKSDS

>gi568815592f:41950542_42177241|GENSCAN_predicted_CDS_1|162_bp
atggactctgatagtcagaagaagccctcaagcaaagtgacagtgtcagcacttggaagt
gaggctggtgcactctcgaatgaaacacttcaggagttcatccaagcagtcaatcggtca
tttcacaaggatccaatgaaggctttcaaaagtgattcttga

>gi568815592f:41950542_42177241|GENSCAN_predicted_peptide_2|170_aa
MVPGVSMKPTHFHDVQTGRGNSQLPSSSCSSLFRAPGTLSISKPGSAAFLGKDSQEGERE
DVEEIPDLGTGVCRRGGGLHVLAGAWTRQKSTFYGGFAGSKAVLWAPVCKQLVMGWEQRL
PETSQRLKNPATLFIHPIQSPQGDTRGNSSTWWTIRDYHGPGSMGEQRMT

>gi568815592f:41950542_42177241|GENSCAN_predicted_CDS_2|513_bp
atggtacccggtgtatccatgaaacctacccatttccacgatgtgcagactggcagaggt
aacagccagctgccctcatcttcctgtagttctctctttcgagcccctggcactttgtcc
atttccaagcctggttctgcagctttcctaggaaaggacagtcaggaaggggaaagagaa
gatgtggaagaaatccctgacctggggacgggtgtatgccgcagaggcggaggcctgcac
gttctggccggagcctggacaagacagaagagcaccttttacgggggctttgctggatca
aaagcagttctctgggccccggtgtgtaaacagctggtgatgggctgggagcagaggctc
cctgaaaccagtcagcgcctgaagaacccggcaacactcttcattcacccaatccagtca
ccccaaggtgacacacggggcaacagctccacctggtggacgatcagggactaccacggg
ccagggagtatgggagagcaaagaatgacatag

>gi568815592f:41950542_42177241|GENSCAN_predicted_peptide_3|372_aa
MADAAATAGAGGSGTRSGSKQSTNPADNYHLARRRTLQVVVSSLLTEAGFESAEKASVET
LTEMLQSYISEIGRSAKSYCEHTARTQPTLSDIVVTLVEMGFNVDTLPAYAKRSQRMVIT
APPVTNQPVTPKALTAGQNRPHPPHIPSHFPEFPDPHTYIKTPTYREPVSDYQVLREKAA
SQRRDVERALTRFMAKTGETQSLFKDDVSTFPLIAARPFTIPYLTALLPSELEMQQMEET
DSSEQDEQTDTENLALHISMEDSGAEKENTSVLQQNPSLSGSRNGEENIIDNPYLRPVKK
PKIRRKKIMKSLEVCPPNLLSTPMLKYLSPPSPKSTPKQSSHWDSFRDVNEKISPDLPDI
TKYPLGSKIGPG

>gi568815592f:41950542_42177241|GENSCAN_predicted_CDS_3|1119_bp
atggccgacgcggcggccacagctggggccggtggctccggaacgagatcgggaagtaaa
cagtccactaaccctgccgataactatcatctggcccggaggagaaccctgcaggtggtt
gtgagctccttgctgacagaggcagggtttgagagtgccgagaaagcatccgtggaaacg
ctgacagagatgctgcagagctacatttcagaaattgggagaagtgccaagtcttactgt
gagcacacagccaggacccagcccacactgtccgatatcgtggtcacacttgttgagatg
ggtttcaatgtggacactctccctgcttatgcaaaacggtctcagaggatggtcatcact
gctcctccggtgaccaatcagccagtgacccccaaggccctcactgcagggcagaaccga
ccccacccgccgcacatccccagccattttcctgagttccctgatccccacacctacatc
aaaactccgacgtaccgtgagcccgtgtcagactaccaggtcctgcgggagaaggctgca
tcccagaggcgcgatgtggagcgggcacttacccgtttcatggccaagacaggcgagact
cagagtcttttcaaagatgacgtcagcacatttccattgattgctgccagacctttcacc
atcccctacctgacagctcttcttccgtctgaactggagatgcaacaaatggaagagaca
gattcctcggagcaggatgaacagacagacacagagaaccttgctcttcatatcagcatg
gaggattctggagccgagaaggagaacacctctgtcctgcagcagaacccctccttgtcg
ggtagccggaatggggaggagaacatcatcgataacccttatctgcggccggtgaagaag
cccaagatccgcaggaagaagatcatgaaatccttggaggtttgccctccaaaccttctc
tccactcccatgctcaagtatctctcaccaccttcaccaaagtccactcccaagcagtca
tcacactgggactccttcagggatgtcaatgaaaaaatatctccagatcttccagatatc
accaaatatcctctgggaagcaaaattggccctggataa

>gi568815592f:41950542_42177241|GENSCAN_predicted_peptide_4|1279_aa
MKKKQTVQGTFSKLFGKKHTTTPSTSLYATNPPWIFTQEAPEEGTGGFDGIYYGDNRFNT
VSESGTATLKARPRVRPLLTFLPLSESSHRQGDAYEELKIKCQCGKYGKTGPTGPQGCPG
RTLRTLPPTYSCLYPAAPLASLESEDNLEAIQLSVPTPCHRGVTLSVQVTAHTTQNAQEN
HGLAVPTPSVPDDFADKEVTGTSSLVNGNLRLYSSVGDLRPGQYGQDLLIPPPPPGPAPG
PPQDISEPPGGSPLPSPPSTAPPPPPLLLEPPPPPSMAPPPPPVLEALSPPHTLSSPSIP
TPPDFIPPAPPLAFLAPPPPPVPAPAPPAPASPHTVGTRLFPPGGVTKWKSDVALNGRQA
EATRASPPRSPAEPKGSALGPNPEPHLTFPRSFKVPPPTPVRTSSIPVQEAQEAPRKEEG
ATKKAPSRLPLPPSFHIRPASQVYPDRAPEPDCPGELKATAPASPRLGQSQSQADERAGT
PPPAPPLPPPAPPLPPPAPPLPPAAPPLPCAQKAAHPPAGFTKTPKSSSPALKPKPNPPS
PENTASSAPVDWRDPSQMEKLRNELAAYLCGSRREDRFLSHRPGPTVAPQSKEGKKGPRL
PEKETLLSLPAKDTPPGVPEKSLGGSSLTETEAAPSLTLPSVDYIPQDSPTPSVRQIRNE
LEARLSSAAEKEAKPSIGSLPPKPRLEGGRICENGADDDKLSKPVAKNLPPQSTTLLPTT
SLQPKAMLGPAIPPKATPEPAIPPKATLWPATPPKATLGPATPLKATSGPTTPLKATSGP
AIASTATTLPTTTSQLMAEKDSGPAGQPEKPASQEVSTPSQARGEGSPSEATRLPTQGAR
SSAAFPPKTSPGGGEVPCLYKPHCHQSSLSREVAVVMPTLARGGAAGPGEPVEVKEPPGL
PAKPPASAQPTDELLRHPVTGEVVERGSPMALLLAARQRAQKGRSVGAALGRSSLPGSLR
DHSHQAEASSDSIFHSQGTPNSFTVVPKLPKEAEKDSPLTTEIPNKWGPRLGRDAEGTEL
SRRHNWTKPEPQAPVAWERVAPSNLPQGHPLPKSFSSPPSPSNKREEEEEEFNFEVIPPP
PEFSNDPEPPAPALQYLGRQSSPPRNNYSDLRQLPNAGPGAPPALGFSRFPAGARYAGAG
GLERFSGGGRSLIKKRLYVGEPHRGPGLPHGGTGRSLSSPNCFGPQPGGPEMRRVNSAGR
APPGGLHAPRLSLEGAARGAAEAKHKAPGSADYGFAPAAGRSPYTTTRYGSPINTFTVRP
GTRHPISYVCSGAHRKATS

>gi568815592f:41950542_42177241|GENSCAN_predicted_CDS_4|3840_bp
atgaaaaagaagcagacggtgcagggcaccttcagcaaactcttcgggaagaagcacacc
acgacccccagcacctccctctacgccaccaatccgccctggatcttcacccaggaggcc
ccggaggaggggaccgggggcttcgatggcatctattatggagacaatcggtttaacaca
gtgagcgagtcaggaacagccacgctgaaagctcggccaagagtccggcccctgctgacc
ttccttccgctgtctgaatcaagccatcgccagggtgatgcatatgaggaattaaaaatt
aaatgccagtgtggcaagtatgggaaaactggcccaacaggtccccaggggtgccctggg
cgaactttgagaactctccctcccacctactcctgcctctaccctgctgcccctttggca
tctctggagtcagaggacaacctcgaggcaatacagttgtctgtcccaacaccctgtcat
cgaggggtgaccctgagtgtccaggtcactgcccacacgacgcagaatgcccaggaaaac
catgggctggctgtgcccaccccctcggttccagatgattttgcagacaaagaagtgaca
ggtaccagctcactagtcaatggcaacctccgactgtacagctctgtgggtgacctgagg
cctggacaatatggccaggatctactcatccccccacctcccccaggcccagccccaggg
ccccctcaggacatttcagaacctccaggggggtcgccactgccatctccaccttccaca
gcacccccaccacctcccctgctgctggaacccccacccccgcccagcatggccccacct
ccacccccagtattggaggccctatccccaccacacactctttcctccccatccataccc
acccctcctgacttcattccccctgccccacccttggcctttctagcccccccaccgcct
cctgtgccagccccagcacccccagctccagcatctcctcacacagtggggactcgtctc
tttccccctgggggtgtcaccaagtggaaatcagatgtagcactgaatggcaggcaggca
gaggccaccagagccagccccccgagaagccctgctgagccaaaggggagcgccctggga
cctaacccagagccccatctcaccttcccccgttctttcaaagtgcctcccccaacccca
gtcaggacttcgtccatcccagttcaggaagcacaagaggctccccgaaaggaagagggg
gccaccaagaaggctcccagccgactcccactgcctcccagcttccacatccgccccgcc
tcccaggtctacccagacagggcccccgagccagactgccctggggagctcaaggccaca
gcaccagccagcccaaggcttggccagtcccagtcccaagcagatgaacgagctgggact
ccgcctccagcccctcccctgccccctcctgcaccccccctccctcccccagcaccccca
cttcccccagctgcacctcctttgccctgtgctcagaaggcagcccatccacctgctggg
tttacaaaaacccctaaatccagctctcctgctctcaaacccaaacccaacccccccagc
ccagagaacacagcgtcttcagcacctgtggactggagggaccccagccagatggaaaag
ctgcggaacgagctggcagcctatctctgtggctccaggagagaggaccgattcctcagt
cacaggccaggcccaacagtggcccctcagagcaaggagggcaagaagggcccccgcctg
cctgagaaggagactctcctgagcctgccagcaaaggacactcccccaggtgttcctgaa
aagagtcttggcggcagcagcctgacagagacagaggctgcccccagcctgaccctgccc
tctgtggactacattccccaagactctccaactcccagtgtgcggcagatccggaatgag
ctggaggcccggctctcctcagcagcagagaaggaggctaagcccagcataggatctctg
ccccctaagcctcggctagaagggggaagaatttgtgaaaacggggctgatgatgacaaa
ctctccaagcctgtggccaagaatctgccacctcaatccaccaccctgctgccaactaca
tcactccagcccaaggctatgttgggaccagccataccacccaaggccacacctgagcca
gccataccacccaaggctacactttggccagccacaccacccaaggccacacttgggcca
gccacaccactcaaggccacatctgggccaaccacaccactcaaggccacatctggccct
gccatagcatctacagccacaactctgcccaccaccacatcccaactgatggcagagaag
gactcaggcccagctggccagccagagaagccagcatctcaagaagtttccactccctcc
caggcaaggggagaggggtccccctcagaggccactaggctgcccacacagggagcccgc
tcatctgcagccttcccaccaaagacatctcctggtggaggagaggtgccatgtctctac
aagccccactgccaccagagcagcctcagccgtgaggttgctgtggtgatgcccaccctg
gccagaggaggggctgcagggccaggggagcccgtggaggtgaaggagcccccagggctg
ccagccaagcctcctgcctcggcccagcccactgatgaactcctcaggcacccggtgact
ggggaggtggtggagcggggctcgccgatggccctgctcctggcggccaggcagagggct
cagaagggaaggtctgtaggggctgccctgggtcggtcctctctgccaggaagtctccgt
gaccacagccaccaagccgaggccagctctgacagcatcttccacagccagggcacgccc
aactccttcactgtggtgcccaagttacccaaggaggctgagaaggactccccgctgacg
accgaaatacccaataagtgggggccgcggctgggaagagacgcagagggcacagagctg
agccgcaggcacaactggacaaagccagagccccaggcccctgtggcctgggaaagagta
gctccctccaacctcccccagggccacccgctgcccaagtccttctcctccccaccttct
ccttcgaacaagagggaggaggaggaggaggagttcaacttcgaggtcatcccaccgccg
ccagagttcagcaatgaccctgagcccccggccccggccctccagtatctgggccgccag
agctcccctccccggaacaactactcagacttgaggcagctcccgaacgctggccccggg
gcgcccccggctctcggcttctcgcgctttcccgcgggcgcgcgctacgccggggctggg
ggcctggagcgcttctcgggagggggccgctcgctcataaagaagcgcctgtacgtcggg
gagccgcaccgaggcccagggctaccccacggtggcaccggccgcagcctgagctctccc
aactgcttcgggccgcagcccggaggccccgagatgcggcgcgtgaactcggcgggtcgc
gcgccccccggaggcctgcacgcgccgaggctgtccctggagggcgccgcccggggcgcc
gcggaggccaagcacaaagcgcccggcagcgccgactacggcttcgccccagctgccggc
aggtctccctacaccaccacccgctatggaagccccatcaacacgttcaccgtgaggcct
gggacccgccatcccatctcctatgtctgctcaggggcccatcggaaagccacctcctga

>gi568815592f:41950542_42177241|GENSCAN_predicted_peptide_5|247_aa
MVAKAPYPALGMEMLNREQQKHNRDHERLQKSHVNSTKKLKKENSTEIRETGTHGTPPQA
GELDSLIIPILQMRPLKLKEAQPLAQGRLTYCLCPSFCEGFIPGCSGDPGCPLPLLPPES
CMLQQQHSKHSRGVDPGQDSQKPSVPSHGPKTPSCKGVKAPHSSRPRAWKQDLEQSLAAA
YVPVVVDSKGQNPDKLRFNFYTSQYSNSLNPFYTLQKPTCGYLYRRDTDHTRKRFDVPPA
NLVLWRS

>gi568815592f:41950542_42177241|GENSCAN_predicted_CDS_5|744_bp
atggtagccaaagctccctaccctgctttgggcatggagatgctaaacagagaacagcaa
aaacacaacagagaccatgagaggctgcagaagagccatgtcaattcaacaaagaaactc
aagaaggaaaattccactgagatccgggagacaggcactcatggcaccccaccacaagct
ggtgaattagatagccttattatccccattttgcagatgaggcccctgaagctcaaagag
gctcagccacttgcccaaggtcgcctgacctactgtctgtgtcccagtttttgtgagggc
ttcatccctggatgttccggggaccctggctgtcccctacccctgctgcccccagagtcc
tgtatgttgcagcagcagcattctaaacacagccgaggtgtggacccagggcaggactca
cagaaaccctctgtacccagtcatgggccaaagacaccgtcatgcaagggggtgaaggct
ccacactcgtcccggccccgggcgtggaagcaggacctcgagcagtctctggcagcagcc
tatgtgccggtcgttgtggactctaaggggcagaatccggacaagctcaggttcaatttc
tacacctcccagtactccaactccctgaaccccttctacactttgcagaagcctacctgt
ggctacctgtaccgccgggacactgaccacacccgcaagcgctttgatgtgcctcctgcc
aacttggtcttgtggcgctcctag

>gi568815592f:41950542_42177241|GENSCAN_predicted_peptide_6|67_aa
MGNVMEGKSVEELSSTECHQWYKKFMTECPSGQLTLYEFRQFFGLKNLSPSASQYVEQMF
ETFDFNK

>gi568815592f:41950542_42177241|GENSCAN_predicted_CDS_6|201_bp
atgggcaacgtgatggagggaaagtcagtggaggagctgagcagcaccgagtgccaccag
tggtacaagaagttcatgactgagtgcccctctggccaactcaccctctatgagttccgc
cagttcttcggcctcaagaacctgagcccgtcggccagccagtacgtggaacagatgttt
gagacttttgacttcaacaag