GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:05:49 Sequence gi568815594f:75456083_75664436 : 208354 bp : 38.74% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 1373 1412 40 -2.65 1.01 Init + 1989 2113 125 2 2 89 58 82 0.449 5.00 1.02 Term + 15862 16072 211 1 1 116 41 87 0.287 2.58 1.03 PlyA + 16196 16201 6 1.05 2.06 PlyA - 20536 20531 6 1.05 2.05 Term - 26584 26456 129 1 0 93 49 80 0.251 1.80 2.04 Intr - 34619 34499 121 1 1 63 76 98 0.187 5.68 2.03 Intr - 57095 57020 76 0 1 88 47 83 0.215 1.85 2.02 Intr - 57908 57788 121 2 1 61 89 51 0.718 1.65 2.01 Init - 58204 58073 132 1 0 89 47 194 0.884 15.59 2.00 Prom - 59175 59136 40 -13.11 3.00 Prom + 59488 59527 40 -4.65 3.01 Init + 59938 59945 8 0 2 92 83 10 0.967 0.73 3.02 Intr + 60690 60897 208 0 1 94 72 112 0.967 8.36 3.03 Intr + 65654 65779 126 1 0 79 78 28 0.642 0.86 3.04 Term + 70878 71132 255 2 0 104 44 195 0.998 11.30 3.05 PlyA + 72293 72298 6 1.05 4.04 PlyA - 72449 72444 6 1.05 4.03 Term - 77031 76436 596 1 2 -35 54 275 0.448 5.50 4.02 Intr - 95219 95188 32 1 2 55 103 1 0.031 -4.84 4.01 Init - 96348 96239 110 0 2 70 36 202 0.545 12.94 4.00 Prom - 96974 96935 40 -4.25 5.15 PlyA - 97685 97680 6 1.05 5.14 Term - 98165 98098 68 0 2 71 49 78 0.012 -0.68 5.13 Intr - 99631 99485 147 2 0 57 78 75 0.015 2.69 5.12 Intr - 106129 106010 120 2 0 65 65 83 0.008 3.25 5.11 Intr - 127549 127375 175 1 1 30 30 172 0.012 4.19 5.10 Intr - 135843 135737 107 1 2 63 92 69 0.290 3.81 5.09 Intr - 140258 140161 98 1 2 109 71 32 0.544 2.33 5.08 Intr - 141154 140853 302 1 2 58 80 185 0.922 9.31 5.07 Intr - 142130 141995 136 1 1 63 99 86 0.980 6.85 5.06 Intr - 144287 144199 89 2 2 82 65 54 0.977 0.35 5.05 Intr - 147874 147735 140 2 2 92 84 88 0.982 8.06 5.04 Intr - 149552 149440 113 1 2 82 82 34 0.979 1.30 5.03 Intr - 151279 151101 179 1 2 74 71 106 0.923 5.30 5.02 Intr - 158367 158173 195 0 0 60 101 49 0.755 2.09 5.01 Init - 169906 169739 168 1 0 46 113 167 0.831 14.59 5.00 Prom - 170666 170627 40 -7.65 6.04 PlyA - 171916 171911 6 1.05 6.03 Term - 174171 173956 216 0 0 54 42 115 0.576 -0.34 6.02 Intr - 174302 174223 80 0 2 104 42 90 0.901 4.35 6.01 Init - 174822 174378 445 0 1 81 84 233 0.567 16.43 6.00 Prom - 188072 188033 40 -3.35 7.11 PlyA - 188668 188663 6 1.05 7.10 Term - 189620 189348 273 2 0 55 48 369 0.999 24.09 7.09 Intr - 190374 190256 119 1 2 96 106 128 0.999 14.66 7.08 Intr - 191075 190947 129 0 0 70 80 162 0.999 13.35 7.07 Intr - 192659 192557 103 2 1 73 109 146 0.999 14.03 7.06 Intr - 199164 198984 181 2 1 43 72 294 0.992 22.25 7.05 Intr - 199788 199686 103 1 1 54 79 154 0.995 9.41 7.04 Intr - 200932 200842 91 1 1 51 100 105 0.999 6.65 7.03 Intr - 201648 201475 174 0 0 34 100 143 0.997 9.31 7.02 Intr - 202842 202761 82 2 1 94 101 94 0.995 10.02 7.01 Intr - 205967 205849 119 2 2 94 99 45 0.715 4.54 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 108032 108170 139 0 1 145 37 103 0.902 9.50 S.002 Init - 127579 127375 205 1 1 89 30 176 0.811 11.26 S.003 Init + 175053 175179 127 2 1 73 82 196 0.972 17.87 S.004 Term + 178932 179095 164 1 2 88 43 89 0.822 1.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:75456083_75664436|GENSCAN_predicted_peptide_1|111_aa MKRSMWLGEIISGREGSQVMGPDHIGPSKEGEREKEKWGEERVDPTLWGGRSSVSDYVGM FNLEQGRTNSVWPWFQATISFPALSSISDSHIYYTIDSELDSKLQKRIIPL >gi568815594f:75456083_75664436|GENSCAN_predicted_CDS_1|336_bp atgaagaggtcgatgtggctgggagaaataataagtgggagagaagggagccaggtaatg gggccagatcatatagggcctagtaaggaaggagagagggagaaagagaagtggggtgaa gaaagggtagatcctaccctgtggggaggaaggtcttcagtcagcgactatgtgggtatg ttcaatttagagcaaggaagaacaaactcagtttggccttggttccaagccacaatctct tttccagctttatcatcaatatctgactctcatatctactacacaattgattcagaactt gattcaaaactccagaagaggattatacccctttaa >gi568815594f:75456083_75664436|GENSCAN_predicted_peptide_2|192_aa MAATAREDGASGQERGQRGCEHYDRGCLLKVTPSMDSSLTSFRQQNPSWEVIRLAQWSLP VGLFVFCCRSRLRQPRACTGVFRSGHGLTNIAPLNWMKLLKNVGHPPSVVGYRCPLCMHS ALDMTRYWRQLDDEVAQTPMPSEYQNMTVDILCNDCNGRSTVQFHILGMKCKICESYNTA QAGGRRISLDQQ >gi568815594f:75456083_75664436|GENSCAN_predicted_CDS_2|579_bp atggcggcgacggcccgggaagatggcgccagcggtcaagagcgaggtcagcggggctgc gagcactatgacagaggatgtctcctaaaggtgacgccttctatggactcttccctgaca agcttccgtcagcaaaacccttcttgggaagtgattaggcttgcacagtggtctctgcca gtgggcttattcgtcttttgctgccgaagtagactgaggcagccccgggcctgtactggg gttttccgttcaggtcatggcttgacgaacattgctccactgaactggatgaagctcctt aaaaatgttggtcatccaccaagtgttgtaggctacagatgtccattatgtatgcactct gctttagatatgaccaggtattggagacagctggatgatgaagtagcacagactcctatg ccatcagaatatcagaacatgactgtggatattctctgcaatgactgtaatggacgatcc actgttcagtttcatatattaggcatgaaatgtaagatttgtgaatcctataatactgct caagctggaggacgtagaatttcactggatcagcaatga >gi568815594f:75456083_75664436|GENSCAN_predicted_peptide_3|198_aa MKEFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDVLCSRHFKKTDFDRSAPNIKLKPGV IPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNHHLVGASSCIEEFQSQFIFEHSYSV MDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQKSLRKTIRELKDECLISQETANRL DTFCWDCCQESIEQDYIS >gi568815594f:75456083_75664436|GENSCAN_predicted_CDS_3|597_bp atgaaggaattccccacagatgaaaacatcaaaaggaaatgggtattagcaatgaaaaga cttgatgtgaatgcagccggcatttgggagcctaaaaaaggagatgtgttgtgttcgagg cactttaagaagacagattttgacagaagtgctccaaatattaaactgaaacctggagtc ataccttctatctttgattctccatatcacctacaggggaaaagagaaaaacttcattgt agaaaaaacttcaccctcaaaaccgttccagccactaactacaatcaccatcttgttggt gcttcctcatgtattgaagaattccaatcccagttcatttttgaacatagctacagtgta atggacagtccaaagaaacttaagcataaattagatcatgtgatcggcgagctagaggat acaaaggaaagtctacggaatgttttagaccgagaaaaacgttttcagaaatcattgagg aagacaatcagggaattaaaggatgaatgtctgatcagccaagaaacagcaaatagactg gacactttctgttgggactgttgtcaggagagcatagaacaggactatatttcatga >gi568815594f:75456083_75664436|GENSCAN_predicted_peptide_4|245_aa MVIIADGSSMHVIAPEDLPVEQDVEVEDSDSDDPDPVATSHPASSICGKESIKVQKICSP TCNKKKIPFSEEKFKPAAETCVSNEELNVNPQDNGENVSRACQRSSRQPLPSQVWRPRRK VWFCGPGPGSPCCVQPRDLVPCVPATPAMAERSQHTPWAVASEGASLKPWQLPYDVEPLS AQKSRTEVWASPPKFQMYGNAWMPRQKFVVGPGSSWRTSARAVQKGNVELEPPHRVPTGH CLVEL >gi568815594f:75456083_75664436|GENSCAN_predicted_CDS_4|738_bp atggttatcatagcagatggcagctctatgcatgttattgcccctgaagatcttccagtg gaacaagatgtggaggtggaagacagtgacagtgatgatcctgaccccgtagcaacttcc catcctgcaagctccatttgtggaaaggagagcataaaagttcagaaaatttgcagccca acttgcaataaaaagaaaatcccattttctgaagagaaattcaagccggctgcagaaact tgcgtaagtaatgaggagctgaatgttaatccccaagacaatggggaaaatgtctccagg gcatgtcagcgatcttcaaggcagccccttccatcacaggtctggaggcctaggaggaaa gtatggttttgtgggccaggcccagggtccccgtgttgtgtgcagcccagggacttggtg ccctgtgtcccagccactccagccatggctgaaaggagccaacatacaccttgggccgtg gcttcagagggtgcaagtctcaagccttggcagcttccatatgatgttgagcctttgagt gcacagaagtcaagaactgaggtttgggcatctccgcctaaatttcagatgtatggaaat gcctggatgcccaggcagaagtttgttgtagggccagggtcctcatggagaacctctgct agggcagtgcagaagggaaatgtggagttggaacccccacacagagtccctactgggcac tgcctagtggagctgtga >gi568815594f:75456083_75664436|GENSCAN_predicted_peptide_5|678_aa MEKYENLGLVGEGSYGMVMKCRNKDTGRIVAIKKFLESDDDKMVKKIAMREIKLLKQLRH ENLVNLLEVCKKKKRWYLVFEFVDHTILDDLELFPNGLDYQVVQKYLFQIINGIGFCHSH NIIHRDIKPENILVSQSGVVKLCDFGFARTLAAPGEVYTDYVATRWYRAPELLVGDVKYG KAVDVWAIGCLVTEMFMGEPLFPGDSDIDQLYHIMMCLGNLIPRHQELFNKNPVFAGVRL PEIKEREPLERRYPKLSEVVIDLAKKCLHIDPDKRPFCAELLHHDFFQMDGFAERFSQEL QLKVQKDARNVSLSKKSQNRKKEKEKDDSLVEERKTLVVQDTNADPKIKDYKLFKIKGSK IDGEKAEKGNRASNASCLHDSRTSHNKIVPSTSLKDCSNVSVDHTRNPSVAIPPLTHNLS AVAPSINSGMGTETIPIQGYRVDEKTKKCSIPFVKPNRHSPSGIYNINVTTLVTRNSRLT KKESKILSESRIPSLAAIDLHTPSITLHQMCGLLAKCLQFHIVGAFIVSLGVAAVCKIAV AEPRKKTYADFYRNYDSVKDLEEMGKAVPHTYILFHRRTLEMSVVSPTHLLAMADWTTDR HSTHSLSAYLLKKRNKRERALPGGFLTPVVCSGELGIILRPPTRVYAASLLCFCPGVSDT GQGAGAKDKIPALMELIV >gi568815594f:75456083_75664436|GENSCAN_predicted_CDS_5|2037_bp atggaaaaatatgaaaacctgggtttggttggagaagggagttatggaatggtgatgaag tgtaggaataaagatactggaagaattgtggccataaagaagttcttagaaagtgacgat gacaaaatggttaaaaagattgcaatgcgagaaatcaagttactaaagcaacttaggcat gaaaacttggtgaatctcttggaagtgtgtaagaaaaaaaaacgatggtacctagtcttt gaatttgttgaccacacaattcttgatgacttggagctctttccaaatggactagactac caagtagttcaaaagtatttgtttcagattattaatggaattggattttgtcacagtcac aatatcatacacagagatataaagccagagaatatattagtctcccagtctggcgttgtc aagctatgcgattttggatttgcgcgaacattggcagctcctggggaggtttatactgat tatgtggcaacccgatggtacagagctccagaactattggttggtgatgtcaagtatggc aaggctgttgatgtgtgggccattggttgtctggtaactgaaatgttcatgggggaaccc ctatttcctggagattctgatattgatcagctatatcatattatgatgtgtttaggtaat ctaattccaaggcatcaggagctttttaataaaaatcctgtgtttgctggagtaaggttg cctgaaatcaaggaaagagaacctcttgaaagacgctatcctaagctctctgaagtggtg atagatttagcaaagaaatgcttacatattgaccccgacaaaagacccttctgtgctgag ctcctacaccatgatttctttcaaatggatggatttgctgagaggttttcccaagaacta cagttaaaagtacagaaagatgccagaaatgtttctttatctaaaaaatcccaaaacaga aagaaggaaaaagaaaaagatgattccttagttgaagaaagaaaaacacttgtggtacag gataccaatgctgatcccaaaattaaggattataaactatttaaaataaaaggctcaaaa attgatggagaaaaagctgaaaaaggcaatagagcttcaaatgccagctgtctccatgac agtaggacaagccacaacaaaatagtgccttcaacaagcctcaaagactgcagcaatgtc agcgtggaccacacaaggaatccaagcgtggcaattcccccacttacacacaatctttct gcagttgctcccagcattaattctggaatggggactgagactataccaattcagggttac agagtggatgagaaaactaagaagtgttctattccatttgttaaaccgaacagacattcc ccatcaggcatttataacattaatgtgaccacattagtaactcgaaattccaggctaaca aagaaagagagcaaaattctttcagaatctcgaattccttctctggctgctattgacctg cacacccccagtattacattacatcagatgtgtggtcttctggccaaatgtctgcaattt catattgttggagcctttattgtatccctgggggttgcagctgtctgtaagattgctgtg gctgaaccaagaaagaagacatatgcagatttctacagaaattatgattccgtgaaagat ttggaggagatggggaaggctgtccctcatacatatattctgtttcatcgacgaaccctg gaaatgtcagttgtcagtcctacgcaccttctggctatggctgattggaccactgataga cactcaactcattcactgtctgcttacctacttaaaaagcgaaacaagagggagagggca ttacccggaggcttcctgaccccggtggtttgcagtggagagttggggatcattcttagg cccccaaccagggtttatgctgcttctctgctctgcttttgtccaggagtgtcagacact ggtcaaggtgctggggctaaagataagattcctgctctcatggaacttatagtttag >gi568815594f:75456083_75664436|GENSCAN_predicted_peptide_6|246_aa MESRSGRAYATQGARRPPLPQRRCYEGNRIAFCIVFAVLHNRHLYPSRGHHNRLLPLGKA ERRLLTRTKACGDAGPLRVREQNLGRALGEGAGWLALSQSWLVTTRLGQSEEGRPGALGA GEKPPPLAAPRPTPSPRGSWSGAGSQAPGVGACRERILTWAVSVSQDFVPMVGDRCELAA DRGPPASLPASPWTRDCVRRRNHNSAGIASLLGSIFWTCEPYAFHLKFVSKLKTFGSFYA NGLKKR >gi568815594f:75456083_75664436|GENSCAN_predicted_CDS_6|741_bp atggaatcccggtcaggccgcgcctacgcgactcagggcgcccggcgcccgcccctgccc cagcggcgatgctatgagggaaaccgtatcgcattttgcatagtcttcgcagtcctacat aaccgccacctttacccttcgcgtgggcatcacaatcgcctcctcccgctggggaaggca gaaaggcgcctcctgacgagaaccaaggcgtgtggggacgcagggcctctgcgtgtcagg gagcagaacctgggccgagccctaggtgaaggggcggggtggttggccctgagccaatca tggctcgtgacgactcggctcggccaatcagaagaagggaggcctggcgctctcggggcg ggtgagaaaccgcccccccttgcagctccgcggccaacgccttcgcccaggggtagttgg agcggtgcaggttcccaggctccaggtgttggtgcctgccgtgaacgcattctgacctgg gccgtatctgtctcccaagactttgtgcctatggttggggacagatgtgagcttgcggcg gaccgaggcccacctgcctccctgcctgcttcgccctggactcgtgactgcgtccgcaga agaaatcacaacagcgctggaattgctagtttgctaggcagcatcttttggacctgcgaa ccatatgcatttcacctcaaatttgtttccaagttgaaaacctttgggtctttctatgcg aacggattgaagaaacggtaa >gi568815594f:75456083_75664436|GENSCAN_predicted_peptide_7|457_aa HLTLCSKEMVMEKPSPLLVGREFVRQYYTLLNKAPEYLHRFYGRNSSYVHGGVDASGKPQ EAVYGQNDIHHKVLSLNFSECHTKIRHVDAHATLSDGVVVQVMGLLSNSGQPERKFMQTF VLAPEGSVPNKFYVHNDMFRYEDEVFGDSEPELDEESEDEVEEEQEERQPSPEPVQENAN SGYYEAHPVTNGIEEPLEESSHEPEPEPESETKTEELKPQVEEKNLEELEEKSTTPPPAE PVSLPQEPPKPRVEAKPEVQSQPPRVREQRPRERPGFPPRGPRPGRGDMEQNDSDNRRII RYPDSHQLFVGNLPHDIDENELKEFFMSFGNVVELRINTKGVGGKLPNFGFVVFDDSEPV QRILIAKPIMFRGEVRLNVEEKKTRAARERETRGGGDDRRDIRRNDRGPGGPRGIVGGGM MRDRDGRGPPPRGGMAQKLGSGRGTGQMEGRFTGQRR >gi568815594f:75456083_75664436|GENSCAN_predicted_CDS_7|1374_bp catttgacattgtgcagcaaagaaatggttatggagaagcccagtccgctgcttgtaggg cgggagtttgtgaggcaatattatactttgctgaataaagctccggaatatttacacagg ttttatggcaggaattcttcctatgttcatggtggagtagatgctagtggaaagccccag gaagctgtttatggccaaaatgatatacaccacaaagtattatctctgaacttcagtgaa tgtcatactaaaattcgtcatgtggatgctcatgcaaccttgagtgatggagtagttgtc caggtcatgggtttgctgtctaacagtggacaaccagaaagaaagtttatgcaaaccttt gttctggctcctgaaggatctgttccaaataaattttatgttcacaatgatatgtttcgt tatgaagatgaagtgtttggtgattctgagcctgaacttgatgaagaatcagaagatgaa gtagaagaggaacaagaagaaagacaaccatctcctgaacctgtgcaagaaaatgctaac agtggttactatgaagctcaccctgtgactaatggcatagaggagcctttggaagaatcc tctcatgaacctgaacctgagccagaatctgaaacaaagactgaagagctgaaaccacaa gtggaggagaagaacttagaagaactagaggagaaatctactactcctcctccggcagaa cctgtttctctgccacaagaaccaccaaagccaagagtcgaagctaaaccagaagttcaa tctcagccacctcgtgtgcgtgaacaacgacctagagaacgacctggttttcctcctaga ggaccaagaccaggcagaggagatatggaacagaatgactctgacaaccgtagaataatt cgctatccagatagtcatcaactttttgttggtaacttgccacatgatattgatgaaaat gagctaaaggaattcttcatgagttttggaaacgttgtggaacttcgcatcaataccaag ggtgttgggggaaagcttccaaattttggttttgtggtttttgatgactctgaaccagtt cagagaatcttaattgcaaaaccgattatgtttcgaggggaagtacgtttaaatgtggaa gagaaaaaaacaagagctgcaagagagcgagaaaccagaggtggtggtgatgatcgcagg gatattaggcgcaatgatcgaggtcccggtggtccacgtggaattgtgggtggtggaatg atgcgtgatcgtgatggaagaggacctcctccaaggggtggcatggcacagaaacttggc tctggaagaggaaccgggcaaatggagggccgcttcacaggacagcgtcgctga