GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:58:00 Sequence gi568815585f:49360106_49591833 : 231728 bp : 39.63% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2346 2546 201 2 0 57 60 168 0.776 9.82 1.02 Term + 5289 5510 222 2 0 19 38 234 0.991 7.43 1.03 PlyA + 5848 5853 6 1.05 2.04 PlyA - 7861 7856 6 1.05 2.03 Term - 13731 13588 144 0 0 70 42 112 0.970 1.73 2.02 Intr - 17026 16862 165 1 0 64 99 149 0.930 12.84 2.01 Init - 24113 23991 123 2 0 53 74 115 0.343 6.82 2.00 Prom - 33060 33021 40 -6.75 3.00 Prom + 37133 37172 40 -3.95 3.01 Init + 47940 48000 61 2 1 38 88 87 0.963 5.16 3.02 Intr + 52764 52846 83 1 2 73 92 76 0.535 4.94 3.03 Intr + 57698 57845 148 1 1 53 58 72 0.287 -0.41 3.04 Term + 61667 61779 113 0 2 69 47 76 0.131 -0.66 3.05 PlyA + 61930 61935 6 1.05 4.05 PlyA - 62182 62177 6 1.05 4.04 Term - 63118 62997 122 2 2 86 42 115 0.579 4.36 4.03 Intr - 74118 73981 138 1 0 80 94 84 0.203 7.71 4.02 Intr - 80807 80762 46 2 1 47 89 58 0.104 -1.14 4.01 Init - 92564 92505 60 2 0 90 98 23 0.039 4.80 4.00 Prom - 93800 93761 40 -5.65 5.00 Prom + 95406 95445 40 -9.65 5.01 Init + 95759 95840 82 1 1 61 92 77 0.907 6.58 5.02 Intr + 100002 100127 126 1 0 46 86 115 0.925 6.83 5.03 Term + 103916 103962 47 0 2 94 54 36 0.416 -2.81 5.04 PlyA + 104659 104664 6 1.05 6.00 Prom + 112001 112040 40 -1.05 6.01 Init + 116492 116934 443 1 2 53 84 199 0.846 11.60 6.02 Intr + 120114 120230 117 0 0 85 82 25 0.416 0.26 6.03 Intr + 120842 121011 170 2 2 74 103 47 0.770 3.47 6.04 Intr + 122632 122857 226 2 1 99 75 112 0.917 7.12 6.05 Intr + 123359 123458 100 2 1 84 53 36 0.729 -1.11 6.06 Intr + 128187 128525 339 2 0 -11 82 291 0.029 13.34 6.07 Intr + 132247 132335 89 0 2 22 84 31 0.202 -6.05 6.08 Intr + 135794 135990 197 2 2 30 82 136 0.100 5.34 6.09 Intr + 146515 146651 137 2 2 51 111 40 0.006 1.97 6.10 Intr + 152954 153061 108 1 0 70 111 141 0.994 14.16 6.11 Intr + 155738 155768 31 1 1 81 103 10 0.708 -1.41 6.12 Intr + 163980 164111 132 1 0 78 84 75 0.544 5.80 6.13 Term + 164808 164911 104 1 2 51 54 95 0.580 -0.14 6.14 PlyA + 165604 165609 6 1.05 7.00 Prom + 168148 168187 40 -3.75 7.01 Init + 171180 171280 101 2 2 97 23 111 0.460 5.28 7.02 Term + 173933 174122 190 2 1 67 38 135 0.416 2.34 7.03 PlyA + 174174 174179 6 1.05 8.14 PlyA - 174345 174340 6 1.05 8.13 Term - 177119 177010 110 0 2 77 43 95 0.795 1.59 8.12 Intr - 180108 179979 130 0 1 68 75 58 0.769 1.85 8.11 Intr - 180901 180771 131 2 2 98 78 51 0.953 4.59 8.10 Intr - 181722 181571 152 2 2 85 52 165 0.984 11.49 8.09 Intr - 184758 184632 127 1 1 83 68 160 0.995 12.32 8.08 Intr - 186513 186456 58 0 1 97 101 11 0.804 0.84 8.07 Intr - 186786 186732 55 2 1 14 105 39 0.257 -3.74 8.06 Intr - 191363 191221 143 2 2 89 88 94 0.915 7.73 8.05 Intr - 192180 192073 108 0 0 111 98 102 0.990 13.06 8.04 Intr - 195568 195410 159 1 0 136 78 148 0.999 17.86 8.03 Intr - 199979 199813 167 0 2 80 107 124 0.999 12.26 8.02 Intr - 206663 206513 151 2 1 54 116 126 0.996 10.81 8.01 Init - 207174 207049 126 0 0 78 94 120 0.876 11.83 8.00 Prom - 220045 220006 40 -2.35 9.03 PlyA - 220526 220521 6 1.05 9.02 Term - 225802 225630 173 2 2 92 53 106 0.433 4.51 9.01 Intr - 227239 227111 129 2 0 114 70 16 0.440 2.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 90577 90386 192 1 0 74 48 154 0.884 4.89 S.002 Intr + 128185 128525 341 2 2 27 82 295 0.884 17.07 S.003 Init + 152526 152531 6 2 0 87 94 0 0.847 1.53 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:49360106_49591833|GENSCAN_predicted_peptide_1|140_aa MLKNASEPLNRTDQVEERISELADRLFENMQLEETKEKRIKNNEECLRDLGNSLERANLR LTDLKEEKLLEQNPTSTGNQSKNGQMGPHQIEKLLHSEGYNQQSEEKTYRTQAFYEKVLN ITDYQRNANQNYNEISSHDS >gi568815585f:49360106_49591833|GENSCAN_predicted_CDS_1|423_bp atgctgaagaatgcatcagagcctcttaacagaactgatcaagtagaagaaagaattagt gagcttgcagacaggctatttgaaaacatgcagttagaggagacaaaagaaaaaagaata aaaaacaatgaagaatgcctacgagatctaggaaatagtctagaaagggcaaatctaaga cttactgaccttaaagaggagaaacttcttgagcaaaaccccacaagcacaggcaatcaa agcaaaaatggacaaatgggaccacatcaaattgaaaagcttctgcacagtgaagggtac aatcaacaaagtgaagagaaaacctacagaacacaggccttctatgaaaaggtgctcaac atcactgattatcagagaaatgcaaatcaaaactacaacgagatatcctctcacgacagt taa >gi568815585f:49360106_49591833|GENSCAN_predicted_peptide_2|143_aa MESTPAEDEMTTKDLEYYINLADKAVAGFERTDFNFESSSTASEEVSKSLQAMKEILCGT NEKEPPTEAVAQLAQELYSSGLLVTLIADLQLIDFETYSSDTEPPPPCLLVHGLHWDSHF CVTLSTRTVFATPSGMLICELFE >gi568815585f:49360106_49591833|GENSCAN_predicted_CDS_2|432_bp atggaatctactcctgctgaagatgaaatgacaacaaaggatttagaatattacataaat ttagctgataaagcagtggcaggttttgagaggactgacttcaattttgaaagcagttct actgcttcagaagaagtgtctaaatcactgcaagcaatgaaagaaattctgtgtggtaca aacgagaaagaacccccaacagaagcagtggctcagctagcacaagaactctacagcagt ggcctgctagtgacactgatagctgacctgcagctgatagactttgagacttactccagc gacacagaacccccacccccatgtctcttggttcatggcttgcactgggacagccacttc tgtgttaccttgagtacgaggactgtgtttgccacacctagtggcatgttaatttgtgaa ttgtttgaatga >gi568815585f:49360106_49591833|GENSCAN_predicted_peptide_3|134_aa MNVNLAEESKGKETVGDEIKGGGIQMVMLASQPLTSYCAARFLTGHGQKLKSELKKTLQV IMLNIQKVRIPTRDGYNEKDILSTGKNVEKLELSNIAGGQLRTGREPSPSGGNEATLSTP SSSVGGCLLKHRFK >gi568815585f:49360106_49591833|GENSCAN_predicted_CDS_3|405_bp atgaatgtgaatttggcagaagaatcaaagggcaaagaaacagtaggtgatgaaattaaa ggaggcggaattcagatggtaatgcttgccagccagccactcacctcctactgtgctgcc cggttcctaacaggccatggacagaaattaaaatctgagttaaaaaaaaccctgcaagtc attatgctaaatattcagaaagtgagaatacccactagggatggctataatgaaaaagac atactaagcactggcaaaaatgtagagaaactggaactctcaaacattgctggtggtcaa ctgagaacagggagagaaccaagcccctctggtggtaatgaggcaaccctgtctaccccc agcagtagtgttggagggtgcctgctaaaacacagatttaagtaa >gi568815585f:49360106_49591833|GENSCAN_predicted_peptide_4|121_aa MTSKWNVVSWMGSWNGKRHQKTLKTPPKALGTDNFRYPSNIEKKEYQEQSVLSCCSERKD ANPKSVVCSFFMQEQCTKGEKHLFNLKFCGKDPVICFTCRPGDPDDTNDRASMIYGIVLH E >gi568815585f:49360106_49591833|GENSCAN_predicted_CDS_4|366_bp atgacaagtaaatggaatgtggtgtcctggatgggatcttggaatggaaaaagacatcag aaaaccctaaagactccaccaaaggctcttggaactgataacttcagatatccatcaaat attgagaagaaagaatatcaggagcaaagtgttctaagttgctgttcagaacgtaaagat gcgaaccccaaatcagtggtttgttcattcttcatgcaagagcaatgcactaaaggagag aagcatttatttaatctgaaattctgcggaaaggacccagtaatctgtttcacctgccgt ccaggtgatcctgatgatactaatgatagagcatcaatgatctatggtatagtccttcat gagtag >gi568815585f:49360106_49591833|GENSCAN_predicted_peptide_5|84_aa MAVQMIVITYVVSLKVYLDEIGGEDHNSDAKTFWMELEDDGKVDFIFEQVQNVLQSLKQK IKDGSATNKGASQKEVNAQSSGEI >gi568815585f:49360106_49591833|GENSCAN_predicted_CDS_5|255_bp atggctgtacaaatgattgtaataacttatgttgtttcattaaaggtgtacttagatgaa attggtggtgaagatcacaatagcgatgcaaaaactttctggatggagctagaagatgat ggaaaagtggacttcatttttgaacaagtacaaaatgtgctgcagtcactgaaacaaaag atcaaagatgggtctgccaccaataaaggagcatcacagaaagaagtgaatgcccaaagc agtggtgagatttga >gi568815585f:49360106_49591833|GENSCAN_predicted_peptide_6|730_aa MPLNLKGENPLQLPIKCHFQRRHAKTNSHSSALHVSYKTPCGRSLRNVEEVFRYLLETEC NFLFTDNFSFNTYVQLARNYPKQKEVVSDVDISNGVESVPISFCNEIDSRKLPQFKYRKT VWPRAYNLTNFSSMFTDSCDCSEGCIDITKCACLQLTARNAKTSPLSSDKITTGYKYKRL QRQIPTGIYECSLLCKCNRQLCQNRVVQHGPQVRLQVFKTEQKGWGVRCLDDIDRGTFVC IYSGRLLSRANTEKSYGIDENGRDENTMKNIFSKKRKLEVACSDCEVEVLPLGLETHPRT AKTEKCPPKFSNNPKELTVETKYDNISRIQYHSVIRDPESKTAIFQHNGKKMDSSSNHVD EFEDNLLIESDVIDITKYREETPPRSRCNQATTLDNQNIKKAIEVQIQKPQEGRSTACQR QQVFCDEELLSETKNTSSDSLTKFNKGNVFLLDATKEGNVGRFLNYFNTCWRSNIWIDKT LSHQPFEWGEGEALLDSKRQPRFKKTKPSAAGALPGSRYPAATRSCSTVMAQASPPRPER VLGASSPEARPAQEALLLPTGILLIGVFQVAEKMEKRTCALCPKDVEYNVLYFAQSENIA AHENCLLYSSGLVECEDQDPLNPDRSFDVESVKKEIQRGRKLKDKTQLLTLAYATVKVPF LKKCKEAGLLNYLLEEILDKVHSIPEKLMDETTSESEVSNRLATKRMCHSEIDSVTYAPL PPPCIPVAKI >gi568815585f:49360106_49591833|GENSCAN_predicted_CDS_6|2193_bp atgccactgaacttgaagggagaaaaccctctgcagctgccaatcaaatgtcacttccaa agacgacatgcaaagacaaactctcattcttcagcactccacgtgagttataaaacccct tgtggaaggagtctacgaaacgtggaggaagtttttcgttacctgcttgagacagagtgt aactttttatttacagataacttttctttcaatacctatgttcagttggctcggaattac ccaaagcaaaaagaagttgtttctgatgtggatattagcaatggagtggaatcagtgccc atttctttctgtaatgaaattgacagtagaaagctcccacagtttaagtacagaaagact gtgtggcctcgagcatataatctaaccaacttttccagcatgtttactgattcctgtgac tgctctgagggctgcatagacataacaaaatgtgcatgtcttcaactgacagcaaggaat gccaaaacttcccccttgtcaagtgacaaaataaccactggatataaatataaaagacta cagagacagattcctactggcatttatgaatgcagccttttgtgcaaatgtaatcgacaa ttgtgtcaaaaccgagttgtccaacatggtcctcaagtgaggttacaggtgttcaaaact gagcagaagggatggggtgtacgctgtctagatgacattgacagagggacatttgtttgc atttattcaggaagattactaagcagagctaacactgaaaaatcttatggtattgatgaa aacgggagagatgagaatactatgaaaaatatattttcaaaaaagaggaaattagaagtt gcatgttcagattgtgaagttgaagttctcccattaggattggaaacacatcctagaact gctaaaactgagaaatgtccaccaaagttcagtaataatcccaaggagcttactgtggaa acgaaatatgataatatttcaagaattcaatatcattcagttattagagatcctgaatcc aagacagccatttttcaacacaatgggaaaaaaatggactcaagttcaaaccatgttgat gagtttgaagataatctgctgattgaatcagatgtgatagatataactaaatatagagaa gaaactccaccaaggagcagatgtaaccaggcgaccacattggataatcagaatattaaa aaggcaattgaggttcaaattcagaaaccccaagagggacgatctacagcatgtcaaaga cagcaggtattttgtgatgaagagttgctaagtgaaaccaagaatacttcatctgattct ctaacaaagttcaataaagggaatgtgtttttattggatgccacaaaagaaggaaatgtc ggccgcttccttaattattttaacacttgttggagaagcaatatctggatcgataaaaca ctgtcccatcaaccatttgagtggggagagggagaagctcttcttgactcaaagcgacag cccagatttaagaaaacgaaacctagtgcagctggggcacttccgggatctcgctatccg gccgccacccgcagctgcagcacagtcatggcccaggcgtcgccgccccggcccgagagg gtgctcggcgccagcagcccggaggcccggcccgcgcaggaggcgctcctccttcccacc gggatattacttataggtgtctttcaggttgcagaaaagatggaaaaaaggacatgtgca ctctgccccaaagatgtcgaatataatgtcctatactttgcacaatcagagaatatagct gctcatgagaattgtttgctgtattcttcaggacttgtggaatgtgaggatcaggatcca cttaatcctgatagaagttttgatgtggaatcagtaaagaaagaaatccagagaggaagg aagttgaaagataaaacccaactccttactctggcatatgcaactgtgaaagttcctttt cttaagaaatgcaaggaagcaggacttcttaattacttacttgaagaaatattagacaaa gttcattcaattccagaaaaactcatggatgagactacttcagaatcagaggtgtctaac aggttggccactaagagaatgtgccattcagagattgattcggtcacatatgctcccctg ccaccgccctgcattcctgttgctaagatctga >gi568815585f:49360106_49591833|GENSCAN_predicted_peptide_7|96_aa MHLKQQSPTFLAPGTGFVEDNFSTDHGDGAVGQGITSPVEHKLDTSSTVPQSTHTEPSSL ALQFLKAPHLLALAMNSFSRGPSICQNAAVCVTSVK >gi568815585f:49360106_49591833|GENSCAN_predicted_CDS_7|291_bp atgcacctaaagcagcagtccccaacctttttggcaccagggactggtttcgtggaagac aatttttccacggaccatggggatggggcggttggtcagggaatcacatcacccgtagag cacaaactggacacatcctcaacagtgccccagagcactcacacagaacccagcagcctt gcgcttcagttcttaaaggctccacatttactggctttagcaatgaattcctttagcaga gggccatccatttgccaaaatgctgcagtctgtgtaacttctgtcaaatga >gi568815585f:49360106_49591833|GENSCAN_predicted_peptide_8|538_aa MVDVGKWPIFTLLSPQEIASIRKACVFGTSASEALYVTDNDEVFVFGLNYSNCLGTGDNQ STLVPKKLEGLCGKKIKSLSYGSGPHVLLSTEDGVVYAWGHNGYSQLGNGTTNQGIAPVQ VCTNLLIKQVVEVACGSHHSMALAADGEVFAWGYNNCGQVGSGSTANQPTPRKVTNCLHI KRVVGIACGQTSSMAVLDNGEVYGWGYNGNGQLGLGNNGNQLTPVRVAALHSVCVNQIVC GYAHTLALTDEGLLYAWGANTYGQLGTGNKNNLLSPAHIMVEKERPYACTGPWPVINWVA RQEMGPSSCRKTSSGLPLILHYEHEDFLTVAESLKKEFDSPETADLKFRIDGKYIHVHKA VLKIRCEHFRSMFQSYWNEDMKEVIEIDQFSYPVYRAFLQYLYTDTVDLPPEDAIGLLDL ATSYCENRLKKLCQHIIKRGITVENAFSLFSAAVRYDAECLAHIRHLINVELDEDSTLKE ALRMISELKAGMVGFTDSERIKVTLTTGWTELGVVRDLVSSGMCLVGSCDAEAPAHPG >gi568815585f:49360106_49591833|GENSCAN_predicted_CDS_8|1617_bp atggtggatgtcggaaagtggcccatcttcactctactctcccctcaagagatcgcgtct attcggaaggcgtgtgtcttcggcacctcagccagtgaagcactgtacgttactgacaat gatgaggtctttgtatttggactgaactatagtaactgtctaggaactggagataaccag agtacacttgtacccaaaaagctagaaggcttatgtggaaagaagattaaaagcctcagt tacgggagtggaccacatgttcttctcagcaccgaagatggagtggtttatgcctggggc cacaatggatatagccagcttgggaatgggacgaccaaccaaggcattgctcccgtccag gtctgtaccaatctcttgatcaagcaagtggtggaagtagcttgtggctcacatcattca atggctctggcagctgatggagaggtgtttgcttggggttataacaactgtggccaagtg ggatcaggttctacagcaaatcaaccaactcctcgaaaagttacaaactgtttacatatt aagagggtagttggcattgcctgtggtcagacttcatccatggctgttctggacaatggc gaggtatatggctggggttacaatggcaacggtcagctgggcctgggaaacaatggcaac cagctgacccctgtgagagtggcagctttgcacagcgtgtgtgtgaaccagattgtctgc ggttacgcacatactctagcactaacagatgagggcttgctgtatgcctggggagctaac acatatgggcagctgggaactggcaataaaaataacctgctaagcccagcacacatcatg gtggagaaagaaaggccatatgcctgtactggtccgtggcctgttattaactgggttgca cggcaggagatggggccgtccagttgtaggaaaacaagctcaggactcccactgattcta cattatgagcatgaagactttttaacagttgcagagtcactgaagaaagaatttgatagt ccagaaactgctgatctgaagtttcgaattgatggaaaatatattcatgtccataaagct gttttgaaaatcaggtgtgagcattttcgatccatgttccagtcgtattggaatgaagac atgaaggaagtgatagaaatcgatcagttttcttacccagtgtatcgtgcctttctccag tacctctacacagacacagtcgacctgccgccagaagatgctataggtcttctggatttg gcgacatcttactgtgaaaacagactgaaaaaactttgtcagcacattatcaagagagga attactgtggagaatgccttttcgctattctctgctgcagtcagatatgatgcagagtgc ctggcacatattaggcacctgataaatgttgaactggatgaagatagtaccttaaaggaa gcattaaggatgatcagtgagctgaaggcgggtatggttggtttcacagattcagagaga ataaaagttactttaacaacaggttggacagagttaggtgtggttcgtgatcttgtgtca tctggaatgtgtcttgtggggagctgtgatgccgaggcacctgctcaccctggttag >gi568815585f:49360106_49591833|GENSCAN_predicted_peptide_9|100_aa XMFTQVLLIGLQPKETDFSFFEQNRPLLVNSLNCHYFGQATSFSLWGGRPTLPTAKPSRG AEQEPVPRAFLVPQTQKPLSPGSSHRISWGHSSGRASGRA >gi568815585f:49360106_49591833|GENSCAN_predicted_CDS_9|303_bp nacatgtttacacaagttcttttgatcgggttgcaaccaaaagaaactgattttagtttc tttgagcaaaacaggcctcttctagtaaattcactcaactgccactactttgggcaagcc acttcattcagtctctggggtggtaggccgaccctccccacagccaagccatctcgggga gcagagcaggagcccgtgcctcgcgcgttcctggttcctcagacacaaaagcctctaagt cccggcagcagccaccggatttcatggggacactccagtggcagggcctcggggcgggcc tga