GENSCAN 1.0 Date run: 5-Nov-116 Time: 08:46:03

Sequence gi568815596f:181357655_181635524 : 277870 bp : 36.44% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  17074  17261  188  2  2   30   47   161 0.169   2.87
 1.02 PlyA +  19524  19529    6                               1.05

 2.05 PlyA -  19949  19944    6                               1.05
 2.04 Term -  21183  21062  122  1  2   78   28    87 0.052  -0.64
 2.03 Intr -  53933  53795  139  2  1   40   82   183 0.746  12.02
 2.02 Intr -  60267  60178   90  0  0   66   86    53 0.684   2.17
 2.01 Init -  64734  64549  186  0  0   94   52   106 0.857   6.70
 2.00 Prom -  91566  91527   40                              -3.65

 3.02 PlyA -  91850  91845    6                              -0.45
 3.01 Sngl -  93838  91898 1941  1  0   70   47   564 0.922  44.14
 3.00 Prom -  97115  97076   40                              -4.15

 4.00 Prom +  98735  98774   40                              -5.95
 4.01 Init + 100001 100197  197  1  2   55  101   269 0.813  21.35
 4.02 Intr + 100589 100663   75  2  0   39   92   118 0.434   5.01
 4.03 Intr + 109889 109954   66  2  0  103  108    37 0.773   4.30
 4.04 Intr + 114347 114399   53  2  2   56   83    50 0.472  -1.07
 4.05 Intr + 117306 117412  107  1  2   66   66    84 0.542   3.01
 4.06 Intr + 117505 117634  130  0  1   56  116    41 0.816   3.05
 4.07 Intr + 123944 124029   86  0  2  102  116   146 0.999  17.42
 4.08 Intr + 124706 124768   63  1  0   94   85    33 0.747   1.60
 4.09 Intr + 124860 124997  138  2  0   39   88   169 0.945  11.74
 4.10 Intr + 128227 128338  112  0  1  118   86   135 0.999  15.33
 4.11 Intr + 135671 135765   95  0  2   14  110    92 0.953   2.76
 4.12 Intr + 137068 137158   91  0  1   94   95   121 0.997  12.05
 4.13 Intr + 137717 137762   46  0  1  116  116    77 0.999   9.85
 4.14 Intr + 138129 138283  155  0  2  100   60    82 0.902   5.39
 4.15 Intr + 140969 141123  155  0  2   67   62    77 0.641   1.87
 4.16 Intr + 143857 143949   93  0  0   18   95    95 0.583   2.44
 4.17 Intr + 152004 152153  150  2  0   71   11   172 0.603   7.24
 4.18 Intr + 154045 154121   77  0  2   83  100    29 0.231   0.99
 4.19 Intr + 164537 164687  151  2  1   36  119   127 0.253   9.84
 4.20 Intr + 165783 165878   96  2  0   77  110    19 0.796   2.29
 4.21 Intr + 166517 166596   80  1  2  100   35    54 0.798  -1.27
 4.22 Intr + 167548 167637   90  1  0   47  106    84 0.761   4.29
 4.23 Intr + 169643 169733   91  2  1   91   66    78 0.989   4.98
 4.24 Intr + 171887 171994  108  1  0   88   98    69 0.992   7.46
 4.25 Intr + 172870 172995  126  0  0   92  111   100 0.998  12.66
 4.26 Intr + 174003 174122  120  2  0   50  115    47 0.814   3.37
 4.27 Intr + 176618 176716   99  1  0   77   66    79 0.694   3.99
 4.28 Term + 177778 177873   96  0  0   78   49    41 0.522  -3.81
 4.29 PlyA + 177950 177955    6                              -0.45

 5.07 PlyA - 178134 178129    6                               1.05
 5.06 Term - 179982 179976    7  2  1  108   49     0 0.202  -5.54
 5.05 Intr - 181610 181438  173  2  2   60   97   144 0.498  10.32
 5.04 Intr - 187142 187046   97  1  1   37   80    57 0.480  -1.11
 5.03 Intr - 190950 190891   60  2  0   86  100    42 0.626   2.13
 5.02 Intr - 191203 191026  178  2  1   47   97   182 0.730  13.06
 5.01 Init - 194230 194191   40  1  1   94   69    26 0.566   1.70
 5.00 Prom - 199074 199035   40                              -4.75

 6.04 PlyA - 200193 200188    6                               1.05
 6.03 Term - 201054 200880  175  2  1   80   46   142 0.645   5.45
 6.02 Intr - 219493 219343  151  2  1   -2   91   110 0.006   0.60
 6.01 Intr - 246425 246183  243  0  0   84  111    77 0.062   6.15

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 239734 239937  204  0  0   79   46   158 0.863   7.09

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:181357655_181635524|GENSCAN_predicted_peptide_1|62_aa
XAVCEKQGSGQPRETINPTFTGKADNEQSVACYFTADTSSHKKEKKGEDDINCESFTDVC
SS

>gi568815596f:181357655_181635524|GENSCAN_predicted_CDS_1|189_bp
ntggctgtttgtgaaaagcagggcagtggtcaaccaagggagacgataaacccaaccttt
accggcaaagctgataatgaacagtcagttgcatgctatttcacagctgatacctcttcc
cacaaaaaggaaaaaaagggggaagatgatattaattgtgaaagtttcacagatgtctgc
agcagttga

>gi568815596f:181357655_181635524|GENSCAN_predicted_peptide_2|178_aa
MGLGIYLDQYTRQKGQDPVAELKQLIPLVVSLSAPNLEMPLLKKKTTNPSTFLKSLSGGL
NLKTECSRVVDAIHVSQTPGSWSQMKKEGKQKAQGDLNSVPETLAEVIGDPAGNPPPAEE
GWVRVRPEEALWLQTATAGWQHELSWAPWSSAQPSAEAVNSAITTAESDHSPDEITLI

>gi568815596f:181357655_181635524|GENSCAN_predicted_CDS_2|537_bp
atggggcttggaatctatttagatcaatacacaaggcagaaaggacaagacccagttgct
gaactgaaacaactgattccattggtagtctctctatcagcaccaaacctggaaatgcca
ctcctcaagaaaaagaccaccaatccaagcacctttctaaaatcacttagtggaggcctg
aatcttaagacagaatgcagtagagtcgtcgacgccatccatgttagtcaaacccctggg
tcatggagccagatgaagaaggagggcaaacagaaggcccagggagacctgaattctgtc
cctgagactctggctgaagttattggagatcctgcagggaatcctcccccagctgaagaa
ggatgggtcagggttagacctgaagaggcactctggttgcagactgccacagccggatgg
cagcatgaactcagctgggccccgtggtcctcagcccagccctcggcagaagcagtgaat
tcagccataactacagcagagtctgatcattctcctgatgaaattacactgatctaa

>gi568815596f:181357655_181635524|GENSCAN_predicted_peptide_3|646_aa
MDKFLDTYTLPMLNQEEVESLNRSITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEEL
VPFLLKPFQSIEKEGILPNSFYEASIILIPKAERDTTKKENFRPISLMNINAKILSKILE
NRIQQHIKKLIHHDQVDFIPGMQGWFNIRKSINVIQHINTTKDKNHMIISIDAEKVFDKI
QQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTSTRQGCPLSPLLF
NIVLKVLARAIRQKKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQYLLKLISNFSKV
SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPL
LNKIKEDTNKWKNIPCSWIGRTDTVKMAILPKVIYRSNAMLIKLPRTFFTELEKTTLKFI
WNQKRACIAKSILSQKNKAGGIMLPHFKLYYKVTVTKTARYWYQNRDIDQWNRTEPSEII
PHIYNHLIFGKPDKNKKWGKDFLFNKWCWENWLAICRKLKLDPFLTPYTKINSRCIKDLH
VRPKTIKTLEENLGNTIQDIGMGKDFISKTPKAMATKAKIDKWDLIKLKSFCTAKETTIR
VNRQPTEWEKIFAIYSSDKGLISRIYKELKQIYKKKTTPSTSGQRI

>gi568815596f:181357655_181635524|GENSCAN_predicted_CDS_3|1941_bp
atggataaattcctggacacatacaccctcccaatgctaaaccaggaagaagttgaatcc
ctgaatagatcaataacaggctctgaaattgaggcaataattaatagcctaccaaccaaa
aaaagtccaggaccagacggattcacagccgaattctaccagaggtacaaggaggagctg
gtaccattccttctgaaaccattccaatcaatagaaaaagagggaatcctccctaactca
ttttatgaggccagcatcatcctgataccaaaggctgagagagacacaacaaaaaaagag
aattttagaccaatatccctgatgaacatcaatgcaaaaatcctcagtaaaatactggaa
aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtggacttcatccct
gggatgcaaggctggttcaacatacgcaaatcaataaatgtaatccagcatataaacaca
accaaagacaaaaaccacatgattatctcaatagatgcagaaaaggtctttgacaaaatt
cagcagcccttcatgctaaaaactctcaataaattaggtattgatgggacgtatctcaaa
ataataagagctatttatgacaaacccacagccaatatcatcctgaatgggcaaaaattg
gaagcattccctttgaaaactagcacaagacagggatgccctctctcaccactcctattc
aacatagtgttgaaagttctggccagggcaatcaggcagaagaaagaaataaagggtatt
caattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatcta
gaaaaccccattgtctcagcccaatatctccttaagctgataagcaacttcagcaaagtc
tcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaa
acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac
ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg
ctcaacaaaataaaagaggacacaaacaaatggaagaacattccatgctcatggatagga
agaaccgataccgtgaaaatggccatactgcccaaggtaatttatagatccaatgccatg
ctcatcaagctaccaaggactttcttcacagaattggaaaaaactactttaaagttcata
tggaaccaaaaaagagcctgcattgccaagtcaatcctaagccaaaagaacaaagccgga
ggcatcatgctacctcacttcaaactatactacaaggttacagtaaccaaaacagcacgg
tactggtaccaaaacagagatatagaccaatggaacagaacagagccctcagaaataata
ccacacatctacaaccatctgatctttggcaaacctgacaaaaacaagaaatggggaaag
gatttcctatttaataaatggtgctgggaaaactggctggccatatgtagaaagctaaaa
ctggatcccttccttacaccttatacaaaaattaattcaagatgcattaaagacttacat
gttagacctaaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacata
ggcatgggcaaggacttcatttctaaaacaccaaaagcaatggcaacaaaagccaaaatt
gataaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcaga
gtgaacaggcaacctacagaatgggagaaaatttttgcaatctactcatctgacaaaggg
ctaatatccagaatctacaaagaactcaaacaaatttacaagaaaaaaacaaccccatca
acaagtgggcaaaggatatga

>gi568815596f:181357655_181635524|GENSCAN_predicted_peptide_4|981_aa
MAWEARREPGPRRAAVRETVMLLLCLGVPTGRPYNVDTESALLYQGPHNTLFGYSVVLHS
HGANRCDQSRGDLQMQDRKESRPDVRTAPAGGNAKILRTILNGGKEQGKIGNKELRGTHV
ERKKFQGSDVSSPNGEPCGKTCLEERDNQWLGVTLSRQPGENGSIVTCGHRWKNIFYIKN
ENKLPTGGCYGVPPDLRTELSKRIAPCYQGYSVGAGHFRSQHTTEVVGGAPQHEQIGKAY
IFSIDEKELNILHEMKGKKLGSYFGASVCAVDLNADGFSDLLVGAPMQSTIREEGRVFVY
INSGSGAVMNAMETNLVGSDKYAARFGESIVNLGDIDNDGFEDVAIGAPQEDDLQGAIYI
YNGRADGISSTFSQRIEGLQISKSLSMFGQSISGQIDADNNGYVDVAVGAFRSDSAVLLR
TRPVVIVDASLSHPESVNRTKFDCVENGWPSVCIDLTLCFSYKGKEVPGYIVLFYNMSLD
VNRKAESPPRFYFSSNGTSDVITGSIQVSSREANCRTHQAFMRSRRSRFDTWRRGYCGNT
GERMVPTKVVAVDMKDVRDILTPIQIEAAYHLGPHVISKRSTEEFPPLQPILQQKKEKDI
MKKTINFARFCAHENCSADLQVSAKIGFLKPHENKTYLAVGSMKTLMLNVSLFNAGDDAY
ETTLHVKLPVGLYFIKILELEEKQINCEVTDNSGVVQLDCSIGYIYVDHLSRIDISFLLD
VSSLSRAEEDLSITVHATCENEEEMDNLKHSRVTVAIPLKYEVKLTVHGFVNPTSFVYGS
NDENEPETCMVEKMNLTFHVINTGNSMAPNVSVEIMVPNSFSPQTDKLFNILDVQTTTGE
CHFENYQRVCALEQQKSAMQTLKGIVRFLSKTDKRLLYCIKADPHCLNFLCNFGKMESGK
EASVHIQLEGRPSILEMDETSALKFEIRATGFPEPNPRVIELNKDENVAHAGFFKRQYKS
ILQEENRRDSWSYINSKSNDD

>gi568815596f:181357655_181635524|GENSCAN_predicted_CDS_4|2946_bp
atggcttgggaagcgaggcgcgaacccggcccccgaagggccgccgtccgggagacggtg
atgctgttgctgtgcctgggggtcccgaccggccgcccctacaacgtggacactgagagc
gcgctgctttaccagggcccccacaacacgctgttcggctactcggtcgtgctgcacagc
cacggggcgaaccgatgtgatcaatcccggggcgatttacagatgcaggatcggaaagaa
tcccggccagacgtgcgaacagctccagctggtggaaatgccaaaatactgagaaccatt
ctaaatggaggaaaagagcagggcaagatagggaacaaagagctgagaggcactcatgtg
gagagaaagaagtttcagggaagtgatgttagtagccctaatggagaaccttgtggaaag
acttgtttggaagagagagacaatcagtggttgggggtcacactttccagacagccagga
gaaaatggatccatcgtgacttgtgggcatagatggaaaaatatattttacataaagaat
gaaaataagctccccactggtggttgctatggagtgccccctgatttacgaacagaactg
agtaaaagaatagctccgtgttatcaaggatattcagtcggagctggtcattttcggagc
cagcatactaccgaagtagtcggaggagctcctcaacatgagcagattggtaaggcatat
atattcagcattgatgaaaaagaactaaatatcttacatgaaatgaaaggtaaaaagctt
ggatcgtactttggagcttctgtctgtgctgtggacctcaatgcagatggcttctcagat
ctgctcgtgggagcacccatgcagagcaccatcagagaggaaggaagagtgtttgtgtac
atcaactctggctcgggagcagtaatgaatgcaatggaaacaaacctcgttggaagtgac
aaatatgctgcaagatttggggaatctatagttaatcttggcgacattgacaatgatggc
tttgaagatgttgctatcggagctccacaagaagatgacttgcaaggtgctatttatatt
tacaatggccgtgcagatgggatctcgtcaaccttctcacagagaattgaaggacttcag
atcagcaaatcgttaagtatgtttggacagtctatatcaggacaaattgatgcagataat
aatggctatgtagatgtagcagttggtgcttttcggtctgattctgctgtcttgctaagg
acaagacctgtagtaattgttgacgcttctttaagccaccctgagtcagtaaatagaacg
aaatttgactgtgttgaaaatggatggccttctgtgtgcatagatctaacactttgtttc
tcatataagggcaaggaagttccaggttacattgttttgttttataacatgagtttggat
gtgaacagaaaggcagagtctccaccaagattctatttctcttctaatggaacttctgac
gtgattacaggaagcatacaggtgtccagcagagaagctaactgtagaacacatcaagca
tttatgcggagtaggagatcaagattcgacacatggagaagaggctactgtggcaacaca
ggtgagaggatggttccaacaaaagtggtggcagtggatatgaaagatgtgcgggacatc
ctcaccccaattcagattgaagctgcttaccaccttggtcctcatgtcatcagtaaacga
agtacagaggaattcccaccacttcagccaattcttcagcagaagaaagaaaaagacata
atgaaaaaaacaataaactttgcaaggttttgtgcccatgaaaattgttctgctgattta
caggtttctgcaaagattgggtttttgaagccccatgaaaataaaacatatcttgctgtt
gggagtatgaagacattgatgttgaatgtgtccttgtttaatgctggagatgatgcatat
gaaacgactctacatgtcaaactacccgtgggtctttatttcattaagattttagagctg
gaagagaagcaaataaactgtgaagtcacagataactctggcgtggtacaacttgactgc
agtattggctatatatatgtagatcatctctcaaggatagatattagctttctcctggat
gtgagctcactcagcagagcggaagaggacctcagtatcacagtgcatgctacctgtgaa
aatgaagaggaaatggacaatctaaagcacagcagagtgactgtagcaatacctttaaaa
tatgaggttaagctgactgttcatgggtttgtaaacccaacttcatttgtgtatggatca
aatgatgaaaatgagcctgaaacgtgcatggtggagaaaatgaacttaactttccatgtt
atcaacactggcaatagtatggctcccaatgttagtgtggaaataatggtaccaaattct
tttagcccccaaactgataagctgttcaacattttggatgtccagactactactggagaa
tgccactttgaaaattatcaaagagtgtgtgcattagagcagcaaaagagtgcaatgcag
accttgaaaggcatagtccggttcttgtccaagactgataagaggctattgtactgcata
aaagctgatccacattgtttaaatttcttgtgtaattttgggaaaatggaaagtggaaaa
gaagccagtgttcatatccaactggaaggccggccatccattttagaaatggatgagact
tcagcactcaagtttgaaataagagcaacaggttttccagagccaaatccaagagtaatt
gaactaaacaaggatgagaatgttgcgcatgctggcttctttaaaagacaatacaaatct
atcctacaagaagaaaacagaagagacagttggagttatatcaacagtaaaagcaatgat
gattaa

>gi568815596f:181357655_181635524|GENSCAN_predicted_peptide_5|184_aa
MMPQLENSTPHVTGHVQLVDVCTFSTAGKLLRFGFSAMFGFGGRTLALAEKYRWMSPNQR
RDFAVVKALAKLKAEDCEISFLPFNSSDDVQERLNNGSMALIIARNTSRPEFIKHLKRYA
SVKNQFNFPFVETYTVEEVKVHPRNNTGGYNPEEEEDETASENCFPWNVDGDLMEVASEV
HIRI

>gi568815596f:181357655_181635524|GENSCAN_predicted_CDS_5|555_bp
atgatgccacaactggaaaattccacacctcatgtgacagggcatgtacagctggtcgac
gtctgcaccttcagcaccgctggcaagcttcttcgctttgggttctcagccatgtttggc
tttggtggaagaactttggctctggcagaaaaatatcgatggatgtcccctaaccaacgg
agagattttgctgttgttaaggcactggcaaaacttaaggcagaagactgtgaaatatca
tttttaccatttaacagctctgatgatgtgcaagaaagattaaataatggaagtatggct
cttataattgcccgaaacacttctcggccagaatttataaaacacctgaaaagatatgcc
agtgtaaaaaatcagttcaattttccatttgttgagacttacactgttgaggaagtaaaa
gttcatccaaggaataatactggtggatataatccagaggaggaggaggatgaaactgct
tcagaaaattgtttcccttggaatgtagatggtgacttaatggaagttgcatcagaggtc
catattaggatttga

>gi568815596f:181357655_181635524|GENSCAN_predicted_peptide_6|189_aa
XDSKYDLLCKEEFIELKDIFSVKLKRRCSVKQQRSGTLLGITLFICLKKEQNKLKNSTLD
LINLSEDHCDIWFRQFKKILAECWFSKCGPFADIVSTWEPVINMNLQTTPQTTKPKILGA
GAQNLCFKKHVSVVCVGGDGSASEVAHALLLRAQKNAGMETDRILTPVRAQLPLGLIPAG
KGVATDSLN

>gi568815596f:181357655_181635524|GENSCAN_predicted_CDS_6|570_bp
ngtgattctaagtatgacttgctatgtaaagaagaatttattgaactcaaagacatattc
tctgtgaaactgaaacggcgttgttctgttaaacagcagagaagtggtactttattaggt
atcacactcttcatctgcttgaaaaaggaacaaaataaactaaagaattctacacttgat
cttattaatttaagtgaagaccactgtgacatatggtttagacagttcaagaaaatattg
gcagaatgttggttctcaaaatgtggcccctttgcagacatcgtcagcacctgggaacct
gttataaatatgaatctgcagaccacaccccagactactaagcccaaaattctgggggct
ggagcccagaatctatgttttaagaagcacgtcagtgttgtctgtgttggtggagatgga
tctgctagcgaagtagcccatgctttgcttctgagagctcagaagaatgctgggatggaa
acagaccgaatcctgactcctgtcagagcacagcttccacttggcttaataccagcaggc
aagggagtggctacagattcattaaactga