GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:54:15 Sequence gi568815592r:137393355_137594170 : 200816 bp : 39.91% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 556 551 6 1.05 1.02 Term - 3645 3543 103 2 1 70 38 131 0.127 3.07 1.01 Init - 15490 15411 80 1 2 83 63 63 0.079 3.88 1.00 Prom - 15667 15628 40 -3.65 2.00 Prom + 17257 17296 40 -5.95 2.01 Init + 21334 21560 227 0 2 59 18 138 0.070 1.89 2.02 Intr + 21873 22056 184 0 1 37 41 105 0.039 -0.43 2.03 Intr + 34661 34883 223 1 1 45 -7 230 0.060 5.98 2.04 Intr + 38111 38147 37 0 1 84 80 56 0.056 0.80 2.05 Intr + 39139 39371 233 1 2 65 59 159 0.137 7.19 2.06 Intr + 40198 40345 148 2 1 24 82 109 0.344 2.27 2.07 Intr + 41909 41981 73 2 1 71 115 43 0.593 3.79 2.08 Intr + 46231 46333 103 0 1 105 61 -18 0.021 -3.97 2.09 Term + 72437 72606 170 0 2 98 40 158 0.145 9.06 2.10 PlyA + 75239 75244 6 1.05 3.00 Prom + 77276 77315 40 -2.45 3.01 Init + 80360 80448 89 1 2 81 65 74 0.565 4.56 3.02 Intr + 83261 83404 144 2 0 24 59 112 0.196 0.38 3.03 Term + 83650 83749 100 1 1 86 54 78 0.279 0.82 3.04 PlyA + 83752 83757 6 1.05 4.06 PlyA - 84484 84479 6 1.05 4.05 Term - 95042 94847 196 1 1 32 32 202 0.432 4.90 4.04 Intr - 95772 95556 217 1 1 102 50 186 0.850 12.84 4.03 Intr - 96251 96137 115 2 1 62 47 91 0.539 1.50 4.02 Intr - 96862 96784 79 0 1 0 62 68 0.270 -5.97 4.01 Init - 100678 100002 677 1 2 69 63 739 0.256 62.18 4.00 Prom - 100825 100786 40 -12.03 5.06 PlyA - 101667 101662 6 1.05 5.05 Term - 104269 103672 598 0 1 62 48 357 0.021 22.11 5.04 Intr - 104850 104761 90 2 0 48 110 62 0.023 2.59 5.03 Intr - 113454 113342 113 0 2 33 86 78 0.005 0.36 5.02 Intr - 125678 125590 89 0 2 99 111 5 0.412 2.67 5.01 Init - 137718 137643 76 0 1 83 89 61 0.590 7.00 5.00 Prom - 140856 140817 40 -4.45 6.06 PlyA - 141631 141626 6 1.05 6.05 Term - 150865 150543 323 2 2 2 38 403 0.480 20.70 6.04 Intr - 151103 150941 163 2 1 68 76 68 0.052 2.23 6.03 Intr - 159778 159650 129 1 0 106 43 55 0.472 2.77 6.02 Intr - 160177 159993 185 2 2 90 86 71 0.427 5.69 6.01 Init - 163702 163495 208 1 1 73 72 191 0.984 15.03 6.00 Prom - 169583 169544 40 -5.25 7.00 Prom + 170818 170857 40 -6.55 7.01 Init + 186182 186440 259 1 1 60 71 191 0.484 11.95 7.02 Intr + 188088 188272 185 1 2 -26 63 146 0.110 -0.71 7.03 Intr + 191816 191875 60 1 0 69 86 66 0.048 2.51 7.04 Term + 198064 198222 159 0 0 16 38 127 0.008 -2.54 7.05 PlyA + 198354 198359 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 11891 11889 3 2 0 113 81 0 0.855 1.85 S.002 Init - 25490 25353 138 2 0 87 107 99 0.918 11.89 S.003 Term + 34661 34888 228 1 0 45 43 234 0.807 10.25 S.004 Sngl - 104352 103672 681 0 0 68 48 415 0.854 29.43 S.005 Term + 175638 175808 171 2 0 74 47 112 0.891 2.54 S.006 Sngl + 195141 195476 336 2 0 68 42 164 0.808 5.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:137393355_137594170|GENSCAN_predicted_peptide_1|60_aa MDEAGNHHSEQTITRTENQTPHVHTHRQEEDTMWNKRESIKLYESGQAIKAELEPEFEIK >gi568815592r:137393355_137594170|GENSCAN_predicted_CDS_1|183_bp atggatgaagctggaaatcatcattctgagcaaactatcacaaggacagaaaaccaaaca ccgcatgttcacactcatagacaagaagaggacacaatgtggaacaaacgagagtccata aagttatacgagtctgggcaagccatcaaggctgaactggaaccagaatttgaaattaag tga >gi568815592r:137393355_137594170|GENSCAN_predicted_peptide_2|465_aa MSLKDKPVQDQKQSSTLLVNLHVHLTLQPIVWERPGESRVHGTLPTCWERCMALTGQCQN RSCCLYSCSVPMTPGRQQPWQHFTKSSTRKLATRLKFQFVCCYWSYIREFFSMPDWSSDL NLAQLKSKRTGNTNLAANSTRKSQCLGGAIGCVAVDRRPALIIKRRGLSINSSSTVWTVT SGPLPLPLCQVDEMVQENFRFADDRNSEASEILHGAANGAVHDRHHAHQAQQRGVGLVRG RNGAQELKLNPDMLDLKQRANSSLVPSVRRHPSTRLFIFTHSLIPAVYILSAYYAPGAVV GGIMLEGSWVLAGDKHLEMNIENSALELEGPRSYPVPSSTFSGNLLDDIPSDGELDCLHL SAITNNAKVNLEVIMAIHPWGSLSCLCAKISLGTGHMDQAQGKPASQMQDSLSINQLSAL FVPRSLSGVQKESVHVDKLKGGESRGFYCWIEVALSKMDGELESE >gi568815592r:137393355_137594170|GENSCAN_predicted_CDS_2|1398_bp atgtctcttaaagacaaacctgttcaagaccaaaagcagagcagcaccttgctggttaat ctgcatgttcatttgaccttgcagccaatcgtctgggagaggccaggagagagcagagtg catgggactctgcccacttgttgggaacgctgcatggccctcactgggcagtgtcaaaac cgctcctgctgtctctacagctgctctgttccaatgacccccggaaggcagcagccctgg cagcattttactaaatcctcaacaagaaaactcgcaactcgtctgaagtttcagtttgtt tgttgttactggagttatatcagggagttcttttctatgccagactggagttcagattta aatcttgcccaactcaaaagcaagcgcacaggaaacacgaacctcgccgccaattccaca aggaaatctcagtgccttggaggggcaataggctgtgtggctgtagacagaaggccagcg ctgataataaagaggcgaggtttatcaattaactcaagcagcactgtgtggacagtgacc agcggtccactcccacttccgttgtgtcaggttgatgaaatggtccaggaaaacttcaga tttgctgatgacagaaacagtgaagcatctgaaattctccatggagcagcaaatggtgct gttcatgacaggcatcatgctcatcaagctcagcagaggggagttggtctggtgagaggc agaaatggagcccaggaactgaagctgaaccctgacatgcttgacttaaagcagagagca aattcttccttggtgccatctgtccgccgtcatccatccactcgtttattcatattcact cattcactcattccggcagtatacatattgtctgcctactatgcaccaggggctgtggtg ggagggatcatgctagaaggttcttgggtcctggcaggtgataaacacttggaaatgaat attgaaaattcagcattggaattggagggccctagaagttatccagtgccatcctctact ttttctgggaatctcctggatgacattcctagtgatggagagctggattgcctccacctc tctgctatcacaaacaacgctaaagtgaacttagaggtcattatggcaatccatccctgg ggtagcttatcctgtctttgtgccaaaatttcactgggcacaggccatatggatcaggcc caaggaaagcctgcctcccaaatgcaggacagcttaagcattaaccagctcagtgccctc tttgtacccaggtccttgtccggcgtccagaaagaatcagttcacgtggacaaattgaag ggtggtgaatccaggggattttattgctggatagaggtggctctcagcaagatggatggg gagctggaaagcgaatag >gi568815592r:137393355_137594170|GENSCAN_predicted_peptide_3|110_aa MKFIQAVSGVNNLSPTAFVSAAAVPLIRLRTYDQDRMERRSVLSHSNGSPLKFSSTSSLP EQLVEQAWACPGGEMMVGTRQQGTILKAETRPSPDTKYWYLDLGLPSLQN >gi568815592r:137393355_137594170|GENSCAN_predicted_CDS_3|333_bp atgaagttcattcaggctgtcagtggagtcaataacttatccccaactgcttttgtttca gcagcggcagtgcctctgatcaggctgaggacttatgaccaggacagaatggaacgacgt tctgtgctctcacacagcaatggttctccactcaagttctcttccacatcttcccttcct gagcagttagtggaacaagcctgggcctgcccaggaggtgaaatgatggtaggaacacga caacaaggcaccatcctgaaagcagagaccaggccctcaccagacaccaaatactggtac cttgatcttggacttcccagcctccagaactga >gi568815592r:137393355_137594170|GENSCAN_predicted_peptide_4|427_aa MMQKMPGESLSRAGAKAAGESSKYKIKKQLSEQDLQQLRLKINGRERKRMHDLNLAMDGL REVMPYAHGPSVRKLSKIATLLLARNYILMLTSSLEEMKRLVGEIYGGHHSAFHCGTVGH SAGHPAHAANSVHPVHPILGGALSSGNASSPLSAASLPAIGTIRPPHSLLKAPSTPPALQ LGSGFQHWAGLPCPCTICQMPPPPHLSALSTANMARLSAESKDLLKQTFTVIALPFKYSN YRRQKKDFMTAKHIAQTPTFNFAFHVPSRDSGALCSPFLTLHWGGGVTAKESWGLSCTQC PFKGPTEIKLRKLGPAESVFRRASVFSEAFPGTLRGGRGWGALDDEGARQQPQSALAVAP EPLCVSSLGTSQRTPGPRSPWPTRITAPGREPLLLTQAPGLPVAPERKYGRLSSQPGTGK RNLTRLG >gi568815592r:137393355_137594170|GENSCAN_predicted_CDS_4|1284_bp atgatgcagaagatgcccggggaaagcctctcgcgggctggcgccaaggccgcgggagag agcagcaagtacaaaatcaagaagcagctgtcggagcaggacctacagcagttgaggctg aagatcaacggacgcgaacgcaagcggatgcacgacctgaacctagccatggacgggctg cgcgaagtcatgccctacgcgcatgggccgtcggtgcgcaagctctccaagatcgccaca ctcctgctcgccagaaactacatcctcatgctcaccagctccctggaggagatgaagagg ctggttggcgagatctatgggggccaccactcggcctttcactgcgggaccgtgggccac tcggccggccaccccgcgcacgcggccaactccgtgcacccggtgcaccccatcttgggc ggcgcgctctcatctggcaacgcctcgtcaccgctgtccgccgcctcacttcccgccatc ggcaccatccggcctccccactcgctactcaaggcgccctccacgccgcccgcgctgcag ctgggcagcggcttccagcactgggctggtctgccctgcccctgcaccatctgccagatg ccgccgccgccgcacctgtccgctctctccacagccaacatggcccggctgtcggccgag tccaaggacttgctcaagcagacttttactgtcattgcacttccctttaaatactcgaat tatcgacgccagaaaaaagatttcatgaccgcaaagcacatcgctcagacaccaacattt aacttcgcgttccacgtcccttccagagactccggggccctctgttccccttttctcacc ctccactggggaggaggagtgactgctaaggagagttggggcctctcctgtacccagtgc ccgtttaaggggcccacagaaatcaaactccgaaagctcgggcctgcagagtcagttttc cgtcgagcgtctgttttcagtgaagccttccctgggacactgcggggcgggagaggatgg ggtgcgttggacgacgaaggagcccgccagcagccgcagtccgcgctcgcggtggccccg gagcccctctgcgtctcttcgctggggacctcacagaggaccccggggcctcgctctccc tggccgacccggatcacagcccctggaagagagccgttgcttctaacccaagcccctgga ctgcccgtggctcccgaacgaaagtacggacgtttgtcctcgcagccaggcaccggcaaa aggaatttaactcgcctcggttaa >gi568815592r:137393355_137594170|GENSCAN_predicted_peptide_5|321_aa MDGARSHYPRKTNTGTENQTPHVLTCYKTSEGQRLCLFTMLGVVLNTALYTDECQDDTGS PVRNWIQNPDIRCKMRQRTSRQYPEVSRCCGQRQSRQLLERNSVAKAAEQWASGTERRAL GVSLGWALVVSPQKEIRVPALLSSGPAPSRVQGSIRGTELRWRGSKRDSTKQDVRGDSAF WAPASINPFYGKQSCGTFTHSPRLVVKPRNGWEVGELASKSTYWKEALNLGPPTLTHYGP SKKQRVTEERRARHVPASEPGSPGLVPADLPFQAPLLLELKTLREEGRECPGLFGQRPSR PFPAHRSPLGTGCGRRQPPLG >gi568815592r:137393355_137594170|GENSCAN_predicted_CDS_5|966_bp atggatggagctagaagccattatcctcggaaaactaacacaggaacagaaaaccaaaca ccacatgttctcacttgttataaaacctcagaaggtcaaagactgtgtctgtttaccatg cttggtgttgtattaaacacagcactatacacagatgaatgccaggatgatactggtagt ccagtaagaaactggatccaaaacccagatattaggtgcaagatgagacaacgcacatca cgtcaatatcctgaagtttcaaggtgctgtggtcaaaggcagagccggcagctgctggag aggaactcggtggctaaggctgcggagcagtgggcgtcgggcaccgagcggcgcgcactg ggtgtcagtctggggtgggcgttggtagtcagccctcagaaagaaatccgtgtgcctgcc ctcctgagctccgggcctgctccttctcgggttcagggctcaatccgaggaactgagctg cggtggcgggggtcgaaaagagactccacgaagcaagacgtgcggggcgactcagccttt tgggccccggcgagtattaacccattttacggtaagcaaagttgtggcacctttactcac tcccctcgtctagttgtcaagccccgcaacggatgggaagtgggggaactagcttctaaa agtacttactggaaggaggccctgaaccttggaccacccacactcacgcattacggacct tcgaaaaagcagagggtgaccgaggagcggagggcgcgccacgtcccggcctcggagcca gggagtcccggcctggtccccgcagatctcccatttcaggcacctctgttactggagctg aaaacccttagggaagaaggaagggaatgtcctggtctgttcgggcaaaggcccagtcgc cccttcccggcccatcgcagccccctcggaaccggctgcggcaggcgtcagccccctctc ggctga >gi568815592r:137393355_137594170|GENSCAN_predicted_peptide_6|335_aa MEIKKVFELNDNGDTTYQNLWDTAKAVLRGKFMALNACIKKTERAQTDTLRSHLKELEKQ EQTKLKPSRISFFTSYSFTTCVGQSCPQTSGQTLDFSVPIHNGFHHQHPRPGHSAHIAID LLNGFTFLSFKSLVSTLVICNQYKDSAYKYGLVSSSPCTLLPLHFPPGRYVAYKFRRKDE AAILLLPCAGIGGPSVLQIYQQHESRKVSQTSGSGPERGQGYTSQKEGAGIEEVNMIKDD RTVIHFNNPKVQASLSANTFAITGHAEAKPITEMLPGILSQLGADSLTILKKLAEQFPWQ VLDSKAPKPEDIDEEEEDVLDPVEKSDKASKNEAK >gi568815592r:137393355_137594170|GENSCAN_predicted_CDS_6|1008_bp atggaaattaaaaaagtatttgaactgaacgacaatggtgatacaacctatcaaaacctc tgggatacagcaaaggcggtgctaagaggaaagttcatggccctaaatgcctgcatcaag aagactgaaagagcacaaactgacactctaaggtcacacctcaaggaactagagaaacaa gaacaaaccaaactcaaacccagcagaatctctttcttcacctcttactccttcaccacc tgtgttggacagagctgtcctcagacctcaggccaaacacttgacttcagtgtccccatc cacaatggcttccatcaccagcatcccagaccaggtcactctgctcatatagcaattgat ttgctgaatggattcactttcctgtccttcaagtcacttgtttcaacacttgtcatctgc aaccagtacaaggactcagcctacaaatatggccttgtttcctcctcaccctgcacactg ctaccacttcatttccctccaggtcgctatgtagcttacaagtttagaagaaaagatgag gcagccatcttgctcttgccatgtgctggtattggaggaccctccgtgcttcagatttac caacagcatgaatcaagaaaagttagccaaacttcaggctcaggtccggaaagggggcaa gggtacacctcacagaaagaaggtgctggtattgaagaggtgaacatgattaaagatgat aggacagttattcatttcaacaatcccaaagtccaagcttccctttctgctaataccttt gcaattactggtcatgcagaagccaaaccaatcacagaaatgcttcctggaatattaagt cagcttggtgctgacagcttaacaatccttaagaagttagctgaacagttcccatggcaa gtcttggacagtaaagcaccaaaaccagaagacattgatgaggaggaggaggatgttcta gatcctgtagaaaaatctgataaggcatcaaagaatgaagctaaataa >gi568815592r:137393355_137594170|GENSCAN_predicted_peptide_7|220_aa MTFCTGQGKRVLVGGNKCMVTKGSNNCVESAAQSPLGSSSLEKGKRLQFLVCVSNVSFPE EHGQFLGQCDQYPGTAVICPEQCRSVENTCALAQKQQELRCSVLQQHKCGERRKRIKPDS STGLNNNDLESFREESRFYSEGNAEQLKLYVEDPILSVTVFGDRAFKKSTQIHTTRSKKL LNLRKQSDNNTIIVRNFDTPLTTLSSSSKQKINKETLDLD >gi568815592r:137393355_137594170|GENSCAN_predicted_CDS_7|663_bp atgacattttgtacaggacaaggaaaaagagttttggtgggtgggaacaagtgcatggtc accaagggaagcaacaactgcgttgaatctgcagctcaatctcctttgggcagctcttcc cttgagaaagggaagaggctgcagttccttgtctgtgtcagtaatgtgagctttccagag gagcatggtcagttccttggacagtgtgatcagtacccaggtactgctgtgatctgtcct gagcaatgtcgtagtgtggaaaacacgtgtgcgttggcccagaaacaacaagaactgaga tgctcagttctgcagcagcacaagtgtggggagagaagaaagagaataaagccagacagc agcactggcctgaataacaatgaccttgaaagcttcagggaggaatctagattttattct gagggtaatgcggagcaattaaagttgtatgttgaagaccccattctcagtgtgactgta tttggagacagggcctttaaaaagagcacccagattcatactactagatctaagaaactg ctaaacctaagaaaacagagtgacaacaatacaataatagtgaggaactttgacaccccg ctgacaacactaagcagttcatcaaagcagaagatcaacaaagaaactctggacttagac tag