GENSCAN 1.0 Date run: 6-Nov-116 Time: 13:21:08 Sequence gi568815579r:45919241_46123021 : 203781 bp : 46.76% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 1057 1052 6 1.05 1.06 Term - 3319 3311 9 1 0 137 44 0 0.438 -1.31 1.05 Intr - 8819 8688 132 2 0 26 103 128 0.858 8.94 1.04 Intr - 21705 20656 1050 0 0 100 36 2010 0.037 187.60 1.03 Intr - 34706 34540 167 0 2 102 76 119 0.895 11.78 1.02 Intr - 41913 41770 144 1 0 119 105 310 0.999 36.05 1.01 Init - 54303 54027 277 0 1 102 78 548 0.965 50.35 1.00 Prom - 69538 69499 40 -3.76 2.00 Prom + 78850 78889 40 -4.86 2.01 Init + 80604 80664 61 2 1 97 73 20 0.621 2.82 2.02 Intr + 83768 83926 159 0 0 91 68 294 0.984 27.66 2.03 Intr + 84179 84261 83 0 2 42 91 47 0.991 -0.24 2.04 Intr + 87319 87476 158 0 2 91 75 297 0.998 27.51 2.05 Intr + 88900 89061 162 1 0 104 73 269 0.985 26.09 2.06 Intr + 95809 96019 211 1 1 109 83 267 0.976 27.22 2.07 Intr + 96105 96187 83 2 2 104 64 148 0.999 12.44 2.08 Intr + 96814 96983 170 1 2 77 62 81 0.773 3.99 2.09 Intr + 97078 97153 76 2 1 101 26 131 0.784 6.77 2.10 Intr + 97454 97593 140 2 2 99 93 76 0.997 9.31 2.11 Intr + 97751 97829 79 0 1 76 60 76 0.989 2.21 2.12 Intr + 98007 98064 58 0 1 89 101 11 0.978 1.39 2.13 Intr + 98838 98910 73 2 1 71 105 57 0.974 4.68 2.14 Term + 99050 99147 98 0 2 79 39 178 0.956 10.13 2.15 PlyA + 99354 99359 6 1.05 3.04 PlyA - 99932 99927 6 1.05 3.03 Term - 100179 99998 182 1 2 138 54 202 0.999 19.57 3.02 Intr - 100407 100286 122 2 2 87 92 194 0.999 19.84 3.01 Init - 103781 103495 287 2 2 75 89 444 0.991 37.75 3.00 Prom - 106098 106059 40 -3.76 4.04 PlyA - 106314 106309 6 1.05 4.03 Term - 112868 112745 124 1 1 86 38 138 0.977 6.46 4.02 Intr - 113565 113429 137 0 2 57 79 42 0.081 -0.43 4.01 Init - 118305 118153 153 0 0 60 80 204 0.496 14.81 4.00 Prom - 119134 119095 40 -4.16 5.05 PlyA - 119418 119413 6 -0.45 5.04 Term - 120544 119558 987 1 0 91 48 327 0.958 21.07 5.03 Intr - 121176 120917 260 1 2 53 110 124 0.585 8.28 5.02 Intr - 121473 121278 196 0 1 16 94 62 0.144 -1.41 5.01 Init - 121817 121812 6 2 0 64 115 10 0.286 1.61 5.00 Prom - 124025 123986 40 -7.06 6.00 Prom + 136846 136885 40 -3.46 6.01 Init + 143384 143469 86 1 2 95 49 55 0.220 2.59 6.02 Intr + 148366 148417 52 1 1 106 94 9 0.822 2.21 6.03 Intr + 148549 148630 82 0 1 101 54 83 0.594 5.41 6.04 Intr + 155627 155733 107 0 2 10 121 96 0.550 5.03 6.05 Intr + 156108 156246 139 2 1 55 95 50 0.632 2.44 6.06 Term + 158174 158334 161 0 2 98 43 64 0.705 1.00 6.07 PlyA + 158834 158839 6 1.05 7.00 Prom + 159400 159439 40 -6.66 7.01 Sngl + 159515 159952 438 1 0 41 36 716 0.874 57.96 7.02 PlyA + 160857 160862 6 1.05 8.03 PlyA - 160926 160921 6 1.05 8.02 Term - 167873 167702 172 1 1 1 42 182 0.515 2.20 8.01 Init - 170789 170707 83 2 2 58 60 51 0.545 -0.26 8.00 Prom - 176416 176377 40 -1.86 9.03 PlyA - 176538 176533 6 1.05 9.02 Term - 177180 176966 215 1 2 38 41 183 0.877 5.99 9.01 Init - 180731 180686 46 2 1 70 69 79 0.926 5.04 9.00 Prom - 202597 202558 40 -2.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 21705 20623 1083 0 0 100 54 1981 0.962 188.01 S.002 Init - 114351 114138 214 0 1 103 91 112 0.982 9.93 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:45919241_46123021|GENSCAN_predicted_peptide_1|592_aa MRMMAAGAVHGLFTASAAPQPPPPPPPPPPQPQPPQQPSPPPQQPPPPPPQPPQQQQPPP QAPPMEPEAPDSRKRPLETPPEVVCTKRSNTGEEGEYFLKVLIPSYAAGSIIGKGGQTIV QLQKETGATIKLSKSKDFYPGTTERVCLVQGTAEALNAVHSFIAEKVREIPQAMTKPEVV NILQPQTTMNPDRAKQAKLIVPNSTAGLIIGKGGATVKAVMEQSGAWVQLSQKPEGINLQ ERVVTVSGEPEQVHKAVSAIVQKVQEDPQSSSCLNISYANVAGPVANSNPTGSPYASPAD VLPAAAAASAAAASGLLGPAGLAGVGAFPAALPAFSGTDLLAISTALNTLASYGYNTNSL GLGLNSAAASGVLAAVAAGANPAAAAAANLLASYAGEAGAGPAGGAAPPPPPPPGALGSF ALAAAANGYLGAGAGGGAGGGGGPLVAAAAAAGAAGGFLTAEKLAAESAKELVEIAVPEN LVGAILGKGGKTLVEYQELTGARIQISKKGEFLPGTRNRRVTITGSPAATQAAQYLISQR VTYEQGRSERAVQEPLLLLRDQEAASRLHRRSLSPGSALAFGLPLAQLTSGS >gi568815579r:45919241_46123021|GENSCAN_predicted_CDS_1|1779_bp atgaggatgatggccgccggcgcggtgcacggcctcttcacggcctccgcggccccgcag ccgccgccgcccccgccgccgccgccgccgcaaccccagcctccccagcagccgtcgccg ccgccacagcagccgccgccgccgccgccgcagccgccgcagcagcagcagccgccgccc caggccccccccatggagcccgaggccccggattcccgcaagaggcccctcgaaacgccc cccgaggtggtctgcaccaagcgcagcaacacgggagaggaaggcgaatacttcctgaag gtgctgatccccagctacgcggcgggctccatcattggcaagggcgggcagaccatcgtg cagctgcagaaggagaccggagccaccatcaagctctccaagtccaaagacttctacccc ggaaccacagagcgggtatgcctagtacagggcacggcagaggccttgaatgctgtgcac agctttattgccgagaaggtccgagaaatcccacaagcgatgaccaagcctgaggtggtc aacatccttcaaccccaaaccacgatgaaccccgacagagccaagcaggccaagctgatc gtccccaacagcacggcgggcctgatcatcggcaagggaggcgccacggtgaaagccgtg atggaacagtcaggagcatgggtgcagctgtcccagaagccggagggcatcaacctgcag gagcgcgtggtgacggtcagcggcgagcccgagcaggtgcacaaggccgtgagcgccatc gtgcagaaggtacaagaagacccccagagcagcagctgcctcaacatcagctacgccaac gtggcaggccccgtggccaactccaaccccaccggctctccgtacgccagccccgcggat gtgctgccagccgcggccgcagcgtcggccgccgccgcctccggcctgctgggccccgcc gggctggctggcgtgggggcctttcccgccgcgctgcccgccttctcaggcaccgacctg ctggccatcagcacggcgcttaacacgctggcaagttacggctacaacaccaactccctg ggcctgggcctcaactcggccgcagcttccggcgtcctggccgccgtggccgccggggcc aacccagcagccgccgccgccgccaacctcctggcatcctacgcgggcgaggccggggcc gggccagccggaggggccgccccgccgccgcccccgcctcccggagccctggggtccttt gcgttggccgcagccgccaacggctacctcggggccggggcgggcggcggggcgggcgga gggggcggcccgctggtggccgctgcagccgcggccggggcggccgggggcttcctgacg gcggagaagctggcggctgagagtgccaaggagctggtggagattgcggtgcctgagaac ctggtgggagccatcctggggaaggggggcaagacgttggtggagtaccaggagctgacg ggcgctcgcatccagatctccaagaagggcgagttcctgccaggcacgcggaaccggcgg gtcaccatcacgggcagccccgcggccacgcaagccgctcaatacctcatcagtcagcgg gtcacctacgagcagggacgctcagaaagggccgtccaggaaccgctgctcctcctgcgc gatcaggaagccgccagccggctccaccgccgcagcctctcgccgggatctgcactggcc ttcgggctgcctctcgcccagcttacttcgggcagttag >gi568815579r:45919241_46123021|GENSCAN_predicted_peptide_2|536_aa MGPGGQPHSARRDGVVTSWVATLAMDQPAGLQVDYVFRGVEHAVRVMVSGQVLELEVEDR MTADQWRGEFDAGFIEDLTHKTGNFKQFNIFCHMLESALTQSSESVTLDLLTYTDLESLR NRKMGGRPGSLAPRSAQLNSKRYLILIYSVEFDRIHYPLPLPYQGKPDPVVLQGIIRSLK EELGRLQGLDGQNTRDTRENEIWHLREQVSRLASEKRELEAQLGRSREEALAGRAARQEA EALRGLVRGLELELRQERGLGHRVAGRRGQDCRRLAKELEEAKASERSLRARLKTLTSEL ALYKRGRRTPPVQPPPTREDRASSSRERSASRGRGAARSSSRESGRGSRGRGRPARPSPS PTGGRALRFDPTAFVKAKERKQREIQMKQQQRNRLGSGGSGDGPSVSWSRQTQPPAALTG RGDAPNRSRNRSSSVDSFRSRCSSASSCSDLEDFSESLSRGGHRRRGKPPSPTPWSGSNM KSPPVERSHHQKSLANSGGWVPIKEYSSEHQAADMAEIDARLKALQEYMNRLDMRS >gi568815579r:45919241_46123021|GENSCAN_predicted_CDS_2|1611_bp atggggcctggagggcagccgcacagtgctcgcagggatggggtggtgacgtcatgggtt gcaaccttggccatggaccagccggctggcctgcaggtggactacgtcttccggggtgtg gagcatgccgtgcgggtgatggtttctgggcaggtgctggagctggaggtggaggaccgg atgacggctgaccagtggcggggcgagttcgatgctggcttcattgaagatttgactcac aagacagggaacttcaaacagttcaacatcttctgtcatatgctggagtcagccctcact cagagtagtgagtcagtcaccctggacctgctgacctacacagacctggagtccctgcgg aaccgcaagatggggggccgcccaggctccttggcccccaggtcggcccagctcaactcc aagcgctacctgatcctcatctactccgtggagtttgacaggattcactacccgctgccc ctcccgtaccagggcaagccagaccccgtggttctgcagggcatcatccggtcactgaag gaggaactgggccgcctgcaagggctggatggccagaacactcgggacacccgggagaat gagatctggcatctgcgggagcaggtgtcgcgcctggcgtccgagaagcgggagctggag gcgcagctgggccgatcgcgcgaggaggcgctggccgggcgcgcggcacgccaggaggcc gaggcgctgcgcgggctggtgcgcgggctggagctggagctgcggcaggagcgcggcctc gggcacagggtggccggccgtcgcggccaggactgccgccgtctggccaaggagctcgag gaggcgaaggcatcggagcggagcctgcgcgcccggctgaagacgctgaccagcgagctg gcattgtacaagagggggaggcggactccgccggtgcagccgcccccgacgcgggaggac cgggcctcatcgtcccgggagcgctccgcgtcgcgaggccgcggcgccgcgcgctcctca tcccgggagagcggccgcgggagccggggtcggggccgccctgcgcgcccctcgccctcg cccacaggtggtcgcgcgctccgcttcgaccccacggcctttgtgaaagccaaggaaagg aagcagagagagatccagatgaagcagcagcagcggaaccgcttaggcagtgggggaagc ggggacggtccgtccgtctcctggtctcgccagacccagccccctgctgccttgactggc cgaggggacgcccctaaccgctcccgaaaccgcagctcctcagtggacagtttccgcagc cgctgctcgtctgccagctcctgcagcgatttggaggatttctctgagtcgctctccaga gggggtcaccgccgccgtgggaagcctcccagcccaacgccctggagtgggtccaatatg aagtctccccccgtggaacgcagccaccatcagaaatctctggccaactccgggggctgg gtccccatcaaagagtacagctcggagcaccaggcggctgacatggccgaaatagacgca cgcctgaaggccttgcaggagtacatgaaccgactggacatgcggtcataa >gi568815579r:45919241_46123021|GENSCAN_predicted_peptide_3|196_aa MSRRSMLLAWALPSLLRLGAAQETEDPACCSPIVPRNEWKALASECAQHLSLPLRYVVVS HTAGSSCNTPASCQQQARNVQHYHMKTLGWCDVGYNFLIGEDGLVYEGRGWNFTGAHSGH LWNPMSIGISFMGNYMDRVPTPQAIRAAQGLLACGVAQGALRSNYVLKGHRDVQRTLSPG NQLYHLIQNWPHYRSP >gi568815579r:45919241_46123021|GENSCAN_predicted_CDS_3|591_bp atgtcccgccgctctatgctgcttgcctgggctctccccagcctccttcgactcggagcg gctcaggagacagaagacccggcctgctgcagccccatagtgccccggaacgagtggaag gccctggcatcagagtgcgcccagcacctgagcctgcccttacgctatgtggtggtatcg cacacggcgggcagcagctgcaacacccccgcctcgtgccagcagcaggcccggaatgtg cagcactaccacatgaagacactgggctggtgcgacgtgggctacaacttcctgattgga gaagacgggctcgtatacgagggccgtggctggaacttcacgggtgcccactcaggtcac ttatggaaccccatgtccattggcatcagcttcatgggcaactacatggatcgggtgccc acaccccaggccatccgggcagcccagggtctactggcctgcggtgtggctcagggagcc ctgaggtccaactatgtgctcaaaggacaccgggatgtgcagcgtacactctctccaggc aaccagctctaccacctcatccagaattggccacactaccgctccccctga >gi568815579r:45919241_46123021|GENSCAN_predicted_peptide_4|137_aa MKTFALVLLMALLCMKRAQGLHCYRCLAVSERNSCRVVMGLFQEGICVSQKAAYAKGHLQ ASTKLPSAPPQLSSYAHRCPKSGGGCGRGLVCQHCPGPHDGSGSSRRATAAIITMSIYMN NESQLLVDLEVSISSCD >gi568815579r:45919241_46123021|GENSCAN_predicted_CDS_4|414_bp atgaagacctttgccctggtcctgctgatggccctgctgtgcatgaagagagctcagggt ctgcactgctacaggtgcttggcagtctcggaaaggaactcctgccgtgtggtcatgggc ctcttccaggaggggatctgtgtctcccagaaagctgcttatgccaaagggcacctgcag gccagtaccaagctgccctcagcaccccctcagctttcctcctatgctcatcggtgccca aagtctggagggggctgtggcagggggctggtgtgtcagcactgccccggcccacatgat ggcagtggcagctccagacgggccactgctgccatcattactatgagcatctacatgaat aatgaatcacagctgctagtggacttagaagtttccattagcagctgtgattga >gi568815579r:45919241_46123021|GENSCAN_predicted_peptide_5|482_aa MASPGPAPTASPKIHSELLTATREAQRHHPVPRGQDLVTSESFSLSFCFSAAIFIFELLG SNSEGVTDLRLWLCQPAPRCGEWTYNPLEQCCDDGVILDLNQTRLCGSSCTFWPCFQHCC LESLGSQNQTVVRFKVPGMKPDCKSSPITRICAQAGVQISNFSKVSGYKINVQKSQAFLY TNNRQTESQIMSELPFTIASQRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIP CSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRALIAKSILSQ KNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKN KQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGI TIQDIGMSKDFMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRVEQATYKMGENFCNL LI >gi568815579r:45919241_46123021|GENSCAN_predicted_CDS_5|1449_bp atggcgtccccaggacctgcacccacagccagtcccaaaatccactctgagctgttaaca gcaaccagagaagctcagagacatcatcctgtgccccgtggtcaagacctagtgacctca gaatcattctcgctctccttctgcttctcagctgccatcttcatttttgaacttttgggt tcaaactcagaaggagtcacagatcttagactgtggctatgccagccagcgcccaggtgc ggggagtggacctacaaccccttggagcagtgctgtgatgacggtgtcatcctagacttg aaccagacccggctctgcggctccagctgcaccttctggccctgcttccagcactgctgc ctggagtctttgggctctcagaaccagacagttgtgaggttcaaggtcccaggcatgaag ccagattgcaagtcctcccctatcaccaggatctgtgcccaggctggagtgcaaataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac accaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca cagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggag aactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacattcca tgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttac agattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagccctcatcgccaagtcaatcctaagccaa aagaacaaagctggaggcatcacactacctgacttcaaactatactataaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagag ccctcagaaataacgccgcatatctacaactatctgatctttgacaaacctgagaaaaac aagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagccata tgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaagatgg attaaagacttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcatt accattcaggacataggcatgagcaaggacttcatgtctaaaacaccaaaagcaatggca acaaaagacaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaa gaaactaccatcagagttgaacaggcaacctacaaaatgggagaaaatttttgcaaccta ctcatctga >gi568815579r:45919241_46123021|GENSCAN_predicted_peptide_6|208_aa MKEQTVVLSNRKERKEKGEKGRKALPVAGGLSASSSTCAQAQSEFQERVSQPAPRCGNQI CDIMENCSMDKPMKAKLRMQVFQEADENHVPDMEIKTLSPHSTGQGSHKPPLDCDSFSDF SLFLMTFTVMSSTGKELAYFLAQDASDPYQGLKGTGNLDRLTLKKTEVARGQPKETRGGD PSIIQEVDPAANSGAFQVSRASGSRNSQ >gi568815579r:45919241_46123021|GENSCAN_predicted_CDS_6|627_bp atgaaggagcagacagttgtcctgagtaacagaaaagagagaaaagagaaaggagagaaa gggagaaaagcattgcctgtggcagggggcttgtctgcatcctcatccacgtgtgctcag gctcagagtgagtttcaggaaagagtcagtcagccagcccccaggtgtggaaaccagatc tgtgacatcatggagaactgctccatggataagcccatgaaagccaagctccgcatgcaa gtattccaagaagctgatgaaaatcatgtccctgacatggaaatcaagacactttcacca cattctactggtcaaggaagtcacaagcctcctcttgactgtgacagtttctcagatttt tccttgtttttgatgaccttcacagttatgagcagtactggtaaggaacttgcttatttt ctggcacaagatgcttcagatccttaccagggtctgaagggcactgggaacttggaccgc ctgacattgaaaaagactgaggtagctcgggggcagccaaaagaaaccagaggcggggat cctagtatcattcaggaagtggacccagccgcaaactcaggagcgttccaagtgtccaga gcgtctggtagtcgcaatagccagtga >gi568815579r:45919241_46123021|GENSCAN_predicted_peptide_7|145_aa MAVGDCSENGERRPPCSAQASSVDIAQASSVDIAQASSVDIAQASSVDIAQASSVDIAQA SSVDIAQASSVDIAQASSVDIAQASSVDIAQASSVDIAQASSVDIAQASSVDIAQASSVD IAQASSVDIAQAMGEAALAAMFSSQ >gi568815579r:45919241_46123021|GENSCAN_predicted_CDS_7|438_bp atggcagtaggagactgctcagaaaatggagagcggcgcccaccttgcagtgcgcaggcg tcgtcagtagacatcgcgcaggcgtcgtcagtagacatcgcgcaggcgtcgtcagtagac atcgcgcaggcgtcgtcagtagacatcgcgcaggcgtcgtcagtagacatcgcgcaggcg tcgtcagtagacatcgcgcaggcgtcgtcagtagacatcgcgcaggcgtcgtcagtagac atcgcgcaggcgtcgtcagtagacatcgcgcaggcgtcgtcagtagacatcgcgcaggcg tcgtcagtagacatcgcgcaggcgtcgtcagtagacatcgcgcaggcgtcgtcagtagac atcgcgcaggcgtcgtcagtagacatcgcgcaggcgatgggtgaggcggctttggccgcc atgttttcgtcgcagtaa >gi568815579r:45919241_46123021|GENSCAN_predicted_peptide_8|84_aa MKKAYGIYETASEEQMFGSLELKRETKSDLEDQRLACLDLLTLMPVYATLGPKDRHIQPT TATVAQNLAYLASQSPAKLHNSLH >gi568815579r:45919241_46123021|GENSCAN_predicted_CDS_8|255_bp atgaagaaagcttatgggatttatgagacagcatcagaagagcaaatgtttgggtcactg gaattaaagagggagaccaagagcgacctggaggaccaacgattggcctgcctggacctg ctaacactgatgccagtgtatgccaccctggggcccaaagacaggcacattcagcccact actgccaccgtggcccaaaacctggcctacctggcatcccagtctccagcaaaacttcat aatagccttcactaa >gi568815579r:45919241_46123021|GENSCAN_predicted_peptide_9|86_aa MELKNTAQELRDANTAQKLLKLTSNFSKVSGYKIIVQKSQAFLYTNNRQAERQIMDELPF TIATKRIKYLEIQLTRDVKNPFKEYY >gi568815579r:45919241_46123021|GENSCAN_predicted_CDS_9|261_bp atggagctgaagaacacagcacaagaacttcgtgatgcgaacacagcccaaaagctcctt aagctgacaagcaacttcagcaaagtctcaggatataaaatcattgtgcaaaaatcacaa gcattcctatacaccaacaacagacaagcagagagacaaatcatggatgaactcccattc acaattgctacaaagagaataaaatacctagaaatacagctaacaagggatgtgaagaac cccttcaaggagtactactaa