GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:37:17 Sequence gi568815594f:54000345_54201919 : 201575 bp : 40.76% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 21815 22007 193 0 1 61 110 197 0.459 16.83 1.02 Intr + 23602 23683 82 1 1 65 65 97 0.565 3.82 1.03 Term + 26361 26375 15 2 0 106 48 10 0.311 -3.94 1.04 PlyA + 27526 27531 6 1.05 2.00 Prom + 29366 29405 40 -3.25 2.01 Sngl + 44710 46176 1467 0 0 70 48 426 0.975 32.12 2.02 PlyA + 46220 46225 6 1.05 3.03 PlyA - 47518 47513 6 1.05 3.02 Term - 57339 57204 136 2 1 77 38 71 0.113 -2.49 3.01 Init - 63956 63838 119 2 2 85 58 279 0.836 24.32 3.00 Prom - 68861 68822 40 -5.55 4.00 Prom + 69603 69642 40 -4.95 4.01 Sngl + 86748 87020 273 2 0 59 35 235 0.979 10.38 4.02 PlyA + 87131 87136 6 1.05 5.00 Prom + 91351 91390 40 -7.65 5.01 Init + 92822 92934 113 1 2 74 94 85 0.955 7.43 5.02 Term + 92993 93161 169 2 1 31 44 163 0.591 2.57 5.03 PlyA + 93307 93312 6 1.05 6.00 Prom + 93579 93618 40 -7.75 6.01 Init + 100001 100574 574 1 1 79 45 264 0.158 15.38 6.02 Intr + 101238 101481 244 1 1 106 51 283 0.392 22.03 6.03 Term + 101617 101779 163 1 1 84 45 121 0.378 3.83 6.04 PlyA + 102132 102137 6 1.05 7.00 Prom + 106196 106235 40 -7.75 7.01 Init + 107250 107340 91 2 1 47 76 78 0.714 3.44 7.02 Intr + 107826 107961 136 1 1 67 43 98 0.779 1.91 7.03 Intr + 108573 108723 151 0 1 25 39 193 0.389 7.34 7.04 Intr + 109305 109589 285 2 0 111 60 48 0.016 0.91 7.05 Intr + 129177 129308 132 2 0 91 87 46 0.512 4.72 7.06 Intr + 138889 139135 247 0 1 91 56 110 0.420 4.21 7.07 Intr + 139605 139731 127 1 1 65 70 81 0.716 2.82 7.08 Term + 140947 141166 220 1 1 31 48 143 0.660 0.13 7.09 PlyA + 141452 141457 6 1.05 8.06 PlyA - 142271 142266 6 1.05 8.05 Term - 149703 149647 57 0 0 142 48 65 0.174 4.61 8.04 Intr - 165779 165693 87 2 0 82 97 24 0.697 1.95 8.03 Intr - 167153 167032 122 0 2 72 56 125 0.603 6.99 8.02 Intr - 169822 169671 152 0 2 35 46 81 0.348 -2.51 8.01 Init - 170068 169890 179 1 2 64 94 170 0.399 13.98 8.00 Prom - 175266 175227 40 -4.55 9.02 PlyA - 179701 179696 6 1.05 9.01 Term - 195013 194922 92 2 2 128 42 81 0.685 4.40 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:54000345_54201919|GENSCAN_predicted_peptide_1|96_aa XPYWISDCPTRPAATPRAPGTLAQGSLTDFFPDLLSLVAEDRRCPIASEASWTITDALGN SYSGGKLESFTASKGHQKVSDLIVQGNAYADKYAKL >gi568815594f:54000345_54201919|GENSCAN_predicted_CDS_1|291_bp ngaccctactggatatcagactgtccaactcgcccggcagccactcccagagcccctgga actctggcccaaggctctctgactgacttcttcccagatcttctcagcttagtggctgaa gaccgacgctgcccgatcgcctcagaagcttcctggaccatcacagacgctctgggtaac tcttacagtggagggaagctggagtcattcactgcaagcaagggccatcaaaaggtgtca gatctcatcgttcagggcaacgcttatgctgataagtatgccaaactctga >gi568815594f:54000345_54201919|GENSCAN_predicted_peptide_2|488_aa MDKFLDTYTLPRLNQEEVECLNRPITGSEIEAIISSLPIKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKQGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNSDAKILNKILA NQIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRTKDKNHMIISIDAEKAFDKI QQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLIF NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKV SGYKIKVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPL LKEIKKDTNKWKNIPCSWIARINIVKMDTLPTVIYRFNAIPIKLPMTFFTELEKTTLKLI WNQKRARIAKSILSQKNKAGGITLPDFKLYYKDTVTKTAWYWYQNRYRPMEQNRALRKNA TYLQLSDL >gi568815594f:54000345_54201919|GENSCAN_predicted_CDS_2|1467_bp atggataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatgt ctgaatagaccaataacaggctctgaaattgaggcaataattagtagcttaccaatcaaa aaaagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg gtaccattccttctgaaattattccaatcaatagaaaaacaaggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagccgggcagagacacaacaaaaaaagag aattttagaccaatatccttgatgaacagtgatgcaaaaattctcaataaaatactggca aaccaaatccagcaacacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaacatacgaaaatcaataaacgtaatccagcatataaacaga accaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacaacccttcatgctaaaaactctcaataaattaggtattgatgggacatatctcaaa ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcatattc aacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggcatt caattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatcta gaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc tcaggatacaaaatcaaggtgcaaaaatcacaagcattcttatacaccaataacagacaa acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg ctcaaggaaataaaaaaggatacaaacaaatggaagaacattccatgctcatggatagca agaatcaatattgtgaaaatggacacactgcccacggtaatttatagattcaatgccatc cccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagctcata tggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctgga ggcatcacgctacctgacttcaaactatactacaaggatacagtaaccaaaacagcatgg tactggtaccaaaacagatatagaccaatggaacagaacagagccctcagaaaaaatgcc acatatctacaactatctgatctttga >gi568815594f:54000345_54201919|GENSCAN_predicted_peptide_3|84_aa MADFDEIYEEEEDEERALEEQLLKYSPDPVVVRGSGHVTVYCTKLWVYNETQNETHIVSA LMEFTANRQILSDKVALKEMDEIL >gi568815594f:54000345_54201919|GENSCAN_predicted_CDS_3|255_bp atggcggatttcgacgaaatctatgaggaagaggaggacgaggagcgggccctggaggag cagctgctcaagtactcgccggacccggtggtcgtccgcggctccggtcacgtcaccgtg tattgcacaaagctctgggtatataatgaaacccaaaatgaaacacacatagtttctgcc ctcatggagtttacagctaatagacaaattttatctgataaagttgcactgaaggaaatg gatgagatcctgtga >gi568815594f:54000345_54201919|GENSCAN_predicted_peptide_4|90_aa MWGRSAENFLASGSVRKTRKNKQRTPGNRYGGSTSKAPQTPGKERAWADLTVESEDAFKN RMEVKVKMPEESKPWLAGDWNLVTKQKQLF >gi568815594f:54000345_54201919|GENSCAN_predicted_CDS_4|273_bp atgtggggtcgctctgcagagaacttcttagcttcaggatccgtgaggaagaccagaaag aacaaacagaggactcctggaaacagatatgggggcagtaccagcaaagcacctcagacc ccagggaaggaaagagcctgggcagacctcactgttgaaagcgaggacgcatttaagaac agaatggaagttaaagtgaagatgcctgaagaatcgaaaccatggcttgctggggactgg aacttagttaccaagcagaagcagctgttttaa >gi568815594f:54000345_54201919|GENSCAN_predicted_peptide_5|93_aa MQARLLVQELPHSESSENVPTSIVIYSKGDCFPPMQSRAQSQVGTARTPEPLTGVGSGEV NVTAEQTYARKPRDRKAPHWAPSGCERQAVEEP >gi568815594f:54000345_54201919|GENSCAN_predicted_CDS_5|282_bp atgcaggcaaggctcttggttcaagaactgccacacagtgagagctcagaaaatgtacct acgagtattgtcatatactcgaaaggcgactgttttcctcctatgcaaagcagggcgcag tcccaagtagggacggccaggacgccagagcctttgacgggggtggggagtggcgaggtt aatgtaactgctgaacaaacatatgcccggaaaccgagggatagaaaggccccacactgg gcgccttcagggtgcgagcgccaggctgtggaggagccttag >gi568815594f:54000345_54201919|GENSCAN_predicted_peptide_6|326_aa MSRSFYVDSLIIKDTSRPAPSLPEPHPGPDFFIPLGMPPPLVMSVSGPGCPSRKSGAFCV CPLCVTSHLHSSRGSVGAGSGGAGAGVTGAGGSGVAGAAGALPLLKGQFSSAPGDAQFCP RVNHAHHHHHPPQHHHHHHQPQQPGSAAAAAAAAAAAAAAAALGHPQHHAPVCTATTYNV ADPRRFHCLTMGGSDASQVPNGKRMRTAFTSTQLLELEREFSSNMYLSRLRRIEIATYLN LSEKQVKIWFQNRRVKHKKEGKGTQRNSHAGCKPGNAKAWGTQGPESLLIAVVPSGEETN LEVLEWTSRDLTLRVSFLTCLSRTPT >gi568815594f:54000345_54201919|GENSCAN_predicted_CDS_6|981_bp atgtcgcgctccttctatgtcgactcgctcatcatcaaggacacctcacggcctgcgccc tcgctgcctgaaccgcaccccgggccggatttcttcatcccgcttggcatgccgccccca ttggtgatgtccgtgtccggccccggctgcccgtcccgcaagagcggcgcgttctgcgtg tgccctctctgcgtcacttcgcacctgcactcctctcgggggtctgtgggcgccggcagc gggggcgcaggggccggggttaccggggccggaggcagtggggtggcaggggccgcaggg gcactgcctctgcttaagggccagttctcttcggctcctggggacgcgcagttttgcccg cgggtgaaccatgcgcatcatcaccaccacccgccgcagcaccaccatcaccatcatcag ccccagcagcctggctcggccgcggcggcggcagcagcagcagcggcggcggcggccgcg gcggccttggggcacccgcagcaccacgcacctgtctgcaccgccaccacctacaacgtg gcggacccgcggagattccactgcctcaccatgggaggctctgacgccagccaggtaccc aatggcaagaggatgaggacggcgttcactagcacgcaactcctggagctggagagagaa ttctcttccaacatgtacctgtctcgactccggaggattgaaatcgccacttacctgaac ctgtcggagaagcaggtgaaaatctggtttcagaaccgccgagtgaagcacaagaaggag gggaagggcacgcagaggaacagtcacgcgggctgcaaaccaggcaacgccaaggcgtgg ggcacccaggggccagaatccttgctcattgcagtggtcccatctggagaagaaacgaac ctggaagttctggaatggacaagccgggacctgacccttcgcgtctccttcttgacctgt ttatctaggactccaacttga >gi568815594f:54000345_54201919|GENSCAN_predicted_peptide_7|462_aa MGKSSLPMSGSRLQGAEASLVSVCCEGMEQVSPGHGSILLALRGKRRMLKLPFLLGYLLR AQASRNPTASSVAGSGPGFELSKQATRKDILNAALSINQEKRTAWYRNHNHLPVLQRLTW ALREKQVGGGEGRWVDAEGGRLSLGSLPPEQRRLCPRGKPRLSGSRGSEWLPRPLRSQQG QEDLAGTSVRAGRDVCPSLASWPSVGWDALDHRRRKEAPQPDFQYFPLLKVDILVLQILE KGFQFFPMQYDTSCESVIYGFYYVEQRPCSTENLGAWERESAAIVRLFIEPSAPLLQWKA EPGSTQLMFIWPSSSQREIVHPTGGSLSSSKPHYRGLECSGALSELKGTSSATVGWSNKQ TLQVPTSKPRLLVSISGPAMDQKEAHYSEENSEFYQTNLTKRLKLLKRNEILVLKSTTAM LKNASGSLNNRRDQAEERINELEDRLFENPQSEETKRKSKKQ >gi568815594f:54000345_54201919|GENSCAN_predicted_CDS_7|1389_bp atgggaaagagctccctgcctatgagtggcagccggctccaaggagcagaagccagcttg gtctcagtttgctgcgaagggatggagcaggtctcccctggccatggctcaatcctccta gctctccggggaaagaggaggatgctaaagctgccattcctacttgggtatctactgcgg gcacaggcatctcggaatcccactgcatcatcggtggcaggaagtggacctggatttgaa ctgtcaaaacaagcaacaagaaaagatattctgaacgcagccctgtcaatcaaccaggag aaacgaacagcgtggtatcgaaatcataatcaccttccagttcttcagcgcctcacttgg gccttacgtgagaagcaagtcgggggtggggaggggcggtgggtggacgcagagggagga aggctcagccttggatccttgccacctgaacaacgtcgcctctgcccacgtggaaagcct cgcttgtctggcagccggggttctgaatggttgccgcgacccctgcgctcccagcagggc caagaggacctcgcgggcaccagcgtccgggcgggaagggacgtgtgcccaagcctcgcc tcctggccctcagtgggctgggacgcccttgatcaccggcgcaggaaagaggctccccag cccgacttccagtactttccattgttgaaagtggacatacttgtgctccagatcttagag aaaggcttccagtttttccccatgcagtatgatactagctgtgagtctgtcatatatggc ttttattatgttgagcagcggccatgtagcacagagaatctgggtgcttgggagagggaa agtgcagcaattgtgaggcttttcattgaacccagtgctcccctgttacagtggaaagca gaaccaggttctactcagctgatgttcatttggcccagctctagccagagagaaattgtc catcccactggtgggagcttgagttccagcaagcctcattaccgtgggctggagtgctct ggggcactaagtgaacttaagggtaccagctcagccacagtggggtggagcaataagcag actcttcaggtccccacgtccaagcctaggctcttggttagcatttctggacctgccatg gaccagaaggaagcccactactctgaagagaattcagaattctatcagacaaatttaaca aagagattgaagttattaaaaagaaatgaaattctagtgctgaaaagtacaactgccatg ctgaagaatgcatctgggtctcttaataacagacgtgatcaagcagaagaaagaattaat gagcttgaagacaggctatttgaaaatccacagtcagaggagacaaaaagaaagagtaaa aaacaatga >gi568815594f:54000345_54201919|GENSCAN_predicted_peptide_8|198_aa MEAGTIELPNRGTDWTGTMSKKHNGFGSQGFVRKWSAPVTCNPGRLNERESSRRHYTEGR TGKRKAKQQELNSRQGFGAGFQSVGPEPVGARIPDTDRKGSEVEDRDWEHYMVFIPSGRY GNKAPSESREQPSPGTDPASTLILDFSASRTTSILHQSSRLWEVDHVLQQFPDSWNTVFK VGEIEVYKGEREEADAET >gi568815594f:54000345_54201919|GENSCAN_predicted_CDS_8|597_bp atggaggctggcacgatagagctgccaaatagaggtactgactggactggcacaatgtcc aagaaacacaacggatttggctctcagggttttgttcggaaatggtcagctcctgtgact tgcaatccaggtaggctaaatgagagggaatccagccgcagacactacacagagggcagg actggcaagaggaaagcaaagcagcaggagctcaattctaggcagggatttggagcaggg tttcagtcagtagggcctgaaccagtaggggccaggatcccagatacagacaggaaagga tcagaggtggaggatagagactgggagcactacatggtattcatcccctctggaagatac ggcaacaaggcaccatccgaaagcagagagcagccctcgccaggcactgaccctgccagc accttaatcttggacttctcagcctctagaactacatccattcttcaccaatcatccagg ctttgggaagtagaccatgtactgcagcaatttcctgactcctggaacaccgtcttcaag gtgggtgagatagaagtctataaaggggaaagagaagaagctgatgctgaaacttaa >gi568815594f:54000345_54201919|GENSCAN_predicted_peptide_9|30_aa XKLSMVDLQGWETEESTVAQHEEAEPGQDA >gi568815594f:54000345_54201919|GENSCAN_predicted_CDS_9|93_bp ngtaaattgagcatggtagacctccaaggttgggagacagaggaatccacagtggcccag catgaggaagcagagcctgggcaggatgcataa