GENSCAN 1.0 Date run: 6-Nov-116 Time: 02:17:42 Sequence gi568815592f:110775106_110993613 : 218508 bp : 42.81% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 35386 35393 8 0 2 95 110 11 0.458 3.77 1.02 Intr + 39443 39710 268 2 1 69 85 231 0.512 17.51 1.03 Intr + 40050 40320 271 2 1 91 56 75 0.485 0.79 1.04 Intr + 40536 40745 210 1 0 75 81 147 0.748 10.66 1.05 Intr + 48156 48322 167 1 2 31 -5 177 0.006 1.46 1.06 Intr + 83122 83787 666 0 0 88 80 728 0.042 63.00 1.07 Intr + 83867 83997 131 1 2 83 17 158 0.545 6.77 1.08 Term + 95623 95719 97 1 1 112 39 75 0.562 1.46 1.09 PlyA + 95789 95794 6 1.05 2.00 Prom + 97352 97391 40 -6.25 2.01 Init + 98900 98981 82 1 1 43 90 58 0.609 2.74 2.02 Intr + 99372 99507 136 1 1 92 -3 129 0.116 2.91 2.03 Intr + 99991 100254 264 1 0 41 91 331 0.117 24.40 2.04 Intr + 113752 113878 127 1 1 83 108 13 0.479 2.56 2.05 Intr + 115149 115251 103 2 1 39 108 63 0.548 2.23 2.06 Intr + 117220 117338 119 2 2 32 60 164 0.654 7.26 2.07 Intr + 117630 117722 93 2 0 117 76 131 0.999 14.04 2.08 Intr + 117805 117960 156 0 0 82 91 124 0.999 11.39 2.09 Term + 130038 130388 351 2 0 77 43 272 0.077 15.10 2.10 PlyA + 130454 130459 6 1.05 3.00 Prom + 130491 130530 40 -16.61 3.01 Sngl + 130599 131234 636 2 0 45 43 336 0.685 20.73 3.02 PlyA + 131462 131467 6 1.05 4.00 Prom + 131630 131669 40 -6.15 4.01 Sngl + 131723 132151 429 1 0 49 39 194 0.804 6.63 4.02 PlyA + 132370 132375 6 1.05 5.00 Prom + 133086 133125 40 -7.35 5.01 Sngl + 133553 134215 663 1 0 60 42 199 0.277 8.42 5.02 PlyA + 134479 134484 6 -0.45 6.00 Prom + 134851 134890 40 -3.65 6.01 Init + 142297 142610 314 0 2 72 74 390 0.814 32.84 6.02 Intr + 142847 143030 184 2 1 34 9 117 0.085 -2.73 6.03 Term + 146154 146558 405 2 0 3 28 258 0.065 5.50 6.04 PlyA + 146738 146743 6 1.05 7.00 Prom + 147827 147866 40 -6.35 7.01 Sngl + 148461 148862 402 2 0 77 43 473 0.999 37.72 7.02 PlyA + 151867 151872 6 1.05 8.00 Prom + 164463 164502 40 -5.95 8.01 Init + 183665 183724 60 1 0 115 60 121 0.005 13.30 8.02 Intr + 184067 184147 81 1 0 74 97 27 0.009 1.12 8.03 Intr + 185309 185372 64 1 1 113 97 115 0.020 12.27 8.04 Intr + 185467 185511 45 2 0 103 115 50 0.019 6.66 8.05 Intr + 187287 187400 114 1 0 92 78 65 0.975 5.40 8.06 Term + 192405 192685 281 1 2 68 43 259 0.954 14.02 8.07 PlyA + 195477 195482 6 1.05 9.02 PlyA - 197788 197783 6 1.05 9.01 Term - 207266 206871 396 2 0 14 44 311 0.087 13.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 99372 99595 224 1 2 92 47 211 0.876 13.50 S.002 Init + 100001 100254 254 1 2 63 91 303 0.813 24.96 S.003 Term + 118371 118511 141 2 0 82 38 124 0.953 3.75 S.004 Init + 127159 127313 155 0 2 100 51 78 0.805 4.81 S.005 Intr + 185309 185511 203 2 2 113 115 134 0.980 16.61 S.006 Init + 206855 206961 107 1 2 90 9 196 0.845 11.64 S.007 Intr + 209901 210033 133 0 1 62 103 124 0.864 11.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:110775106_110993613|GENSCAN_predicted_peptide_1|605_aa MARLPGEVSNRRPREALSRLEVPQRPSLPRAGAATPQRSSACPLRSPKRLDPTNSSSTAV PTRVPPPSPSTRRAPLWDGTDASDRAAPREFERHGTRGPPPLSPSSSSPRDRRSTSPTAA SRARARAARRPPLRGPPSARDSSAATAATSSTSSSSSSATAAAAPAGTPSPASLLHFLVL EPGSARLPVPVKVESSKFQCAFCGSRGDTGKKCLTLVGRGYFTVTSPTQRVWKALNQHLA EGLAAVTREARAANITLNGEELKAFPLRAGTRQECPLSLLLLNVVLEVLTRSIRQEKGMK GIRIGKEKRAPAMSSTQFNKDPSYGLSAEIKNRLLCKYDPQKEVELRSWIEGLTSLSIGP DFQKGLKDGISLWTLMNKLQPGSVLKINRSMQNWHQLENLSNFIKAMVSYGMNTVDLFDA NDLFESGNMTQLQVSLLALVSKARTKGLQSGVDIGLKYSEKQEPNFDDATMKAGQCTIGL QMGTNKCTSQSGMTAYSTRRHLYDPKNHILPPMDNLTISLQMGTNKCASQMGYTQGAKQS GQVFGLGRQIHDPKFCPEGTVADRAPSGAQARGRLDCGQIKPASIMRIKTLLYGQNGLEG TWFPE >gi568815592f:110775106_110993613|GENSCAN_predicted_CDS_1|1818_bp atggccaggctccccggcgaggtttcaaaccgccgcccccgcgaggcgctcagccgattg gaagttccacaacgaccgtcactacccagggccggagcggcaactccgcagcggagctct gcctgcccgctgcgttcccccaagagactagaccccacaaactcctcctccaccgcggtc ccaactcgagtccccccgccgtctccgagcactcggagggcgcccctctgggacgggacc gacgcctcggaccgggctgcgccgcgggagttcgagaggcacgggacgcgggggccgccg ccgctcagtccctcctcctcctccccccgcgaccgccgctccacttctccaacagccgcc tctcgcgcgcgcgcgcgcgccgcccgccgcccgccgctccgcggtccgccttcagcaagg gactcctcggcggccacagcagccacctcctccacctcttcctcctcctcctccgcgacg gcggcggcggctcccgcaggcacccccagtcccgcctccctccttcatttccttgttttg gaacctggtagcgcacggctcccggttcctgtgaaggtagaaagctcgaagttccagtgt gctttctgcggttctcggggagacactggaaagaagtgccttactctagtgggaagggga tacttcactgtaactagccccacgcagagagtatggaaagcattaaatcagcaccttgct gaagggttggctgcagtgactcgggaggcgagggctgccaacatcacactgaatggggaa gagttgaaagcatttcccctgagagctggaacaagacaagaatgcccactttcacttctt ctactcaacgtagtactggaagtactaaccagatccatcaggcaagagaaaggaatgaaa ggcatccgaattggaaaagagaaacgcgcgccagccatgagctccacgcagttcaacaag gacccctcgtacgggctgtcagccgagatcaagaaccggctcctgtgcaaatatgacccc cagaaggaggtagagctccgcagctggatcgagggactcaccagcctctccatcggcccc gacttccagaagggcctgaaggacgggattagtttatggacactcatgaacaagctacag ccgggctcagtcctcaagatcaaccgttccatgcagaactggcaccaactagaaaacctc tccaatttcatcaaggccatggtcagctacggcatgaacaccgtggacctgttcgatgcc aacgacctgtttgagagtgggaacatgacgcagctgcaggtgtctcttctcgccctggtg agtaaggccaggactaaggggctgcagagcggggtggacatcggccttaagtactcggag aagcaggagccgaacttcgacgacgccaccatgaaggctggccagtgcaccatcgggctg cagatgggcaccaacaaatgcaccagccagtcgggcatgaccgcgtacagcacaaggagg catctctacgaccccaagaaccacatcctgccccccatggacaacttaaccatcagcctc cagatgggtacaaacaagtgtgccagccagatgggctacacgcagggcgccaagcaaagc ggccaggtctttggcctgggccggcagatacatgaccccaagttctgcccagaaggcaca gtggccgacagggctccctcgggcgcgcaggctcgggggaggctagactgtggacagatt aagccagcttcaataatgagaataaagactttactctatggtcaaaatgggttagaagga acctggttccctgaatga >gi568815592f:110775106_110993613|GENSCAN_predicted_peptide_2|476_aa MEQRSCQCFPEAAVLSQFGGMASHTVGALHSLSSFRSRSPHPLCVRSEVSSPQLDITAPT PTQEPARARAAAHLTVMEAAHFFEGTEKLLEVWFSRQQPDANQGSGDLRTIPRWVPGALA DIRAWGLSPPPRHQPRVEPEFPQLSVGGKSAAWGRFGGLGSESSMFVSKRRFILKTCGTT LLLKALVPLLKLARDYSGFDSIQSFFYSRKNFMKPSHQGYPHRNFQEEIEFLNAIFPKSR VISQPDQTLEILMSELDPAVMDQFYMKDGVTAKDVTRESGIRDLIPGSVIDATMFNPCGY SMNGMKSDGTYWTIHITPEPEFSYVSFETNLSQTSYDDLIRKVVEVFKPGKFVTTLFVNQ PPLVIPRQTGSGVDLQQTPTDLQLRVLTVRRNTNKQDIYTKTLSARHHHQRPKVGKTLKM GRNQSRKAENSKNQSTSSPPKERNSSPAMEQSWMENDFDELREEGFRRSVITTSPS >gi568815592f:110775106_110993613|GENSCAN_predicted_CDS_2|1431_bp atggagcaacggagctgccagtgcttcccggaggctgctgtgctcagccaattcggaggc atggcgagccacacggtgggagctctgcacagcctttcctcattccgttcccgctctccc cacccactctgcgttcgatccgaggtttctagtccccagctggacatcacagcaccaact cccactcaggagccggccagagcccgagccgcagcgcatctcacggtgatggaagctgca cattttttcgaagggaccgagaagctgctggaggtttggttctcccggcagcagcccgac gcaaaccaaggatctggggatcttcgcactatcccaaggtgggtccccggggcgctcgct gacatccgggcctgggggctgtcgccgccgccgaggcaccagccacgggtggagcccgag ttccctcagctttcagttgggggcaagtctgcggcctggggtcgcttcggcggccttgga agtgagagtagcatgtttgtctccaagagacgtttcattttgaagacatgtggtaccacc ctcttgctgaaagcactggttcccctgttgaagcttgctagggattacagtgggtttgac tcaattcaaagcttcttttattctcgtaagaatttcatgaagccttctcaccaagggtac ccacaccggaatttccaggaagaaatagagtttcttaatgcaattttcccaaagagtcgg gtaatcagtcagccagatcaaaccttggaaattctgatgagtgagcttgacccagcagtt atggaccagttctacatgaaagatggtgttactgcaaaggatgtcactcgtgagagtgga attcgtgacctgataccaggttctgtcattgatgccacaatgttcaatccttgtgggtat tcgatgaatggaatgaaatcggatggaacttattggactattcacatcactccagaacca gaattttcttatgttagctttgaaacaaacttaagtcagacctcctatgatgacctgatc aggaaagttgtagaagtcttcaagccaggaaaatttgtgaccaccttgtttgttaatcag cctccgctggtgatacccaggcaaacagggtctggagtggacctccagcaaactccaaca gacttgcagctgagggtcctgactgttagaaggaacactaacaaacaggacatctacacc aaaaccctatctgcacgtcaccatcatcaaagaccaaaggtaggtaaaaccctaaagatg gggagaaaccagagcagaaaagctgaaaattctaaaaatcagagcacctcttctcctcca aaggaacgcaactcctcaccagcaatggaacaaagctggatggagaatgactttgatgag ctgagagaagaaggcttcagacgatcggtaataacaacttctccgagctaa >gi568815592f:110775106_110993613|GENSCAN_predicted_peptide_3|211_aa MKREEKLREKRVKRNEQSLQETWEYVKRPNLHLIGVPESDGENGTKSENTLQDIIQENFP NLARQANIQIQEIQRMPQRYSSRRATPRHIIVKFTKVEMMEKMLRAARETGRVTHKGKPI RLTADLLAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKSFTDKQMLRDFV TTRPALQELLKEALNMERNNRYQPLQKHAKL >gi568815592f:110775106_110993613|GENSCAN_predicted_CDS_3|636_bp atgaagcgagaagagaagttaagagaaaaaagagtaaaaagaaacgaacaaagcctccaa gaaacatgggagtatgtgaaaagaccaaatctacatctgattggtgtacctgaaagtgac ggggagaatggaaccaagtcggaaaacactcttcaggatattatacaggagaacttcccc aacctagcgaggcaggccaacattcaaattcaggaaatacagagaatgccacaaagatac tcctcaagaagagcaactccaagacacataattgtcaaattcaccaaagttgaaatgatg gaaaaaatgttaagggcagccagagagacaggtcgggttacccacaaagggaagcccatc agactaacagcggatctcttggcagaaactctacaagccagaagagagtgggggccaata ttcaacattcttaaagaaaagaattttcaacccagaatttcatatccagcaaaactaagc ttcataagtgaaggagaaataaaatcctttacagacaaacaaatgctgagagattttgtc accaccaggcctgccctacaagagctcctgaaggaagcactaaacatggaaaggaacaac cggtaccagccactgcaaaaacatgccaaattgtaa >gi568815592f:110775106_110993613|GENSCAN_predicted_peptide_4|142_aa MGDFNTPLSTLDSSMRQKVNKDIQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYF EIDHIVGSKALLSKCQRTEIITNYLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWV NNEMKAEIKMFFETNENKVTTY >gi568815592f:110775106_110993613|GENSCAN_predicted_CDS_4|429_bp atgggagactttaacaccccactgtcaacattagacagttcaatgagacagaaagttaac aaggatatccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagaatatacattcttctcagcaccacatcacacttatttc gaaattgaccacatagttggaagtaaagcactcctcagcaaatgtcaaagaacagaaatt ataacaaactatctctcagaccacagtgcaattaaactagaactcaggattaagaaactc actcaaaatcgctcaactacatggaaattgaacaacctgctcctgaatgactactgggta aataatgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagtcaca acatactag >gi568815592f:110775106_110993613|GENSCAN_predicted_peptide_5|220_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIV KVAILPKVLYRFNAIPIKLPMTFFMELEKTTLKFIWNKKRACIAKRVLSQKNKAGGITLH DFKLYYKATVTKTAWYSYQNRDIDQWNTTEPSEIIPHIYNHLIFDKPDKNKKWRKDSLFN KWCWENWLAICRELKLDPFLTPYTKINSRQIKDLNVRPKP >gi568815592f:110775106_110993613|GENSCAN_predicted_CDS_5|663_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggacacaaacaaatggaagaacattccatgctcatggataggaagaatcaatatcgtg aaagtggccatactgcccaaggtactttatagattcaatgccatccccatcaagctacca atgactttcttcatggaattggaaaaaactactttaaagttcatatggaacaaaaaaaga gcctgcattgccaagagagtcctgagtcaaaagaacaaagctggaggcatcacgctacat gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactcgtaccaaaac agagatatagaccaatggaacacaacagagccctcagaaataataccacacatctacaac catctgatctttgacaaacctgacaaaaacaagaaatggagaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagagagctgaaactggatcccttcctt acaccttacacaaaaattaattcaagacagattaaagacttaaatgttagaccaaaacca taa >gi568815592f:110775106_110993613|GENSCAN_predicted_peptide_6|300_aa MGQVWALIHSTLEPFHTNDEEEGKYNEVAEEVTEQVCLPAKAKAAKEEEVHPYPSASSHY FEEKEWPDPPDLSFLEDTGQKVVAPVTVRAAPRVTAFSSTQAGIQKCHEGPVLGPVPNRG ISSSGHSPTCVQCLSPTTASSAAVDFCCTKAVSLLRGEPPQKVPTGMLSAAEQHLQKPAA KTEAEQLVWWRDLITKSWEIGKIITWGAGYACVSPGPNQQLIWIPSRHLKLYHEPDAKEK IPRGSQGPPGCSHVETDAEEDPNCHEQHPSNTATHLGTDQEAVTDGRRKPEESRTTSHNE >gi568815592f:110775106_110993613|GENSCAN_predicted_CDS_6|903_bp atgggacaagtgtgggctctgattcattccaccttggaaccttttcacactaatgatgag gaggaaggaaagtataacgaagtagcagaagaggtgacagagcaggtttgtttgccagct aaagctaaagcagcaaaagaggaagaggttcatccctacccttctgcatcctctcattat tttgaagaaaaagagtggcctgaccctccagatctttcttttctggaggacactgggcaa aaagtagttgccccagtgactgttcgagcagcacctcgagtgaccgctttcagttctact caggcaggaatccagaaatgccatgaggggcctgtcctgggccccgttccaaaccggggc atttccagctcaggccattcccccacctgtgtacaatgcctgtcccccaccacagccagt agtgctgcagtagatttttgctgcacaaaagctgtgagccttctacgtggagaaccccca caaaaagttccaacggggatgctatcagctgctgaacaacatctacagaaaccagctgca aagacagaagcagaacaactggtttggtggagagatctgataacaaaaagttgggaaata ggtaaaataataacttggggtgcaggttatgcttgtgtttctccaggaccaaatcaacag ctgatttggataccatcaagacacctgaaactttatcatgagccagatgccaaggaaaag attccaagaggatcccaaggaccccccggttgcagccatgttgagactgatgctgaggag gaccccaactgtcatgagcaacacccatcgaacacagccacccacctggggacagatcaa gaagctgtcacagatggcagaagaaaacctgaggaaagcaggacaaccagtcacaacgaa taa >gi568815592f:110775106_110993613|GENSCAN_predicted_peptide_7|133_aa MIPKGGRKGGHKGWARQYTSPEEIDTQLQSEKQKAREEEEQKEGGDGAAGDPKKEKKSLD SDESEDEDDYQQRRKGVEGLIDIENPNRVAQTTKKVTQLDLDGPKELLRREREEIEKQKA KERYMKMHLAGKR >gi568815592f:110775106_110993613|GENSCAN_predicted_CDS_7|402_bp atgatacctaaaggagggagaaagggaggccacaaaggctgggcgaggcaatatacaagc cctgaggagatcgacacgcagctgcagtctgagaagcagaaggccagggaagaagaggag caaaaagaaggtggagacggggctgcaggtgaccccaaaaaggagaagaaatctctagac tcagatgagagtgaagatgaagatgactaccagcaaaggcgcaaaggtgttgaagggctc attgacatcgagaatcccaaccgggtggcacagacaaccaaaaaggtcacacaactggat ctggatgggccaaaggagcttttgaggagagaacgagaggagattgagaagcagaaggca aaagagcgttacatgaaaatgcacttggctgggaagagatag >gi568815592f:110775106_110993613|GENSCAN_predicted_peptide_8|214_aa MAAAADERSPEDGEDEEEEVEQLVLVELSGIIDSDFLSKCENKCKVLGIDTERPILQVDS CVFAGEYEDTLGTCVIFEENVEHADTEGNNKTVLKYKCHTMKKLSMTRTLLTEKKEGEEN IGGVEWLQIKDNDFSYRPNMICNFLHENEDEEVVASAPDKSLELEEEEIQMNDSSNLSCE QEKPMHLEIEDSGPLIDIPSETEGSVFMETQMLP >gi568815592f:110775106_110993613|GENSCAN_predicted_CDS_8|645_bp atggcggcggcggcggacgagcggagtccagaggacggagaagacgaggaagaggaggta gagcagttggttctggtggaattatcaggaattattgattcagacttcctctcaaaatgt gaaaataaatgcaaggttttgggcattgacactgagaggcccattctgcaagtggacagc tgtgtctttgctggggagtatgaagacactctagggacctgtgttatatttgaagaaaat gttgaacatgctgatacagaaggcaataataaaacagtgctaaaatataaatgccataca atgaagaagctcagcatgacaagaactctcctgacagagaagaaggaaggagaagaaaac ataggtggggtggaatggctgcaaataaaggataatgatttctcctatcgacccaacatg atttgtaactttctacatgaaaatgaagacgaagaagtggtagcttcagccccagataaa tctttggaattggaagaggaagagattcaaatgaacgacagttcaaacctgagttgtgaa caggagaaaccaatgcacttggaaatagaagattctggtcctcttattgatataccttct gagacagaaggttctgtttttatggaaactcaaatgctgccttag >gi568815592f:110775106_110993613|GENSCAN_predicted_peptide_9|131_aa DKSGQRFNPTSLPSHGSHLRASPLSPGKQLALRRTRAGDRSAPTQVIRGVTRKSERPRQY HPHHGTLSPQRSPGLRPSALTLLDPECPSLPLPATLNQGACTCPRRLRAPEQAEPEAARN ATSGDGLVEGV >gi568815592f:110775106_110993613|GENSCAN_predicted_CDS_9|396_bp gataaaagcggccaacgcttcaatcccacatccctcccatcccacggctcccacttgaga gcttccccactgtctcctggcaagcagctcgcgctgaggaggacccgggctggtgatcga tcggcccccacccaggtaattagaggggtaaccaggaagagtgagaggccgagacagtac caccctcatcacgggaccctttcaccccaacgctccccggggctgagacccagcgcactt acactactcgatccagagtgtccatcgctaccgctacctgcaactcttaaccagggggca tgcacgtgccctcggcgattacgtgcgccggaacaggcggaaccggaagctgcgcgtaac gccacttccggggacggcctcgtagagggcgtctag