GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:41:18 Sequence gi568815595f:87890750_88091847 : 201098 bp : 37.44% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 9 4 6 1.05 1.06 Term - 1282 1218 65 2 2 110 44 37 0.221 -1.43 1.05 Intr - 9845 9762 84 0 0 59 101 54 0.004 2.67 1.04 Intr - 24617 24579 39 0 0 129 93 19 0.002 3.78 1.03 Intr - 42013 41971 43 1 1 54 99 16 0.239 -3.81 1.02 Intr - 43680 43602 79 2 1 96 59 80 0.374 4.53 1.01 Init - 51470 51112 359 2 2 43 88 240 0.461 16.12 1.00 Prom - 51879 51840 40 -7.55 2.00 Prom + 54093 54132 40 -8.05 2.01 Sngl + 55365 55538 174 2 0 75 42 307 0.986 19.44 2.02 PlyA + 55657 55662 6 1.05 3.00 Prom + 61706 61745 40 -5.25 3.01 Init + 63139 63163 25 0 1 95 115 42 0.974 7.34 3.02 Term + 78172 78389 218 2 2 11 49 185 0.761 3.22 3.03 PlyA + 80484 80489 6 1.05 4.04 PlyA - 80517 80512 6 1.05 4.03 Term - 87659 87493 167 0 2 69 47 171 0.785 8.20 4.02 Intr - 89384 89251 134 1 2 40 67 82 0.286 0.67 4.01 Init - 91055 90952 104 2 2 81 39 47 0.105 -1.14 4.00 Prom - 93418 93379 40 -3.35 5.00 Prom + 98194 98233 40 -4.75 5.01 Init + 100001 100853 853 1 1 61 87 425 0.038 34.55 5.02 Intr + 106675 106754 80 2 2 33 100 86 0.042 2.65 5.03 Term + 114350 114466 117 1 0 93 45 80 0.554 1.76 5.04 PlyA + 115119 115124 6 1.05 6.04 PlyA - 116697 116692 6 1.05 6.03 Term - 125311 124918 394 0 1 23 55 261 0.289 9.82 6.02 Intr - 126016 125727 290 1 2 56 -26 214 0.364 1.92 6.01 Init - 126155 126075 81 2 0 53 105 64 0.824 5.62 6.00 Prom - 126426 126387 40 -3.95 7.07 PlyA - 126481 126476 6 1.05 7.06 Term - 137854 137641 214 0 1 84 43 169 0.111 7.72 7.05 Intr - 149591 149548 44 2 2 53 75 55 0.168 -3.18 7.04 Intr - 150103 150041 63 1 0 97 80 45 0.837 2.60 7.03 Intr - 150631 150444 188 2 2 23 105 150 0.576 8.69 7.02 Intr - 151551 151349 203 2 2 53 26 125 0.270 0.81 7.01 Init - 152168 152089 80 2 2 69 13 111 0.833 2.29 7.00 Prom - 153146 153107 40 -4.45 8.02 PlyA - 153495 153490 6 1.05 8.01 Sngl - 165227 164724 504 2 0 83 42 490 0.999 39.69 8.00 Prom - 165414 165375 40 -8.45 9.00 Prom + 165490 165529 40 -4.95 9.01 Init + 167508 167571 64 2 1 79 37 35 0.759 -1.14 9.02 Term + 168055 168368 314 2 2 53 47 385 0.841 25.08 9.03 PlyA + 168394 168399 6 1.05 10.00 Prom + 168479 168518 40 -18.06 10.01 Init + 168571 168756 186 0 0 72 94 345 0.991 32.60 10.02 Intr + 195508 195653 146 0 2 71 75 101 0.022 5.26 10.03 Intr + 200265 200337 73 0 1 87 89 26 0.003 1.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 66223 66251 29 2 2 98 43 41 0.820 -2.14 S.002 Term + 169248 169310 63 2 0 120 46 45 0.915 0.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:87890750_88091847|GENSCAN_predicted_peptide_1|222_aa MPESPTPLLGRGILAKAGAIIHLNIGEGTPICCPLLEEGINPEVWATGGQYGRAKNAHLV QVELKDSASFPYQRQYPLRPEAQQGLQKIKDLKAQGLVKPCNSPCNTPILGVQKPNGQWR DKAVPGGQGQSSMESHRYGLLEWGQRNSAVSPSGPGLFLVGNSGVSSWFGFTAGLCYILL IFSPASTLTTFVMEQSVTLLKCFCKIQGRREDTTMWKGENVS >gi568815595f:87890750_88091847|GENSCAN_predicted_CDS_1|669_bp atgcctgaaagccccactcccttgttagggagaggcattctagcaaaagcaggggccata atacacctgaacataggagaaggaacacccatttgttgtcccctgcttgaggaaggaatt aatcctgaagtctgggcaacaggaggacaatatggacgagcaaagaatgcccatcttgtt caagttgaactaaaggattctgcctcctttccctaccaaaggcagtacccccttagaccc gaggcccaacaaggactccaaaagattaaggacctaaaagcccaaggcctagtaaaacca tgcaatagcccctgcaatactccaattttaggagtacagaaacccaatggacagtggaga gacaaagcagttccaggtggtcaagggcagtcctccatggagagtcatagatatggactg ttggaatggggacagaggaattcggctgtgagtccatctggtcctggactctttttggtt ggtaattcaggggtttcttcctggtttggattcactgctggcttatgttatattctgctc attttttcccctgcttctacccttaccaccttcgtgatggagcagtcagtcaccctgctg aaatgtttctgcaagatccagggaagaagagaagatactactatgtggaagggtgaaaat gtgtcttga >gi568815595f:87890750_88091847|GENSCAN_predicted_peptide_2|57_aa MQEQWQAFSPIGSGNGRIAGSGAQRTPCRIQRGGSQRRYATAANGSGGLRAKAQLEP >gi568815595f:87890750_88091847|GENSCAN_predicted_CDS_2|174_bp atgcaggaacaatggcaagcctttagcccaatcgggagcggcaatgggcgcatcgctgga tcaggagcacagcggacaccctgccggatccagaggggtggaagtcagcggcggtatgcg acggcggcaaacggcagtggtggacttcgagcaaaagctcagctcgaaccataa >gi568815595f:87890750_88091847|GENSCAN_predicted_peptide_3|80_aa MDENSGHYATLAMAKRDQGTAWAMDSDGASTKPWQLPRGIESASAQQSKIEVWKPPPRFQ RMYGNTWMSTQKFAAGMKPS >gi568815595f:87890750_88091847|GENSCAN_predicted_CDS_3|243_bp atggatgaaaatagcggtcattatgccactctagccatggctaaaagggaccaaggtaca gcttgggccatggattcagatggtgcaagcaccaagccttggcagctcccacgtggtatt gagtctgcaagtgcacagcagtcaaaaattgaggtttggaaacctccgcctagatttcag aggatgtatggaaacacctggatgtccacccagaagtttgctgcaggaatgaagccctca tga >gi568815595f:87890750_88091847|GENSCAN_predicted_peptide_4|134_aa MTILEKMTPHSMNTLKPATGAPAMEKRGNTHVQTQTRIWDPLNGRTERAVTQTGLKHTLL LATVWAMRREELWPFQEHRFIILPELPRTQDLSNERAVTQTGLKHSSPTHQVVGDEKERR AAALLGAQGVVHSR >gi568815595f:87890750_88091847|GENSCAN_predicted_CDS_4|405_bp atgacaatcttagagaaaatgacaccacattcaatgaacacactgaaaccagctacagga gcacctgctatggaaaagagaggaaatacacatgtacagactcagacaagaatttgggac ccactgaatggcagaactgaaagagctgtaacacaaacagggctgaaacacaccctgctg cttgccacagtgtgggcaatgaggagagaagagttgtggcctttccaggaacacagattt atcattcttcctgaacttccaagaactcaggacctgtcaaatgaaagagctgtaacacaa acaggactgaaacacagctccccgactcatcaagttgtgggtgatgagaaggaaagaaga gctgcggcccttctgggagctcagggagtggtgcactcgagatga >gi568815595f:87890750_88091847|GENSCAN_predicted_peptide_5|349_aa MDFLNSSDQNLTSEELLNRMPSKILVSLTLSGLALMTTTINSLVIAAIIVTRKLHHPANY LICSLAVTDFLVAVLVMPFSIVYIVRESWIMGQVVCDIWLSVDITCCTCSILHLSAIALD RYRAITDAVEYARKRTPKHAGIMITIVWIISVFISMPPLFWRHQGTSRDDECIIKHDHIV STIYSTFGAFYIPLALILILYYKIYRAAKTLYHKRQASRIAKEEVNGQVLLESGEKSTKS VSTSYVLEKSLSDPSTDFDKIHSTVRSLRSEFKHEKSWRRQKISGVSQRAFDTQPGVLHC GFPVAECGEEPNVPFILNLSDFIITRKSEKGVRDFFIKQKKKRTDSKLP >gi568815595f:87890750_88091847|GENSCAN_predicted_CDS_5|1050_bp atggatttcttaaattcatctgatcaaaacttgacctcagaggaactgttaaacagaatg ccatccaaaattctggtgtccctcactctgtctgggctggcactgatgacaacaactatc aactcccttgtgatcgctgcaattattgtgacccggaagctgcaccatccagccaattat ttaatttgttcccttgcagtcacagattttcttgtggctgtcctggtgatgcccttcagc attgtgtatattgtgagagagagctggattatggggcaagtggtctgtgacatttggctg agtgttgacattacctgctgcacgtgctccatcttgcatctctcagctatagctttggat cggtatcgagcaatcacagatgctgttgagtatgccaggaaaaggactccaaagcatgct ggcattatgattacaatagtttggattatatctgtttttatctctatgcctcctctattc tggaggcaccaaggaactagcagagatgatgaatgcatcatcaagcacgaccacattgtt tccaccatttactcaacatttggagctttctacatcccactggcattgattttgatcctt tactacaaaatatatagagcagcaaagacattataccacaagagacaagcaagtaggatt gcaaaggaggaggtgaatggccaagtccttttggagagtggtgagaaaagcactaaatca gtttccacatcctatgtactagaaaagtctttatctgacccatcaacagactttgataaa attcatagcacagtgagaagtctcaggtctgaattcaagcatgagaaatcttggagaagg caaaagatctcaggagtgagtcagagggcatttgacacgcaacctggtgttctgcattgt gggtttcctgttgctgaatgtggagaagagccaaatgtcccttttatcctcaatctctct gactttatcatcactaggaagagtgagaaaggagtcagagatttcttcattaaacagaag aagaaaaggacagattccaaattaccgtaa >gi568815595f:87890750_88091847|GENSCAN_predicted_peptide_6|254_aa MKRRPIVSEIDGFLVSLTSRMKPRTLEMCSEFLPSGGVPGLTGSGVKLWTFAVGVTALKA ERLELFFSPGGFVVSLASGVKLQTFAVSVTAHKGSVDPKSEQQQDLLQRAKEQSFRNMEG DRSGPWVVDRTGRPGAGGGARRGGSAAQEPTEGVGGSGMAGCRSRALSRGKAAKARREIE RSAGGLALLGDPVHPPQPLAQVLSPSLPGASRAGRLLRVPARQAHADPELQLARRRRAQP GFPLAPLRAHLPAS >gi568815595f:87890750_88091847|GENSCAN_predicted_CDS_6|765_bp atgaagaggagacctattgtgtccgaaattgatgggttcttggtctcactgacttcaaga atgaagccacggaccctcgagatgtgttcggagtttcttccttctggtggggttcctggt ctcactggctcaggagtgaagctgtggacctttgccgtgggtgtcacagctcttaaggca gagcgtctggagttgttcttttctcccggtgggttcgtggtctcgctggcttcaggagtg aagctgcagaccttcgcggtaagtgttacagctcataaaggcagtgtggacccaaagagt gagcagcagcaagatttattacaaagagcgaaagaacaaagcttccgcaatatggaaggg gaccgcagcggcccttgggtggttgataggactggacgcccgggagcagggggcggcgct cgtaggggaggctcggccgcacaggagcccacggagggggtgggaggctcaggcatggcg ggctgcaggtcccgagccctgtcccgcgggaaggcagctaaggcccggcgagaaattgag cgcagcgctggtgggctggcactgctgggggacccagtacaccctccgcagccgctggcc caggtgctaagcccctcattgcccggggccagcagggctggccggctgctccgagtgccg gcccgccaagcccacgccgacccggaactccagctggcccgcaggcgccgtgcccagccc gggttcccactcgcgcctctccgtgcacacctccccgcaagctga >gi568815595f:87890750_88091847|GENSCAN_predicted_peptide_7|263_aa MGPCGKALMNLSAIEPLNVNGVPREPQKHHWHKYTQQLTESPHWCPDLWSVGYSGGKGQV ELRLPRDTIKQKPEHIPKGIAEISAPIQGLERRRAHLVNDRKNSQFCLRLRTKGGSATDP GCLATSLPGPYDPAIQWCWKCLQQIGKPFGALGRPLEPAPMASLEVLYIHLVEEDKIQWA HEKSGHGGSDTGRWLKCLTTQFDSPLYPELLLGIQEESGHMNKLKMVNVGDFIADESGSQ QDGELERGWSGKVVFPWNSTITS >gi568815595f:87890750_88091847|GENSCAN_predicted_CDS_7|792_bp atgggcccgtgtgggaaggccctgatgaacctcagtgccattgaacccctaaatgtcaat ggggtccctcgtgaaccccaaaagcatcattggcataaatatactcaacagctgacagaa tccccacattggtgccctgacctgtggagtgtgggctattccggtggcaaaggccaagta gaattacgtctacctagggatacaataaaacaaaagccagagcacattcctaaagggatt gcagagatcagtgcccccattcaaggacttgaaagacgtagagcccatctagtgaatgat cggaaaaacagccagttttgtctgagactcagaacaaaaggaggttctgccacagatcca ggctgccttgcaacttctttgccagggccatatgatcctgcaatccagtggtgttggaag tgtctgcagcagatagggaagccatttggagcccttggcaggcccctagagcctgcacct atggcctcactggaagttctctacattcacttggtggaggaagataagattcagtgggct catgaaaaaagcggccatggtggcagtgatacaggcagatggcttaagtgtttaacaact cagtttgacagccctctgtatcccgagctcttgttgggcatccaggaagaatcaggacac atgaacaaattgaagatggtaaatgtgggggattttattgctgatgaaagtggctctcag caggatggagaactggaaaggggatggagtgggaaggtggtcttcccctggaattcaacc attaccagctga >gi568815595f:87890750_88091847|GENSCAN_predicted_peptide_8|167_aa MERFVVTAPPARNRSKTALYVTPLDRVTEFGGELHEDGGKLFCTSCNVVLNHVRKSAISD HLKSKTHTKRKAEFEEQNVRKKQRPLTASLQCNSTAQTEKVSVIQDFVKMCLEANIPLEK ADHPAVRAFLSRHVKNGGSIPKSDQLRRAYLPDGYENENQLLNSQDC >gi568815595f:87890750_88091847|GENSCAN_predicted_CDS_8|504_bp atggagcgatttgtagtaacagcaccacctgctcgaaaccgttctaagactgctttgtat gtgactcccctggatcgagtcactgagtttggaggtgagctgcatgaagatggaggaaaa ctcttctgcacttcttgcaatgtggttctgaatcatgttcgcaagtctgccattagtgac cacctcaagtcaaagactcataccaagaggaaggcagaatttgaagagcagaatgtgaga aagaagcagaggcccctaactgcatctcttcagtgcaacagtactgcgcaaacagagaaa gtcagtgttatccaggactttgtgaaaatgtgcctggaagccaacatcccacttgagaag gctgatcacccagcagtccgtgctttcctatctcgccatgtgaagaatggaggctccata cctaagtcagaccagctacggagggcatatcttcctgatggatatgagaatgagaatcaa ctcctcaactcacaagattgttga >gi568815595f:87890750_88091847|GENSCAN_predicted_peptide_9|125_aa MGDGRRVTEKMNKNVNNCHHAAPWYRRIEAAVRESGRGPAGKEKKRKKEEQGNKGGGGSV AAAVAAVARSSPAAAAVSPAAPRPPAPPMAEPVGGPQQPPSSIDQQTAGLSNRLSISPIV VERRA >gi568815595f:87890750_88091847|GENSCAN_predicted_CDS_9|378_bp atgggggacggaaggagggtgactgaaaagatgaacaagaacgtcaacaactgccaccac gcagctccttggtaccgtaggatcgaggccgccgttagggaaagtggacgaggcccggcc ggaaaggaaaagaaaaggaagaaagaggagcaagggaataagggaggaggaggatccgtc gctgccgccgtcgccgccgttgcccgatcgagccccgcggcggccgccgtgtcccccgcc gcgccccgtccgcctgcaccgcctatggcagagcccgtaggcggtccccagcaaccgccg agcagcattgaccaacagacggccggtttatcaaacagactgtcgatttcaccaatagta gtggaaaggcgggcatga >gi568815595f:87890750_88091847|GENSCAN_predicted_peptide_10|135_aa MAEEESDQEAERLGEELVAIVESPLGPVGLRAAGDGRGGAGSGNCGGGVGISSRDYCRRF CQVVEDYAGRWQVPLPQLQVLQTALCCFTTASASFPDECEHVQYVLSSLAVYTNTIVTVA YGIQYSSMLYRFVAQ >gi568815595f:87890750_88091847|GENSCAN_predicted_CDS_10|405_bp atggcggaggaagagagcgaccaagaggccgaacgcctcggagaagagcttgtggccatt gtggagtccccgctgggccctgtggggcttagagctgcgggcgacggcagaggcggcgct ggcagcggcaactgcggcggcggcgtcggaatcagcagtcgggattactgccgacgcttc tgtcaggtggttgaagattatgctggaagatggcaggtccctttgccacagcttcaggtt cttcagactgccctttgttgttttacaacagccagtgcatcattcccagatgaatgtgag catgtacaatatgttttgagtagccttgctgtatacacaaatactattgttacagttgcc tatggtattcagtacagtagcatgctatacaggtttgtagcccag