GENSCAN 1.0 Date run: 2-Nov-116 Time: 18:49:23 Sequence gi568815584f:85521515_85723494 : 201980 bp : 38.30% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 7490 7586 97 2 1 88 93 88 0.979 8.39 1.02 Intr + 8628 9020 393 2 0 14 98 424 0.838 29.92 1.03 Intr + 9549 9708 160 2 1 81 72 33 0.362 -0.36 1.04 Intr + 10067 10269 203 0 2 18 101 105 0.466 2.88 1.05 Intr + 10292 10405 114 1 0 29 31 166 0.820 4.72 1.06 Intr + 11917 12216 300 0 0 44 16 286 0.732 13.01 1.07 Intr + 12344 12479 136 1 1 26 97 127 0.994 6.62 1.08 Intr + 12542 12684 143 0 2 94 72 89 0.929 7.05 1.09 Intr + 22612 22765 154 0 1 62 48 103 0.242 2.42 1.10 Intr + 24354 24499 146 1 2 8 103 81 0.323 0.68 1.11 Intr + 30180 30266 87 2 0 52 100 41 0.197 0.95 1.12 Intr + 31120 31222 103 0 1 78 110 45 0.269 4.53 1.13 Term + 35447 35625 179 0 2 8 50 116 0.087 -3.33 1.14 PlyA + 35669 35674 6 1.05 2.05 PlyA - 36676 36671 6 1.05 2.04 Term - 45844 45697 148 0 1 39 55 147 0.182 2.89 2.03 Intr - 72026 71923 104 2 2 84 86 62 0.185 3.65 2.02 Intr - 87975 87798 178 2 1 88 86 128 0.896 11.60 2.01 Init - 89211 89159 53 0 2 69 97 -3 0.243 -0.61 2.00 Prom - 94389 94350 40 -5.75 3.00 Prom + 99161 99200 40 -6.15 3.01 Sngl + 100001 101983 1983 1 0 70 48 1331 0.987 121.05 3.02 PlyA + 102341 102346 6 1.05 4.07 PlyA - 103401 103396 6 1.05 4.06 Term - 168722 168657 66 2 0 79 44 85 0.263 0.16 4.05 Intr - 172505 172464 42 2 0 74 103 45 0.217 2.12 4.04 Intr - 183495 183413 83 1 2 32 52 103 0.012 -0.46 4.03 Intr - 194276 194063 214 2 1 60 47 113 0.034 1.77 4.02 Intr - 200212 200126 87 1 0 83 63 45 0.110 0.75 4.01 Intr - 201375 201342 34 2 1 112 99 32 0.678 4.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 87315 87208 108 0 0 69 43 91 0.918 0.23 S.002 Term + 105427 105561 135 0 0 66 49 182 0.958 9.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:85521515_85723494|GENSCAN_predicted_peptide_1|738_aa XPEWRQKKDSVEDVPVWISVCALSDVLAHLGVTLVVRSKPVAVTCAFGIIHCHRGGTSAR DIISARKSPRQGAELLLSEVRLWSSVPGAQQRRGGISAPAAVSTEPWDQGGSSQRWAGGT SAATPKTIPCPGIPAAAAPASSLSTSLTLSDSDEPIARRRRSRQLVGAGGAFSKTKSLPA LLVLFGLAGTELGPSRLSRAPAEVAKAWRAYEAESGSKRVVLLPAAVTGGAEATATVVVG ALLLEPPSKMALTVCSLPISTPSRAPSSPGYVLVTYCGEKMGYVNPPLHNATGTATRAHE PGSVRVCVLACVSACASVRLRAGRRTPGSELSAHTRRRRARLALPPSPGILQIRGAPAPR GSFLPPSPLAVLLLGFVGLAKRRGGGGGGEKAGVAAAEAKRKWHARTLEAVVAEDGGRRC RSSGLAPAAAAVAAAATRRTQQKFGSGGGRGAERDYHQGWKETSRTFAEPFLALAGLAES KREAKFKIPGMSTERWQITSHVRGAAVLRGKFPQEWIDYPVEGEVQPPPFCLTLSRFVRQ ILAGKPNLDQQQAFQTTHEAIHMPHWIGDNGQRNDQREDNSIKCVASERAWEKWRIDCVI NRSASLVFKIYVIAQAILMDPTEITDDLIHFLPVRKGSHVVRNLKASSKRKMAPLGAQQL FQDDNVDNSVASLGFLVQTAARENEEDPEAETPDKTIRAHETYSLPQEQYGGNCPHDSNY LPLGPSHNTWELWEYDSR >gi568815584f:85521515_85723494|GENSCAN_predicted_CDS_1|2217_bp nntccggaatggagacagaagaaagattcggttgaagatgtccctgtgtggatctcagtt tgcgctctgagcgacgtcctggctcatttgggtgtgacgttagttgtacgttcgaaacct gtcgccgtcacttgcgcgtttggcattatccattgtcaccgcggaggaacgagcgctcga gatatcatcagtgcccgcaaatctccgcgccaaggcgctgagctactcctttccgaggtg cgcctctggtcctccgtccctggtgcccagcagcggcgaggcggcatctccgctcccgcc gccgtgtccaccgagccctgggatcagggtggcagttctcaacgatgggcaggagggacc tcggcggcgacccctaaaacaataccatgccccgggatccccgctgctgccgcgccagcg tcttccctttccacctccctgaccctgtcggattcggatgagcccattgcaaggagaaga cgcagccgtcagttggtgggggcgggtggggctttttccaaaaccaagtcccttccagcc ctgcttgtcctcttcgggctggcgggcactgagctggggccatcacgcctttctagagcg cctgcggaggtggcgaaggcttggagagcatacgaggcggaatccggatcgaagcgtgtg gtgctcctgccagcggcagttactggtggagctgaggccaccgctactgtcgtcgttggc gctttgcttctggaacctcccagcaagatggcactcactgtctgttcccttccgattagc acccccagccgcgctccctcctccccgggatacgtattagtcacatactgtggggagaag atgggctatgtaaatccacctttgcataatgcaacaggaacagcgacccgcgcgcacgaa ccgggtagtgtgcgcgtgtgtgtgctcgcgtgtgtgagcgcgtgtgccagcgtgcgtctc cgcgcgggccgccgcacgcccggctccgagctgtccgcacacacgcgtcggaggagagcc cgcctagctctcccgccgagtcccgggatcctccaaatccgaggagctccggcgccgcgg ggcagctttctgccgccttccccgctcgctgtacttcttttggggttcgttggcttggcg aagcggagagggggaggcggaggaggagagaaggcgggggtcgcggcggccgaagccaag agaaagtggcatgcccgaaccctggaggcggtggtggcggaggacgggggaagacgatgc cgcagctccggcttggctccagcagccgccgccgtcgccgccgccgccacccggaggacc cagcaaaagtttggatctgggggagggcgcggcgctgagcgggattaccaccagggctgg aaggagacctcgagaacctttgcagagcccttcctggccctcgcgggtctggcagaaagt aagagggaggcgaagttcaagatcccggggatgagcaccgagcgctggcagatcactagt cacgttagaggggcagctgtgctgagaggcaaattcccccaggagtggattgattaccca gtggagggggaagtacaaccaccacctttctgccttactctgagcaggtttgtcagacaa atattggctggtaaacccaaccttgatcagcagcaagctttccaaaccacacatgaggca attcatatgccacactggataggtgataatggacaacggaatgaccagagggaagataat agcataaagtgtgtggcctctgagagggcctgggaaaaatggagaattgactgtgttatt aataggtcagcttcactggttttcaaaatctatgtaattgctcaggcgattttaatggac ccgactgagattactgatgacttaattcacttcttgccagtgagaaaaggtagccatgtg gtgagaaacttgaaggcctcttcaaagagaaaaatggcccctctgggagctcagcaattg tttcaggatgataatgttgataattccgtcgcatcattgggttttctggtgcagacagcg gcaagagaaaatgaggaagacccagaagcagaaacccctgataaaaccatcagagctcat gaaacttattcactaccacaagaacagtatgggggaaactgcccccatgattcaaattat cttccactgggtccctcccacaacacatgggaattatgggaatatgattcaagatga >gi568815584f:85521515_85723494|GENSCAN_predicted_peptide_2|160_aa MGDSGCRMMVLRRREANKWLRSDSSVFSKCSISPTLLTILQGIQNMGIISKPVRVPEKAE IDHSTNLRPTTNTPPIRKVHDIENPTGTHRFGLNTFSINRVSIICKEEPCERKPQVGLIK EDAEGSFEIPGIFGASITSRHPPGMGKPQRSGNCPVSMGF >gi568815584f:85521515_85723494|GENSCAN_predicted_CDS_2|483_bp atgggggattcagggtgcagaatgatggtgctcaggagaagagaggccaataagtggcta cgctctgactctagtgttttctccaaatgttctatctccccaactctcttgaccattctg cagggaattcagaatatgggaataatctcaaaaccagtgagagttcctgaaaaagctgaa attgaccacagtaccaaccttagacctacaaccaacacccctcctatcaggaaagtacat gacattgaaaaccctactggcactcacagatttgggctcaacacattttcaatcaacaga gtttcaatcatatgtaaggaagagccctgtgaaagaaagcctcaagttggtctaataaag gaagatgccgaaggaagcttcgagattccaggtatctttggagcttctattacaagcaga cacccaccagggatggggaaaccacagcgttcaggaaactgtccagtgtccatgggattc tga >gi568815584f:85521515_85723494|GENSCAN_predicted_peptide_3|660_aa MGLQTTKWPSHGAFFLKSWLIISLGLYSQVSKLLACPSVCRCDRNFVYCNERSLTSVPLG IPEGVTVLYLHNNQINNAGFPAELHNVQSVHTVYLYGNQLDEFPMNLPKNVRVLHLQENN IQTISRAALAQLLKLEELHLDDNSISTVGVEDGAFREAISLKLLFLSKNHLSSVPVGLPV DLQELRVDENRIAVISDMAFQNLTSLERLIVDGNLLTNKGIAEGTFSHLTKLKEFSIVRN SLSHPPPDLPGTHLIRLYLQDNQINHIPLTAFSNLRKLERLDISNNQLRMLTQGVFDNLS NLKQLTARNNPWFCDCSIKWVTEWLKYIPSSLNVRGFMCQGPEQVRGMAVRELNMNLLSC PTTTPGLPLFTPAPSTASPTTQPPTLSIPNPSRSYTPPTPTTSKLPTIPDWDGRERVTPP ISERIQLSIHFVNDTSIQVSWLSLFTVMAYKLTWVKMGHSLVGGIVQERIVSGEKQHLSL VNLEPRSTYRICLVPLDAFNYRAVEDTICSEATTHASYLNNGSNTASSHEQTTSHSMGSP FLLAGLIGGAVIFVLVVLLSVFCWHMHKKGRYTSQKWKYNRGRRKDDYCEAGTKKDNSIL EMTETSFQIVSLNNDQLLKGDFRLQPIYTPNGGINYTDCHIPNNMRYCNSSVPDLEHCHT >gi568815584f:85521515_85723494|GENSCAN_predicted_CDS_3|1983_bp atgggcctacagaccacaaagtggcccagccatggggcttttttcctgaagtcttggctt atcatttccctggggctctactcacaggtgtccaaactcctggcctgccctagtgtgtgc cgctgcgacaggaactttgtctactgtaatgagcgaagcttgacctcagtgcctcttggg atcccggagggcgtaactgtactctacctccacaacaaccaaattaataatgctggattt cctgcagaactgcacaatgtacagtcggtgcacacggtctacctgtatggcaaccaactg gacgaattccccatgaaccttcccaagaatgtcagagttctccatttgcaggaaaacaat attcagaccatttcacgggctgctcttgcccagctcttgaagcttgaagagctgcacctg gatgacaactccatatccacagtgggggtggaagacggggccttccgggaggctattagc ctcaaattgttgtttttgtctaagaatcacctgagcagtgtgcctgttgggcttcctgtg gacttgcaagagctgagagtggatgaaaatcgaattgctgtcatatccgacatggccttc cagaatctcacgagcttggagcgtcttattgtggacgggaacctcctgaccaacaagggt atcgccgagggcaccttcagccatctcaccaagctcaaggaattttcaattgtacgtaat tcgctgtcccaccctcctcccgatctcccaggtacgcatctgatcaggctctatttgcag gacaaccagataaaccacattcctttgacagccttctcaaatctgcgtaagctggaacgg ctggatatatccaacaaccaactgcggatgctgactcaaggggtttttgataatctctcc aacctgaagcagctcactgctcggaataacccttggttttgtgactgcagtattaaatgg gtcacagaatggctcaaatatatcccttcatctctcaacgtgcggggtttcatgtgccaa ggtcctgaacaagtccgggggatggccgtcagggaattaaatatgaatcttttgtcctgt cccaccacgacccccggcctgcctctcttcaccccagccccaagtacagcttctccgacc actcagcctcccaccctctctattccaaaccctagcagaagctacacgcctccaactcct accacatcgaaacttcccacgattcctgactgggatggcagagaaagagtgaccccacct atttctgaacggatccagctctctatccattttgtgaatgatacttccattcaagtcagc tggctctctctcttcaccgtgatggcatacaaactcacatgggtgaaaatgggccacagt ttagtagggggcatcgttcaggagcgcatagtcagcggtgagaagcaacacctgagcctg gttaacttagagccccgatccacctatcggatttgtttagtgccactggatgcttttaac taccgcgcggtagaagacaccatttgttcagaggccaccacccatgcctcctatctgaac aacggcagcaacacagcgtccagccatgagcagacgacgtcccacagcatgggctccccc tttctgctggcgggcttgatcgggggcgcggtgatatttgtgctggtggtcttgctcagc gtcttttgctggcatatgcacaaaaaggggcgctacacctcccagaagtggaaatacaac cggggccggcggaaagatgattattgcgaggcaggcaccaagaaggacaactccatcctg gagatgacagaaaccagttttcagatcgtctccttaaataacgatcaactccttaaagga gatttcagactgcagcccatttacaccccaaatgggggcattaattacacagactgccat atccccaacaacatgcgatactgcaacagcagcgtgccagacctggagcactgccatacg tga >gi568815584f:85521515_85723494|GENSCAN_predicted_peptide_4|175_aa XFDKTTNQYSFLVTIYKEVSYKESAHMIMETEKSQDLQGDSTPSSPVLGLGLALLAPQLA NGLLWSLVIVTVPSFSERSLRAVMKTLGKVTHSHHGGRDAGLHCSSDSDGQSIFYIVWNH VPKITGLMEEMGLDADLSKQPKLTQLVEGEGPSQKVPMFLNMIHRRVKELRVAAE >gi568815584f:85521515_85723494|GENSCAN_predicted_CDS_4|528_bp nnttttgataaaactaccaaccagtactccttcctggtgacaatatataaagaggtttct tataaggaatcagctcacatgattatggagacagaaaagtcccaagatctgcagggtgat tcaactccaagttccccagttttgggactcggactggctctccttgctcctcagcttgca aatggcctattgtggagccttgtgatcgtgacagtacctagttttagtgagaggagtctc agagctgtcatgaaaacattggggaaagtgactcacagtcatcatggtggacgggatgca ggactacattgtagctctgactcagacggacagagcatattttacattgtgtggaatcat gtaccaaaaataacaggacttatggaggaaatggggctggatgcagacctttcaaaacaa cctaaacttacacaactggtggaaggagaaggtccttctcagaaagttcctatgttctta aatatgattcatcgaagggtgaaagaactaagagtagcagcagaatag