GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:27:37 Sequence gi568815586r:11033288_11234244 : 200957 bp : 36.24% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 15610 15822 213 0 0 97 36 127 0.224 6.39 1.02 Intr + 30334 30447 114 0 0 80 76 42 0.057 1.92 1.03 Intr + 38751 38823 73 2 1 51 99 4 0.008 -4.14 1.04 Intr + 38876 39163 288 0 0 96 61 96 0.099 3.99 1.05 Term + 56126 56310 185 0 2 77 36 127 0.545 3.12 1.06 PlyA + 57466 57471 6 1.05 2.00 Prom + 58708 58747 40 -5.35 2.01 Init + 58941 59163 223 2 1 55 96 124 0.239 8.67 2.02 Term + 72074 72276 203 0 2 45 44 128 0.270 0.67 2.03 PlyA + 74090 74095 6 1.05 3.00 Prom + 74190 74229 40 -8.05 3.01 Sngl + 78333 79538 1206 2 0 62 43 428 0.992 31.27 3.02 PlyA + 79775 79780 6 1.05 4.00 Prom + 80893 80932 40 -3.45 4.01 Init + 81020 81223 204 1 0 66 72 99 0.530 5.00 4.02 Intr + 96287 96385 99 1 0 57 40 98 0.200 1.29 4.03 Term + 101226 101414 189 2 0 112 38 68 0.361 0.67 4.04 PlyA + 101685 101690 6 1.05 5.03 PlyA - 101772 101767 6 1.05 5.02 Term - 115032 114500 533 1 2 67 40 231 0.039 9.62 5.01 Init - 138312 138135 178 0 1 94 90 350 0.966 35.27 5.00 Prom - 142563 142524 40 -4.95 6.07 PlyA - 143541 143536 6 1.05 6.06 Term - 148655 148487 169 1 1 17 43 159 0.180 0.67 6.05 Intr - 156557 156426 132 1 0 93 26 89 0.116 2.04 6.04 Intr - 163084 162910 175 2 1 85 19 91 0.175 -0.02 6.03 Intr - 173344 173193 152 0 2 93 88 12 0.197 0.59 6.02 Intr - 176131 175964 168 0 0 75 91 65 0.236 3.64 6.01 Init - 178898 178861 38 2 2 41 81 54 0.220 -0.47 6.00 Prom - 179804 179765 40 -3.65 7.00 Prom + 181111 181150 40 -6.15 7.01 Init + 185257 185309 53 0 2 78 82 63 0.667 5.38 7.02 Term + 188230 188752 523 1 1 -16 48 338 0.978 12.16 7.03 PlyA + 189083 189088 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 44596 44694 99 0 0 83 50 98 0.839 2.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:11033288_11234244|GENSCAN_predicted_peptide_1|290_aa VEKNKVGEIGSLEQMKYAEASNTQMLKCLVIAETLEPKILTSNLCEAKLNTVAYQCNNMT QSKPTLDTTRAKNITDKSTLLNQECGKYVQPCCVRNSGAKSKNRWEIAKQLTRILFSGSR EYPASSFCVTGTTVFVETSSHHVAQGGLELLASIDIPNCAYQSSGIASKSHHTKPALGVF EGQMLPRFLTHVHPKVDLISLLLLPLPEHVIEMDKHGSWQDSHLLPDLQTLLEKVLEAKY THLMTVMLSLSPHCEVAASRAIPDITTHCFTFLPVSISTRPCHFCLEFDF >gi568815586r:11033288_11234244|GENSCAN_predicted_CDS_1|873_bp gtggagaaaaataaggttggagaaattggcagtcttgagcaaatgaaatatgctgaggct agtaacacccagatgctgaaatgcttggttattgctgagacattagaaccaaaaattctt acttctaatctatgtgaagccaaattaaacacagttgcataccaatgtaataatatgacc cagagtaaaccaactctggacaccaccagagcaaaaaacataactgataaatcaactctg ctaaaccaagagtgtgggaaatatgtacaaccttgttgtgtcaggaattcaggagccaaa agcaaaaacagatgggaaattgcaaagcaactgactcgcatattgttttcaggttcaagg gagtatcctgcctcatccttctgtgtaactgggactacagtttttgtagagacaagctct caccatgtagcccagggtggtctggaactcctggcctcaatagatattcccaactgtgcc taccaaagttccgggattgcaagcaagagccatcataccaagcctgcactgggagttttt gaaggtcagatgctacctagatttttgactcatgtacatcccaaggtggatctaatcagt ttgctgttattacctcttcctgaacatgtaattgaaatggacaaacatggcagctggcag gactctcaccttcttcctgacctgcagacccttctggaaaaagtgctggaagccaagtac acgcatctgatgaccgtcatgctgtctctctcacctcactgtgaagtggctgccagcaga gccataccagacatcaccacacattgtttcacatttcttcctgtctcaatttccacacgt ccttgccatttttgtcttgaatttgacttctaa >gi568815586r:11033288_11234244|GENSCAN_predicted_peptide_2|141_aa MSEQTKKIFLNAGVVSGIGSCRWVRGLADFKREPLTFTVSVAALKDGVDPKSEQQQGLLR REKGQSFHRVKGNPGLDLNIYVYLIPEFAGMSLFLFKFCDQCQTGKHLNMPVDEINAVFM ENMMISKTAQINSYSNTTSLL >gi568815586r:11033288_11234244|GENSCAN_predicted_CDS_2|426_bp atgtctgaacagacaaaaaaaatttttttaaatgctggtgttgtgtccggaattggttcc tgcaggtgggttcgtggtctcgctgacttcaaaagggagccactgaccttcacggtgagt gttgctgctcttaaagatggtgtggacccaaagagtgagcaacagcaaggtttattgaga agagagaaaggacaaagcttccacagagtgaaaggcaacccaggtcttgaccttaacatc tatgtgtacctgattcctgaatttgcaggaatgtccttgttcctttttaaattctgtgac caatgtcaaacaggaaagcatctcaatatgccagtggatgaaatcaatgctgtctttatg gaaaacatgatgatttccaaaacagctcaaattaactcctattcaaacacaacgtccttg ctgtaa >gi568815586r:11033288_11234244|GENSCAN_predicted_peptide_3|401_aa MYQNLWDTAKAVFRGKFIALNAHRRKWERSKVDTLASQLKELEKQEQTNLKASRRQEITK VRAQLKEIETPKTLQKINESRSWFFEKFNKIERQLARLVKKKRENNPIDTIKNNKGDITT DFREIQTIIREYYKHLYANTLENLEEMDKILDTYNIPSLNQEEVKSLNRPITSSETETVI NSLATKKSPGTDIFTTKFYQRYKEEMVPFFLKLFKTLEKEGLLLNSFYEASIILIPKPDR DTAKRENFRPISLMNIDAKILNKILASQIQQHIKRLIHQDQVVFNPGMQGWFNICKSINI IHHINRTNDKNHMIISTDAEKAFDKIQHAFMLKTLKKLGTDGMYLKILSAIYDKPTANIL NGQKREAFPLKIGTKQGCHLSPLLFNIVLEILARTIRKRKK >gi568815586r:11033288_11234244|GENSCAN_predicted_CDS_3|1206_bp atgtaccagaatctctgggacacagctaaagcagtgtttagagggaaatttatagcacta aatgcccacaggagaaagtgggaaagatctaaagttgacaccctagcatcacaattaaaa gaactggagaagcaagagcaaacaaatttaaaagctagcagaagacaagagataactaag gtcagagcacaactgaaggagatagaaacacctaaaacccttcagaaaatcaatgaatcc aggagctggttttttgaaaagttcaacaaaatagaaagacagctagccagactagtaaag aagaaaagagagaataatccaatagacacaataaaaaataataaaggagatattaccact gacttcagagaaatacaaactatcatcagagaatactataaacacctctatgcaaatacg ctagaaaatttagaagaaatggataaaatactggacacatacaacatcccaagtctaaac caggaagaagtcaaatccctgaatagaccaataacaagttctgaaactgagacagtaatt aatagcctagcaaccaaaaaaagtccaggaacagacatattcacaaccaaattctaccag aggtacaaagaggagatggtaccattctttctgaaactattcaaaacactggaaaaagag ggactactccttaactcattttatgaggccagcatcatcctgattccaaaacctgataga gacacagcaaaaagagaaaattttaggccaatatccttgatgaacatcgatgcgaaaatc ctcaataaaatactggcaagccaaatccagcagcatataaaaaggcttatccaccaagat caagtcgtcttcaaccctgggatgcaaggctggttcaacatatgcaaatcaataaacata atccatcacataaacagaaccaatgacaaaaaccacatgattatctcaacagatgcagaa aaggcctttgataaaattcaacatgccttcatgctaaaaacactcaagaaactaggtact gatggaatgtatctcaaaatattaagtgctatttatgacaaacccacagccaatatactg aatgggcaaaaacgggaagcattccctttgaaaatcggcacaaaacaaggatgccatctc tcaccactcctgttcaacatagtattggaaattctggccaggacaatcaggaagagaaag aaataa >gi568815586r:11033288_11234244|GENSCAN_predicted_peptide_4|163_aa MEYYAAIKKDEFMSFAGTWMKLETIILSKLTQEQKTKHHIITHNWELNNENTWTQGGHRE RHTTGPFRMGYRMNADYHKLKSTLTNAFQDVLSSLSRAELLAGIIHTEIDMKPEFSFASM QTRTCSLPVFAIFPCVTSPSFVFSDFSCLGSFITRYVDRIVNV >gi568815586r:11033288_11234244|GENSCAN_predicted_CDS_4|492_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctttgcagggacctggatg aagctggaaaccatcattctcagcaaactaacacaggaacagaaaaccaaacaccacatt atcactcataattgggagctaaacaatgagaacacatggacacagggaggacacagggaa cgtcacacaactgggccttttaggatgggttatcgaatgaatgcagactaccataaactt aaatcaacactcacaaatgctttccaggatgtgctatcttcactgagcagagcagagctt ctggctggaattattcatactgaaattgacatgaaacctgaattctcatttgctagtatg caaacaaggacatgttcacttccagtgtttgcaatttttccttgtgtaacctctccatca tttgtctttagcgacttcagttgcttgggaagttttataacccgatatgtagatcgtata gtaaatgtctaa >gi568815586r:11033288_11234244|GENSCAN_predicted_peptide_5|236_aa MLAPEAVSLVMALNMYLQSDVEDHRARDVEVREVHAQLPGQLEEGEQGAGEPLAEDAVRV LEVVTRAMRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLVKLISNFSKVSGY KINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGVQLTRDVKDLFKENYKPLLNE IKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLTMTFFTELENYFKVH >gi568815586r:11033288_11234244|GENSCAN_predicted_CDS_5|711_bp atgctggccccggaggccgttagcttagtcatggctctaaatatgtacctgcaatcggat gttgaggatcaccgagcccgcgacgtagaagtacgggaagttcatgcgcagctgccaggc cagctcgaagaaggcgagcagggtgcgggagagccccttgcagaagacgccgtacgagtg ttggaagttgtgaccagggcaatgaggcaggagaaggaaataaagggtattcaattagga aaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaatccc attgtctcagcccaaaatctcgttaagctgataagcaacttcagcaaagtctcaggatac aaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagc caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggagtc caacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaa ataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaat atcgtgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaag ctaacaatgactttcttcacagaattggaaaactactttaaagttcactga >gi568815586r:11033288_11234244|GENSCAN_predicted_peptide_6|277_aa MTKIQNTDNTKRWSEQSKVPADIKIWRHRVEIQFTDSGLPDCAGIGLLDLTQSTMGMWHL GHRGRLFPSISLFCIPKKNQELACAFTAIGLKSAIFPGIPRSFGIYLECRMLFRKQYLGG ETLYSCSDRRLAPETQDRGLCESHEHYDCFLLPPSLQFGNFPFIHQLLSMQEENSGTRSS IAKDTNGNSLGKENAKRVTISVCVCIIVKEKLSCYPSPHMEKLMNNNKKVAYVVVKRVGG QFASQRQNWLNEKHIVREQGRQNPVLSLQNHRFAAFG >gi568815586r:11033288_11234244|GENSCAN_predicted_CDS_6|834_bp atgaccaaaatccagaacactgacaacacaaaacgctggagtgaacaaagcaaagtgcca gcagacatcaagatctggaggcacagagtagaaatccagttcactgactctggcttgcct gattgtgcagggattggtctattagatctgacacaaagcacaatgggaatgtggcattta ggacacagagggcgcctgttccccagtatatccttattttgcatcccaaaaaagaatcaa gaattagcttgtgcttttactgccataggcctgaaatcagctatttttccagggatccct agatcctttgggatctatttagagtgcagaatgctatttcgaaaacagtatctgggaggg gaaacactgtacagctgtagtgataggcgactggctccagagacccaggacagaggccta tgtgagagccatgaacattatgattgctttcttcttccaccttctctacagtttgggaac ttccctttcatccatcagctactttctatgcaagaggaaaatagtggcactagatcttcc atagctaaagatacaaatggaaattcattaggcaaagaaaatgcaaaaagagtaacaata tcggtctgtgtttgcataatagtgaaggagaaactctcttgttacccatcaccccacatg gaaaaactaatgaacaacaacaaaaaagtcgcttatgtggttgttaagagagttggtggc cagtttgccagtcagcgtcaaaattggctgaatgagaaacacattgttagagaacaaggc aggcaaaatcctgtcctgtccttgcagaatcatagatttgcagcttttgggtag >gi568815586r:11033288_11234244|GENSCAN_predicted_peptide_7|191_aa MDPNKKEIRDLPEKEFKRKISIQGSYLNVVKAIYDKPTTKLILNEEKLKAFSLRTGTRQG YPLSPLLFNIVLEVLARAIRQEKEIKGIQIGKEQHKLSLFADDMIVYLENPKDSSKKLLD LINEFSKVSGYKINVCKPVAFLYPNSDQTENQIKNSTPFTIAADVYRNIPNQGDKRPLQG KLQNTAERNHR >gi568815586r:11033288_11234244|GENSCAN_predicted_CDS_7|576_bp atggatccaaacaaaaaagaaatccgtgatttacctgaaaaagaattcaaaagaaaaatc agcatacaaggatcgtacctcaatgtagtaaaagccatctatgacaaacccacaaccaag ttaatactgaatgaggaaaaattgaaagcattctctctgagaactggaacaagacaagga tacccactttcaccacttttattcaatatagtactggaagttttagccagagcaattaga caagagaaagaaataaagggcatccagattggtaaagaacaacacaaactgtcactgttt gctgatgatatgattgtatacctagaaaaccctaaagactcctccaaaaagctcctagat ctcataaatgaattcagcaaagtttcaggatacaaaattaatgtgtgcaaaccagtagcc ttcctataccccaacagcgaccaaactgagaatcaaatcaagaactcaactccttttaca atagctgcagatgtatataggaatatacctaatcaaggagataaaagacctctacaagga aaactacaaaacaccgctgaaagaaatcatagatga