GENSCAN 1.0	Date run: 5-Nov-116	Time: 12:29:14

Sequence gi568815592r:36755816_36974772 : 218957 bp : 48.03% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.18 Intr -    483    430   54  0  0  103   79    68 0.607   6.58
 1.17 Intr -   7177   7102   76  0  1   90   78   115 0.512  10.22
 1.16 Intr -   9561   9520   42  2  0   74  115   118 0.798  10.46
 1.15 Intr -  19250  19146  105  1  0  134   89   149 0.956  18.93
 1.14 Intr -  23142  23039  104  0  2  117   92   113 0.386  13.57
 1.13 Intr -  36281  36218   64  1  1   90   96    92 0.961   8.92
 1.12 Intr -  38834  38775   60  1  0  149   87    24 0.963   6.25
 1.11 Intr -  42426  42350   77  0  2   59   94    67 0.597   2.71
 1.10 Intr -  42679  42640   40  0  1  122   94    14 0.987   3.63
 1.09 Intr -  44255  44152  104  2  2  108   84   252 0.994  25.77
 1.08 Intr -  49289  49153  137  0  2   83   89    82 0.936   8.09
 1.07 Intr -  58543  58363  181  1  1   36   49   104 0.129   0.74
 1.06 Intr -  66345  66299   47  1  2  124   92    34 0.594   5.53
 1.05 Intr -  67283  67243   41  1  2  140  113    61 0.599  11.67
 1.04 Intr -  71763  71574  190  1  1   72   72    73 0.186   2.74
 1.03 Intr -  74517  74469   49  0  1   68   51    40 0.125  -3.45
 1.02 Intr -  76561  76426  136  0  1   44   78   109 0.547   6.07
 1.01 Init -  83562  83468   95  0  2   80   99   172 0.616  17.37
 1.00 Prom -  98854  98815   40                              -4.46

 2.07 PlyA -  99038  99033    6                               1.05
 2.06 Term - 100218  99998  221  1  2  108   38   193 0.912  13.50
 2.05 Intr - 100839 100771   69  1  0   89  114    78 0.374   9.65
 2.04 Intr - 107129 106999  131  1  2    2   69   108 0.217   0.64
 2.03 Intr - 108227 107984  244  0  1  -11   87   144 0.343   0.96
 2.02 Intr - 116057 115903  155  1  2   98   92   168 0.764  17.92
 2.01 Init - 118957 118902   56  1  2   73   94    35 0.394   3.36
 2.00 Prom - 122709 122670   40                              -5.66

 3.00 Prom + 127950 127989   40                              -6.26
 3.01 Init + 131840 131891   52  1  1   36  115    23 0.440   1.12
 3.02 Intr + 138689 138788  100  0  1  105   87    32 0.724   3.97
 3.03 Intr + 143611 143818  208  1  1  101   93   171 0.830  17.98
 3.04 Intr + 146406 146619  214  2  1  117   98   -17 0.848   0.49
 3.05 Intr + 158469 158620  152  1  2  101  105   163 0.994  19.18
 3.06 Intr + 163763 163886  124  1  1  113   88    94 0.952  12.06
 3.07 Term + 167532 167626   95  1  2   84   49   102 0.901   3.89
 3.08 PlyA + 167686 167691    6                               1.05

 4.00 Prom + 191146 191185   40                              -6.16
 4.01 Init + 198946 199116  171  0  0  108  115   336 0.972  35.74
 4.02 Intr + 203330 203551  222  1  0  108   64   479 0.910  45.82
 4.03 Intr + 205636 205745  110  0  2   64   75   158 0.994  11.18
 4.04 Intr + 206071 206179  109  1  1   81   45    65 0.469   1.79
 4.05 Intr + 206313 206421  109  2  1   38   90    82 0.474   3.26
 4.06 Intr + 207120 207797  678  1  0   94  110   228 0.940  17.19
 4.07 Term + 208008 208129  122  1  2  107   49    34 0.945   0.04
 4.08 PlyA + 208996 209001    6                               1.05

 5.07 PlyA - 210548 210543    6                               1.05
 5.06 Term - 213159 213088   72  0  0  118   37    52 0.638   1.01
 5.05 Intr - 214299 214224   76  2  1  101   99    31 0.639   5.02
 5.04 Intr - 214658 214591   68  2  2   69   77    76 0.821   2.30
 5.03 Intr - 214879 214832   48  1  0  107   98    80 0.998   9.78
 5.02 Intr - 215241 215128  114  0  0   81   80    23 0.694   1.44
 5.01 Intr - 216981 216837  145  2  1  101   77   129 0.598  13.38

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:36755816_36974772|GENSCAN_predicted_peptide_1|534_aa
MEQPEDMASLSEFDSLAGSIPATKVEITVSCRLGTGAIFPDFRESSSDLNNVDSVSRKRV
GNVCWMGKRGIYGAIWKSFVWEQLREAEDGKQDTGETEALRDGVSTCGIWTAAIRDIIGG
SLAATWINMASGALLQTYTARLGTEPKELPFYTPGQGNLLDKDMFSKSDPLCVMYTQGME
NKQWREFMETMLNYDLAFSILVMPLGPGSAVVLTDTGDKELNGMEYCVPPLEIFSQIDFS
FLIDLWVVFLPSLMGYKLGLTGPGGFSDFKIQVLAFFPFVIRPSNEGYKTSSFGRTEVID
NTLNPDFVRKFIVDYFFEEKQNLRFDLYDVDSKSPDLSKHDFLGQAFCTLGEIVGSPGSR
LEKPLTIGAFSLNSRTGKPMPAVSNGGVPGKKCGTIILSAEELSNCRDVATMQFCANKLD
KKDFFGKSDPFLVFYRSNEDGTFTICHKTEVMKNTLNPVWQTFSIPVRALCNGDYDRTIK
VEVYDWDRDGSHDFIGEFTTSYRELARGQSQFNIYEVVNPKKKMKKKKYVNSGT

>gi568815592r:36755816_36974772|GENSCAN_predicted_CDS_1|1602_bp
atggagcagcctgaggacatggcgtcgctgagcgagttcgactccttggcgggcagcatc
ccggccaccaaggtggagatcaccgtgtcctgcagactagggacaggggccatcttcccg
gatttcagggagtcgtcatcagatctaaacaatgtggattctgtcagcaggaagagagtg
ggcaatgtttgctggatgggcaaacgtggcatctatggggccatttggaaatcgtttgtt
tgggagcagctccgagaggctgaggatgggaagcaggacacgggggaaactgaggcccta
agagacggtgtctcaacatgtggtatttggacggctgccatcagagacataataggggga
tctctggcggccacttggataaacatggcctctggagccctgcttcagacctacacagcc
agacttggaacagagcccaaggaactgccattttataccccggggcaggggaacctcctg
gacaaagacatgttttccaagtccgacccactgtgcgtcatgtatacccaagggatggag
aacaagcagtggcgggagtttatggagaccatgttgaattacgatcttgcattcagcatc
ttggttatgcctcttggtcctgggtcagctgtagtcctcaccgacacaggtgacaaggaa
ttgaatgggatggagtactgcgttcctccactagagatcttttcccagattgacttcagt
tttttaattgacctttgggttgtgttcctgccatcattaatggggtataaattgggcctg
acaggacctggtggcttctcagattttaaaatccaagttttagcatttttcccctttgtt
attcgtccaagtaatgaaggctataaaacatcttcatttgggcgcaccgaagtcatcgac
aacacgctcaatcctgacttcgtgcgcaagttcattgtggattactttttcgaggagaag
cagaacctccgttttgatctatacgacgttgactctaagagtcctgatttatccaaacac
gatttcctgggccaggccttctgcacccttggagagattgtggggtcccctgggagccgc
ctggaaaagcccctcacgataggcgcattcagcctgaattccaggacgggcaaacccatg
ccagctgtgtccaacggtggtgtcccaggaaagaaatgtggcaccatcatcctgtccgct
gaggagctcagcaactgtagggatgtcgccaccatgcagttctgtgccaacaagctggac
aagaaagatttctttgggaaatctgaccccttcttggtattctacagaagcaacgaggat
ggaacgttcaccatttgccacaagaccgaggtcatgaagaacaccctaaatccagtctgg
caaactttctccattcccgtgagagccctctgcaacggcgactacgatcggaccatcaag
gtggaggtgtacgactgggatcgggacggcagccatgacttcattggggagttcaccacc
agttaccgggagctggcccgtgggcagagccaattcaacatctatgaggtggtaaacccg
aaaaagaaaatgaagaaaaagaaatacgtgaattctggcaca

>gi568815592r:36755816_36974772|GENSCAN_predicted_peptide_2|291_aa
MAAIPPDSWQPPNVYLETSMGIIVLELYWKHAPKTCKNFAELARRGYYNGTKFHRIIKDF
MIQGGDPTGTANTKIEVYWVTPVKDKGSGNQVWKGRSSHSSAGLSNTCESRGAEEEQLPV
SSEEHQENVESWKPKEDSNSSRRVTSSVDQVKTGIQTSLTPKASEKEKTASSEPQSPLED
CGIVPVWYINKVFVSGRGGASIYGKQFEDELHPDLKFTGAGILAMANAGPDTNGSQFFVT
LAPTQWLDGKHTIFGRVCQGIGMVNRVGMVETNSQDRPVDDVKIIKAYPSG

>gi568815592r:36755816_36974772|GENSCAN_predicted_CDS_2|876_bp
atggcggcaattcccccagattcctggcagccacccaacgtttacttggagaccagcatg
ggaatcattgtgctggagctgtactggaagcatgctccaaagacctgtaagaactttgct
gagttggctcgtcgaggttactacaatggcacaaaattccacagaattatcaaagacttc
atgatccaaggaggtgacccaacagggacagcaaacaccaagatagaggtttattgggta
acacctgtgaaagataaagggtcagggaaccaggtttggaagggaaggtcttcacacagc
agcgcaggcctgagtaacacctgtgaaagcagaggggcggaggaggagcagctgccagtg
agcagcgaggaacaccaggagaatgtggagtcgtggaagccaaaggaagacagcaactca
agcagaagggtgacatcgagtgtggatcaagtaaaaactggaatccagacgtctctgacc
ccaaaggcaagtgagaaagagaagacagcatcttcagaaccacaaagccccttggaggac
tgtggcattgttcccgtctggtatattaacaaagtgtttgtgtcaggtcgaggtggtgca
tctatctatggcaaacagtttgaagatgaacttcatccagacttgaaattcacgggggct
ggaattctcgcaatggccaatgcggggccagataccaatggcagccagttctttgtgacc
ctcgcccccacccagtggcttgacggcaaacacaccatttttggccgagtgtgtcagggc
ataggaatggtgaatcgcgtgggaatggtagaaacaaactcccaggaccgccctgtggac
gacgtgaagatcattaaggcatacccttctgggtag

>gi568815592r:36755816_36974772|GENSCAN_predicted_peptide_3|314_aa
MGNETEQVGSDQIIKGHGIIVVNHFPVLSRSVVIKGHVVSELPAQNRKITWDFILEDMDL
AANEISIYDKLSETVDLVRQTGHQCGMSEKAIEKFIRQLLEKNEPQRPPPQYPLLIVVYK
VLATLGLILLTAYFVIQPFSPLAPEPVLSGAHTWRSLIHHIRLMSLPIAKKYMSENKGVP
LHGGDEDRPFPDFDPWWTNDCEQNESEPIPANCTGCAQKHLKVMLLEDAPRKFERLHPLV
IKMHKMPDLFIIGSGEAMLQLIPPFQCRRHCQSVAMPIEPGDIGYVDTTHWKVYVIARGV
QPLVICDGTAFSEL

>gi568815592r:36755816_36974772|GENSCAN_predicted_CDS_3|945_bp
atgggaaatgaaactgaacaggtaggcagtgaccagatcataaagggccatggaatcatt
gtagtcaatcattttccagttctcagccgctcagttgtgatcaagggacacgtggtttcc
gaactgccagctcagaataggaaaataacttgggattttatattggaagacatggatctt
gctgccaacgagatcagcatttatgacaaactttcagagactgttgatttggtgagacag
accggccatcagtgtggcatgtcagagaaggcaattgaaaaatttatcagacagctgctg
gaaaagaatgaacctcagagaccccccccgcagtatcctctccttatagttgtgtataag
gttctcgcaaccttgggattaatcttgctcactgcctactttgtgattcaacctttcagc
ccattagcacctgagccagtgctttctggagctcacacctggcgctcactcatccatcac
attaggctgatgtccttgcccattgccaagaagtacatgtcagaaaataagggagttcct
ctgcatgggggtgatgaagacagaccctttccagactttgacccctggtggacaaacgac
tgtgagcagaatgagtcagagcccattcctgccaactgcactggctgtgcccagaaacac
ctgaaggtgatgctcctggaagacgccccaaggaaatttgagaggctccatccactggtg
atcaagatgcataagatgcctgacctatttatcattggcagcggtgaggccatgttgcag
ctcatccctcccttccagtgccgaagacattgtcagtctgtggccatgccaatagagcca
ggggatatcggctatgtcgacaccacccactggaaggtctacgttatagccagaggggtc
cagcctttggtcatctgcgatggaaccgctttctcagaactgtag

>gi568815592r:36755816_36974772|GENSCAN_predicted_peptide_4|506_aa
MHGSCSFLMLLLPLLLLLVATTGPVGALTDEEKRLMVELHNLYRAQVSPTASDMLHMRWD
EELAAFAKAYARQCVWGHNKERGRRGENLFAITDEGMDVPLAMEEWHHEREHYNLSAATC
SPGQMCGHYTQVVWAKTERIGCGSHFCEKLQGVEETNIELLVCNYEPPGNVKGKRPYQEG
TPCSQCPSGYHCKNSLCGESTGGWPPTRSHFGALSFQVAGFQPFKGRMLESLAASGGPAR
EPIGSPEDAQDLPYLVTEAPSFRATEASDSRKMGTPSSLATGIPAFLVTEVSGSLATKAL
PAVETQAPTSLATKDPPSMATEAPPCVTTEVPSILAAHSLPSLDEEPVTFPKSTHVPIPK
SADKVTDKTKVPSRSPENSLDPKMSLTGARELLPHAQEEAEAEAELPPSSEVLASVFPAQ
DKPGELQATLDHTGHTSSKSLPNFPNTSATANATGGRALALQSSLPGAEGPDKPSVVSGL
NSGPGHVWGPLLGLLLLPPLVLAGIF

>gi568815592r:36755816_36974772|GENSCAN_predicted_CDS_4|1521_bp
atgcacggctcctgcagtttcctgatgcttctgctgccgctactgctactgctggtggcc
accacaggccccgttggagccctcacagatgaggagaaacgtttgatggtggagctgcac
aacctctaccgggcccaggtatccccgacggcctcagacatgctgcacatgagatgggac
gaggagctggccgccttcgccaaggcctacgcacggcagtgcgtgtggggccacaacaag
gagcgcgggcgccgcggcgagaatctgttcgccatcacagacgagggcatggacgtgccg
ctggccatggaggagtggcaccacgagcgtgagcactacaacctcagcgccgccacctgc
agcccaggccagatgtgcggccactacacgcaggtggtatgggccaagacagagaggatc
ggctgtggttcccacttctgtgagaagctccagggtgttgaggagaccaacatcgaatta
ctggtgtgcaactatgagcctccggggaacgtgaaggggaaacggccctaccaggagggg
actccgtgctcccaatgtccctctggctaccactgcaagaactccctctgtggtgagtcc
acgggtggatggccccccacgcgcagccactttggcgccctgtcgttccaagtggccgga
tttcaacccttcaaagggaggatgttagaaagtctggcggcttcgggggggcccgcgcga
gaacccatcggaagcccggaagatgctcaggatttgccttacctggtaactgaggcccca
tccttccgggcgactgaagcatcagactctaggaaaatgggtactccttcttccctagca
acggggattccggctttcttggtaacagaggtctcaggctccctggcaaccaaggctctg
cctgctgtggaaacccaggccccaacttccttagcaacgaaagacccgccctccatggca
acagaggctccaccttgcgtaacaactgaggtcccttccattttggcagctcacagcctg
ccctccttggatgaggagccagttaccttccccaaatcgacccatgttcctatcccaaaa
tcagcagacaaagtgacagacaaaacaaaagtgccctctaggagcccagagaactctctg
gaccccaagatgtccctgacaggggcaagggaactcctaccccatgcccaggaggaggct
gaggctgaggctgagttgcctccttccagtgaggtcttggcctcagtttttccagcccag
gacaagccaggtgagctgcaggccacactggaccacacggggcacacctcctccaagtcc
ctgcccaatttccccaatacctctgccaccgctaatgccacgggtgggcgtgccctggct
ctgcagtcgtccttgccaggtgcagagggccctgacaagcctagcgtcgtgtcagggctg
aactcgggccctggtcatgtgtggggccctctcctgggactactgctcctgcctcctctg
gtgttggctggaatcttctga

>gi568815592r:36755816_36974772|GENSCAN_predicted_peptide_5|174_aa
XGLIPHLLGDVVFLWGCNLLAHFINAYLVDDSVSDTPGGLGNDQNPGSQARIRLWVSGCM
SVLALTACQALDVDQADTGSALEAHSPFSQALAIRSYTKFVMGIAVSMLTYPFLLVGDLM
AVNNCGLQAGLPPYSPVFKSWIHCWKYLSVQGQLFRGSSLLFRRVSSGSCFALE

>gi568815592r:36755816_36974772|GENSCAN_predicted_CDS_5|525_bp
nntggattaatccctcacctcctgggcgatgtggttttcttgtggggctgtaacctgctg
gcccacttcatcaatgcctacctggtggatgacagcgtgagtgacaccccaggggggctg
ggaaacgaccagaatccaggttcccaggccaggattcgtctttgggtttcagggtgcatg
tctgtcttagcactgactgcgtgccaagccctggatgttgatcaggcagacacgggctca
gcccttgaggctcacagtccgttcagccaggccctggccatccggagctataccaagttc
gtgatggggattgcagtgagcatgctgacctaccccttcctgctagttggcgacctcatg
gctgtgaacaactgcgggctgcaagctgggctccccccttactccccagtgttcaaatcc
tggattcactgctggaagtacctgagtgtgcagggccagctcttccgaggctccagcctg
cttttccgccgggtgtcatcaggatcatgctttgccctggagtaa