GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:23:11 Sequence gi568815575r:119686687_119890012 : 203326 bp : 46.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6311 6537 227 1 2 65 56 186 0.310 9.20 1.02 Term + 8917 8980 64 1 1 97 42 92 0.403 2.86 1.03 PlyA + 9353 9358 6 1.05 2.00 Prom + 24316 24355 40 -3.66 2.01 Init + 54458 54464 7 1 1 99 83 0 0.380 1.69 2.02 Intr + 64050 64325 276 1 0 75 68 144 0.593 8.59 2.03 Term + 64418 64548 131 0 2 37 55 89 0.640 -1.16 2.04 PlyA + 67267 67272 6 1.05 3.00 Prom + 69003 69042 40 -7.06 3.01 Sngl + 71982 72929 948 2 0 91 48 1042 0.983 95.08 3.02 PlyA + 74879 74884 6 1.05 4.04 PlyA - 75291 75286 6 1.05 4.03 Term - 78940 78827 114 1 0 58 54 77 0.218 -0.13 4.02 Intr - 83160 83112 49 2 1 60 113 28 0.036 1.18 4.01 Init - 105342 105281 62 0 2 80 51 122 0.251 6.44 4.00 Prom - 112307 112268 40 -6.16 5.00 Prom + 119325 119364 40 -2.46 5.01 Init + 119449 119581 133 0 1 78 47 46 0.537 -0.20 5.02 Term + 120818 120906 89 0 2 62 39 128 0.634 3.12 5.03 PlyA + 123277 123282 6 1.05 6.11 PlyA - 126867 126862 6 1.05 6.10 Term - 136357 136267 91 0 1 84 43 130 0.692 5.29 6.09 Intr - 147963 147873 91 1 1 6 69 87 0.079 -2.35 6.08 Intr - 148341 148227 115 0 1 87 12 141 0.124 6.42 6.07 Intr - 151365 151071 295 2 1 78 48 408 0.989 32.81 6.06 Intr - 151841 151681 161 2 2 41 78 198 0.999 12.89 6.05 Intr - 153998 153960 39 2 0 112 73 9 0.536 0.22 6.04 Intr - 154572 154390 183 0 0 70 70 121 0.985 8.48 6.03 Intr - 155092 155049 44 2 2 53 99 35 0.968 -0.94 6.02 Intr - 164915 164809 107 1 2 81 113 39 0.920 5.56 6.01 Init - 166242 165893 350 0 2 93 60 202 0.413 14.57 6.00 Prom - 176926 176887 40 -1.06 7.02 PlyA - 178573 178568 6 1.05 7.01 Sngl - 184927 183896 1032 1 0 57 39 1260 0.996 115.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 35997 36061 65 1 2 116 47 42 0.875 0.95 S.002 Term - 148341 148192 150 0 0 87 44 139 0.865 7.31 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:119686687_119890012|GENSCAN_predicted_peptide_1|96_aa MWRRRPGLLDPHQALGPGPGTQALHLPPGELYRSLPSLLRPPGLPRVPPQRELLHRKQRS SSDKEKVLHFGRDQVRDYQNLTASSSSKTLNLEDLN >gi568815575r:119686687_119890012|GENSCAN_predicted_CDS_1|291_bp atgtggcggcggcggccaggtctccttgaccctcaccaggccctcggccccggccctggc acccaagctctgcacttaccacctggcgagctatatcggtcgctgccatcgctcctgcgt ccgccagggctgccacgggtgccgccgcaacgggagctactgcacaggaaacagaggagc tcctccgacaaagagaaagtgttacacttcgggcgcgaccaagtccgagactaccagaac ctcactgcgtcctcctcctccaaaacgctgaacctcgaagacctgaactaa >gi568815575r:119686687_119890012|GENSCAN_predicted_peptide_2|137_aa MKHSGAQLASPSGSRTGAAGGAACQSRAVCLHFSALGRSMGLGAVEQGVVLVGEAGAAQE PMEWVGGSGMAGCRSRALPRGKAAKAQREIQHSAGPAGCSECGARQAHAHPELQLARKRC AQPRFLLAPLPPHLPAS >gi568815575r:119686687_119890012|GENSCAN_predicted_CDS_2|414_bp atgaaacactcaggagcccagctggcttcacctagtggatcccgcactggggccgcaggt ggagctgcctgccagtcccgtgccgtgtgcttgcacttctcagcccttgggcggtcgatg ggactgggcgccgtggagcagggggtggtgctcgtcggggaggctggggccgcacaggag cccatggagtgggtgggaggctcaggcatggcgggctgcaggtcccgagccctgccccgc gggaaggcagctaaggcccagcgagaaatccagcacagcgccgggccggccggctgctcc gagtgcggggcccgccaagcccacgcccacccagaactccagctggcccgcaagcgctgc gcgcagccccggttcctgctcgcgcctctccctccacacctccctgcaagctga >gi568815575r:119686687_119890012|GENSCAN_predicted_peptide_3|315_aa MAQLGGAANRAPTASLAPTSQSLRCAPQPRPSRADTGSLGRYWGKAAAAASREHPFPGTL MHSAAGSGRRRGALRELLGLQRAAPAGWLSEERAEELGGPSGPGSSRLCLEPREHAWILA AAEGRYEVLRELLEAEPELLLRGDPITGYSVLHWLAKHGRHEELILVHDFALRRGLRLDV SAPGSGGLTPLHLAALQGHDMVIKVLVGALGADATRRDHSGHRACHYLRPDAPWRLRELS GAEEWEMESGSGCTNLNNNSSGTTAWRAASAVGATAVETSRRVAASRTKAKDTAGSRVAQ MHSLFRHLFPSFQDR >gi568815575r:119686687_119890012|GENSCAN_predicted_CDS_3|948_bp atggcccagctcggaggggccgcgaaccgggcacccacggcctctctcgcgccgacctcg cagagcctgcggtgcgccccgcagccccgcccctcgagagcggacactggtagcctgggc aggtactggggcaaagccgcagccgccgcctcccgggagcaccccttcccaggcacgctg atgcactctgcagcgggctcagggcgccggcggggagcgctgcgggaactgctggggctg cagcgggcggctcctgcggggtggctgtcggaggagcgcgccgaggagctgggcgggccg agtgggccgggcagcagcaggctgtgcctggaaccgcgggagcacgcgtggattctggca gccgccgagggccgctatgaggtgctgcgggagctgctggaggctgagccggagctgctg ctgcggggcgacccgatcaccggctactcggttctgcactggctggccaagcacgggcgc cacgaggagctcattctggtacacgatttcgccctacgccgggggctgaggctcgacgtg agcgccccaggcagcggcggcctcacgcccctccacctggcggcccttcagggccacgac atggtcatcaaggtgctggtgggcgccctgggtgctgacgctacgcgccgcgaccacagc ggccaccgggcctgccactacctgcggcccgacgcgccttggaggttgcgggagctgtcg ggagccgaggaatgggagatggagagcggcagcgggtgcaccaacctgaacaacaacagc agtggcaccactgcgtggagggccgcgagcgcagtgggcgcgacggctgtggagacaagc aggagagtggcagcgtcgcggaccaaggcgaaggacaccgcgggcagccgggtggcgcaa atgcatagccttttccgccatctgttcccctcattccaggaccgttga >gi568815575r:119686687_119890012|GENSCAN_predicted_peptide_4|74_aa MALTLRLPSSDPAFRVPGARSGFVVSLTSGMKLHTLTVEVAKPSVTLQFCKSADLAPCGS PKAYGFGSSSQSST >gi568815575r:119686687_119890012|GENSCAN_predicted_CDS_4|225_bp atggcacttactctccgcctgccgagctccgacccggcgttccgagtccccggcgcccgg agtgggttcgtggtctcactgacttcaggaatgaagctgcatacgctcacggtggaagtt gccaagccttcagtcactcttcaattttgcaagtctgcagacttagcaccatgtggaagc cccaaggcttatggctttgggagcagcagccagagcagtacctga >gi568815575r:119686687_119890012|GENSCAN_predicted_peptide_5|73_aa MEYYAAIKNDEFMSFVGTWMKLEIIILSKLLQEQKTKHRIFSLIDWCCHLATRIDLGFEL LDGKASSLDDPEV >gi568815575r:119686687_119890012|GENSCAN_predicted_CDS_5|222_bp atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg aaactggaaatcatcattctcagtaaactattgcaagaacaaaaaaccaaacaccgcata ttctcactcatagattggtgctgccatcttgccaccagaatcgacctgggcttcgagctc ctggatggtaaagcttcatccctggatgatcctgaagtataa >gi568815575r:119686687_119890012|GENSCAN_predicted_peptide_6|491_aa MKEEKEHRPKEKRVTLLTPAGATGSGGGTSGDSSKGEDKQDRNKEKKEALSKVPGRPLAR ERAGRVRSAGRMAGNGAAPFLSQVLLLSLRLNGSRIILVHPFTRNGESPSRPNPGWGLYP HMYARAYINFKNQEDIILFRDRFDGYVFLDNKAKKTTPLLSFLKNKQRMREEKREERRRR EIERKRQREEERRKWKEEEKRKRKDIEKLKKIDRIPERDKLKDEPKIKVHRFLLQAVNQK NLLKKPEKGDEKELDKREKAKKLDKENLSDERASGQSCTLPKRSDSELKDEKPKRPEDES GRDYREREREYERDQERILRERERLKRQEEERRRQKERYEKEKTFKRKEEEMKKEKDTLR DKGKKAESTESIGSSEKTEKKEEVVKRDRIRNKDRPAMQLYQPGARSRNRLCPPDDSTKS GDSAAERKQESVSEKGDGKDMHCNLQGELSSEDFITYDTREKTVHGCSVTIEELEAEVDS GTLMTSLEIQQ >gi568815575r:119686687_119890012|GENSCAN_predicted_CDS_6|1476_bp atgaaggaagagaaggagcacaggcctaaggagaagcgagtaaccctgttaacccccgcc ggggccacaggcagcggtggtgggacctcgggggacagctccaagggggaagataagcag gatcgcaacaaggagaagaaagaagcgctgagcaaggtgcccggtcggccgctggcccgg gagagggcgggacgagtccgcagcgcggggaggatggctgggaatggggctgcccccttt ctttctcaggttctccttctatctcttcgcctgaatggaagtcgcatcatcctggttcat ccattcacccgaaatggggagagcccctcccggccaaatcctgggtggggtttgtatcct catatgtatgccagagcatacatcaactttaaaaaccaagaggacattattttgttcagg gatcgctttgatggttatgtattccttgacaataaagctaaaaagacaaccccacttttg agcttcctgaaaaacaagcagagaatgagagaagaaaagagagaagaaaggaggaggaga gaaatagaaagaaaaagacaaagagaagaagagaggaggaaatggaaagaagaagagaaa cgaaaaaggaaagatatagaaaagctaaagaagatagacagaattccagaaagggacaaa ttaaaggatgaaccaaagattaaggtacacaggtttctgttacaagctgtgaatcagaaa aatctgctcaagaagccagaaaaaggagatgaaaaagaattggacaaaagagaaaaagcc aagaaattggacaaagagaatctcagtgatgaaagagccagtgggcaaagttgtacattg cccaagcgttctgatagcgaacttaaagatgaaaaaccaaagagacctgaagatgagagc ggcagagactatagggagagggaacgggaatatgaacgagatcaggagcgcatacttcga gaaagagagaggctgaagcggcaagaagaagagcgccgtaggcagaaggagcgctatgag aaagagaagacttttaagagaaaagaagaagaaatgaaaaaagagaaagacacacttcgg gataaaggaaagaaggctgaaagtacagaatcaataggcagctcagaaaaaactgaaaag aaagaagaagtggtcaagagagatcgaataagaaacaaggatcgtccagcgatgcagctt taccaaccaggagctcgaagccgaaatcgactctgtccccctgatgacagcaccaagtct ggagattcagcagcagaaaggaagcaggaaagtgtgtcagaaaaaggtgacggcaaggac atgcattgcaatttgcagggggaattgtcaagtgaggacttcatcacatatgacacgaga gaaaagactgtccatggatgcagtgttaccatcgaggagctcgaggccgaggtcgactct ggcaccctgatgacaagtctggagattcagcagtag >gi568815575r:119686687_119890012|GENSCAN_predicted_peptide_7|343_aa MAEQLSPGKAVDQVCTFLFKKPGRKGAAGRRKRPACDPEPGESGSSSDEGCTVVRPEKKR VTHNPMIQKTRDSGKQKAAYGDLSSEEEEENEPESLGVVYKSTRSAKPVGPEDMGATAVY ELDTEKERDAQAIFERSQKIQEELRGKEDDKIYRGINNYQKYMKPKDTSMGNASSGMVRK GPIRAPEHLRATVRWDYQPDICKDYKETGFCGFGDSCKFLHDRSDYKHGWQIERELDEGR YGVYEDENYEVGSDDEEIPFKCFICRQSFQNPVVTKCRHYFCESCALQHFRTTPRCYVCD QQTNGVFNPAKELIAKLEKHRATGEGGASDLPEDPDEDAIPIT >gi568815575r:119686687_119890012|GENSCAN_predicted_CDS_7|1032_bp atggcagagcagctttctccaggaaaggcggtggatcaggtgtgcaccttccttttcaaa aagcctgggcggaaaggggctgctggacgcagaaagcgcccggcctgcgacccagagccc ggagaaagcggcagcagtagcgacgaaggctgcactgtggttcgaccggaaaagaagcgg gtgacccacaatccaatgatacagaagacccgtgacagtggtaaacagaaggcggcttac ggcgacttgagcagcgaagaggaagaggaaaatgagcccgagagtctcggcgtggtttat aaatccacccgttcggcgaaacccgtgggaccagaggatatgggagcgacagctgtctat gagctggacacagagaaagagcgcgatgcacaagccatctttgagcgcagccagaagatc caggaggagctgaggggcaaggaggatgacaagatctatcggggaatcaacaattatcag aaatacatgaagcccaaggatacgtctatgggcaatgcctcttccgggatggtgaggaag ggccccatccgagcgcccgagcatctacgtgccaccgtgcgctgggattaccagcccgac atctgtaaggactacaaagagactggcttctgcggcttcggagacagctgcaaattcctc catgaccgttcagattacaagcatgggtggcagatcgaacgtgagcttgatgagggtcgc tatggtgtctatgaggatgaaaactatgaagtgggaagcgatgatgaggaaataccattc aagtgtttcatctgtcgccagagcttccaaaacccagttgtcaccaagtgcaggcattat ttctgcgagagctgtgcactgcagcatttccgcaccaccccgcgctgctatgtctgtgac cagcagaccaatggcgtcttcaatccagcgaaagaattgattgctaaactagagaagcat cgagctacaggagagggtggtgcttccgacttgccagaagaccccgatgaggatgcaatt cccattacttag