GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:12:21 Sequence gi568815575f:119771912_119973394 : 201483 bp : 44.54% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 19820 19942 123 0 0 67 88 49 0.015 3.36 1.02 Term + 35593 35681 89 2 2 62 39 128 0.634 3.12 1.03 PlyA + 38052 38057 6 1.05 2.11 PlyA - 41642 41637 6 1.05 2.10 Term - 51132 51042 91 2 1 84 43 130 0.692 5.29 2.09 Intr - 62738 62648 91 0 1 6 69 87 0.079 -2.35 2.08 Intr - 63116 63002 115 2 1 87 12 141 0.124 6.42 2.07 Intr - 66140 65846 295 1 1 78 48 408 0.989 32.81 2.06 Intr - 66616 66456 161 1 2 41 78 198 0.999 12.89 2.05 Intr - 68773 68735 39 1 0 112 73 9 0.536 0.22 2.04 Intr - 69347 69165 183 2 0 70 70 121 0.985 8.48 2.03 Intr - 69867 69824 44 1 2 53 99 35 0.968 -0.94 2.02 Intr - 79690 79584 107 0 2 81 113 39 0.920 5.56 2.01 Init - 81017 80668 350 2 2 93 60 202 0.413 14.57 2.00 Prom - 91701 91662 40 -1.06 3.02 PlyA - 93348 93343 6 1.05 3.01 Sngl - 99702 98671 1032 0 0 57 39 1260 0.996 115.11 3.00 Prom - 101366 101327 40 -5.16 4.00 Prom + 113077 113116 40 -2.46 4.01 Init + 118947 119107 161 2 2 87 46 137 0.101 7.02 4.02 Intr + 131303 131481 179 2 2 84 67 191 0.272 16.16 4.03 Intr + 131584 131675 92 2 2 94 123 65 0.998 10.31 4.04 Intr + 142788 142967 180 2 0 59 131 70 0.693 8.46 4.05 Intr + 148000 148052 53 0 2 110 95 15 0.985 2.11 4.06 Term + 148597 148696 100 1 1 119 43 78 0.986 4.00 4.07 PlyA + 148786 148791 6 1.05 5.07 PlyA - 149759 149754 6 1.05 5.06 Term - 153483 153309 175 2 1 21 43 223 0.994 8.33 5.05 Intr - 158254 158105 150 0 0 72 91 95 0.892 7.48 5.04 Intr - 160100 160025 76 0 1 50 74 61 0.787 -0.53 5.03 Intr - 164520 164386 135 1 0 48 83 84 0.812 4.44 5.02 Intr - 164771 164701 71 1 2 50 63 85 0.770 1.03 5.01 Init - 171694 171309 386 1 2 100 81 191 0.861 14.86 5.00 Prom - 172128 172089 40 -2.66 6.00 Prom + 183507 183546 40 -4.16 6.01 Sngl + 193667 194101 435 1 0 60 39 161 0.631 4.67 6.02 PlyA + 194203 194208 6 1.05 7.00 Prom + 196302 196341 40 -2.46 7.01 Init + 199331 199465 135 1 0 75 83 71 0.124 5.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 63116 62967 150 2 0 87 44 139 0.865 7.31 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:119771912_119973394|GENSCAN_predicted_peptide_1|70_aa XVRSRGAVGKRAGFCQMQLALLRFPKCVPPGARQCARGTLLHWCCHLATRIDLGFELLDG KASSLDDPEV >gi568815575f:119771912_119973394|GENSCAN_predicted_CDS_1|213_bp ncggtgcgctcccgtggcgccgtgggaaagcgcgctggtttctgccaaatgcagttagct ttgcttcgcttccctaagtgcgttcccccaggggcccggcagtgtgctaggggcaccctg cttcattggtgctgccatcttgccaccagaatcgacctgggcttcgagctcctggatggt aaagcttcatccctggatgatcctgaagtataa >gi568815575f:119771912_119973394|GENSCAN_predicted_peptide_2|491_aa MKEEKEHRPKEKRVTLLTPAGATGSGGGTSGDSSKGEDKQDRNKEKKEALSKVPGRPLAR ERAGRVRSAGRMAGNGAAPFLSQVLLLSLRLNGSRIILVHPFTRNGESPSRPNPGWGLYP HMYARAYINFKNQEDIILFRDRFDGYVFLDNKAKKTTPLLSFLKNKQRMREEKREERRRR EIERKRQREEERRKWKEEEKRKRKDIEKLKKIDRIPERDKLKDEPKIKVHRFLLQAVNQK NLLKKPEKGDEKELDKREKAKKLDKENLSDERASGQSCTLPKRSDSELKDEKPKRPEDES GRDYREREREYERDQERILRERERLKRQEEERRRQKERYEKEKTFKRKEEEMKKEKDTLR DKGKKAESTESIGSSEKTEKKEEVVKRDRIRNKDRPAMQLYQPGARSRNRLCPPDDSTKS GDSAAERKQESVSEKGDGKDMHCNLQGELSSEDFITYDTREKTVHGCSVTIEELEAEVDS GTLMTSLEIQQ >gi568815575f:119771912_119973394|GENSCAN_predicted_CDS_2|1476_bp atgaaggaagagaaggagcacaggcctaaggagaagcgagtaaccctgttaacccccgcc ggggccacaggcagcggtggtgggacctcgggggacagctccaagggggaagataagcag gatcgcaacaaggagaagaaagaagcgctgagcaaggtgcccggtcggccgctggcccgg gagagggcgggacgagtccgcagcgcggggaggatggctgggaatggggctgcccccttt ctttctcaggttctccttctatctcttcgcctgaatggaagtcgcatcatcctggttcat ccattcacccgaaatggggagagcccctcccggccaaatcctgggtggggtttgtatcct catatgtatgccagagcatacatcaactttaaaaaccaagaggacattattttgttcagg gatcgctttgatggttatgtattccttgacaataaagctaaaaagacaaccccacttttg agcttcctgaaaaacaagcagagaatgagagaagaaaagagagaagaaaggaggaggaga gaaatagaaagaaaaagacaaagagaagaagagaggaggaaatggaaagaagaagagaaa cgaaaaaggaaagatatagaaaagctaaagaagatagacagaattccagaaagggacaaa ttaaaggatgaaccaaagattaaggtacacaggtttctgttacaagctgtgaatcagaaa aatctgctcaagaagccagaaaaaggagatgaaaaagaattggacaaaagagaaaaagcc aagaaattggacaaagagaatctcagtgatgaaagagccagtgggcaaagttgtacattg cccaagcgttctgatagcgaacttaaagatgaaaaaccaaagagacctgaagatgagagc ggcagagactatagggagagggaacgggaatatgaacgagatcaggagcgcatacttcga gaaagagagaggctgaagcggcaagaagaagagcgccgtaggcagaaggagcgctatgag aaagagaagacttttaagagaaaagaagaagaaatgaaaaaagagaaagacacacttcgg gataaaggaaagaaggctgaaagtacagaatcaataggcagctcagaaaaaactgaaaag aaagaagaagtggtcaagagagatcgaataagaaacaaggatcgtccagcgatgcagctt taccaaccaggagctcgaagccgaaatcgactctgtccccctgatgacagcaccaagtct ggagattcagcagcagaaaggaagcaggaaagtgtgtcagaaaaaggtgacggcaaggac atgcattgcaatttgcagggggaattgtcaagtgaggacttcatcacatatgacacgaga gaaaagactgtccatggatgcagtgttaccatcgaggagctcgaggccgaggtcgactct ggcaccctgatgacaagtctggagattcagcagtag >gi568815575f:119771912_119973394|GENSCAN_predicted_peptide_3|343_aa MAEQLSPGKAVDQVCTFLFKKPGRKGAAGRRKRPACDPEPGESGSSSDEGCTVVRPEKKR VTHNPMIQKTRDSGKQKAAYGDLSSEEEEENEPESLGVVYKSTRSAKPVGPEDMGATAVY ELDTEKERDAQAIFERSQKIQEELRGKEDDKIYRGINNYQKYMKPKDTSMGNASSGMVRK GPIRAPEHLRATVRWDYQPDICKDYKETGFCGFGDSCKFLHDRSDYKHGWQIERELDEGR YGVYEDENYEVGSDDEEIPFKCFICRQSFQNPVVTKCRHYFCESCALQHFRTTPRCYVCD QQTNGVFNPAKELIAKLEKHRATGEGGASDLPEDPDEDAIPIT >gi568815575f:119771912_119973394|GENSCAN_predicted_CDS_3|1032_bp atggcagagcagctttctccaggaaaggcggtggatcaggtgtgcaccttccttttcaaa aagcctgggcggaaaggggctgctggacgcagaaagcgcccggcctgcgacccagagccc ggagaaagcggcagcagtagcgacgaaggctgcactgtggttcgaccggaaaagaagcgg gtgacccacaatccaatgatacagaagacccgtgacagtggtaaacagaaggcggcttac ggcgacttgagcagcgaagaggaagaggaaaatgagcccgagagtctcggcgtggtttat aaatccacccgttcggcgaaacccgtgggaccagaggatatgggagcgacagctgtctat gagctggacacagagaaagagcgcgatgcacaagccatctttgagcgcagccagaagatc caggaggagctgaggggcaaggaggatgacaagatctatcggggaatcaacaattatcag aaatacatgaagcccaaggatacgtctatgggcaatgcctcttccgggatggtgaggaag ggccccatccgagcgcccgagcatctacgtgccaccgtgcgctgggattaccagcccgac atctgtaaggactacaaagagactggcttctgcggcttcggagacagctgcaaattcctc catgaccgttcagattacaagcatgggtggcagatcgaacgtgagcttgatgagggtcgc tatggtgtctatgaggatgaaaactatgaagtgggaagcgatgatgaggaaataccattc aagtgtttcatctgtcgccagagcttccaaaacccagttgtcaccaagtgcaggcattat ttctgcgagagctgtgcactgcagcatttccgcaccaccccgcgctgctatgtctgtgac cagcagaccaatggcgtcttcaatccagcgaaagaattgattgctaaactagagaagcat cgagctacaggagagggtggtgcttccgacttgccagaagaccccgatgaggatgcaatt cccattacttag >gi568815575f:119771912_119973394|GENSCAN_predicted_peptide_4|254_aa MPEPPTLSVGSCAARASPTSAAPCSTALSPINHPRAEECGRTRRTGRQLHLQPWKKKMSE TQNSTSQKAMDEDNKAASQTMPNTQDKNYEDELTQVALALVEDVINYAVKIVEEERNPLK NIKWMTHGEFTVEKGLKQIDEYFSKCVSKKCWAHGVEFVERKDLIHSFLYIYYVHWSIST ADLPVARISAGTYFTMKVSKTKPPDAPIVVSYVGDHQALVHRPGMVRFRENWQKNLTDAK YSFMESFPFLFNRV >gi568815575f:119771912_119973394|GENSCAN_predicted_CDS_4|765_bp atgcctgagcctcccaccctctccgtgggctcctgtgcggcccgagcctctccgacgagc gccgccccctgctccacggcgctcagtccgatcaaccacccaagggctgaggagtgcggg cgcacgcgcaggactggcaggcagctccacctgcagccctggaaaaagaaaatgagtgag actcaaaattcaacaagccagaaagcaatggatgaggataacaaagccgcaagccaaaca atgccgaatacacaagacaagaactacgaggatgaattgactcaagtagctctagctctg gttgaggatgtcatcaattatgctgttaagattgtggaagaggagcgaaaccctttgaaa aacatcaagtggatgactcacggtgaattcactgtggaaaagggtcttaaacaaattgac gaatatttttcgaagtgtgtttctaaaaaatgctgggcacatggcgtagagtttgtagag aggaaagacttaattcacagcttcctctacatctactatgtacactggagtatctcaact gctgacctacccgtagcacgaatctctgctggtacctacttcaccatgaaggtctccaaa accaaaccaccggatgcacccattgttgtttcttatgtaggtgaccaccaagcattagtt cacagaccaggaatggttcgctttcgagaaaactggcagaagaatcttactgatgccaaa tatagtttcatggagtcattccccttcttattcaatcgtgtctga >gi568815575f:119771912_119973394|GENSCAN_predicted_peptide_5|330_aa MAPVSGSRSPDREASGSGGRRRSSSKSPKPSKSARSPRGRRSRSHSCSRSGDRNGLTHQL GGLSQGSRNQSYRSRSRSRSRERPSAPRGIPFASASSSVYYGSYSRPYGSDKPWPSLLDK EREESLRQNSDEHTPVEDEEPKKSTTSASTSEEEKKKKSSRSKERSKKRRKKKSSKRKHK KYSEDSDSDSDSETDSSEAEEPSDLIGPEAPKTLTSQDDKPLNYGHALLPGEGAAMAEYV KAGKRIPRRGEIGLTSEEIASFECSGYVMSGSRHRRMEAVRLRKENQIYSADEKRALASF NQEERRKRENKILASFREMVYRKTKGKDDK >gi568815575f:119771912_119973394|GENSCAN_predicted_CDS_5|993_bp atggctccggtgtccggctcacgcagcccggatagggaggcctcgggctcggggggaaga cgtcgcagttcgtcgaagagtccgaagcccagcaaatctgcccgctccccgcggggccgc cgctctcgctcgcactcttgctctcggtccggggaccggaatggactcacccatcagctg ggtggcctcagccaaggctcccgaaaccagtcctaccgctcacgctcgcggtcgcgttct agagagcggccctctgcgccccggggcatccccttcgcttctgcctcctcgtcagtctat tacggcagctactcgcgcccctacgggagcgacaagccttggcctagcctcctcgacaag gagagggaggagagcctgcggcagaattctgatgaacatacaccagtggaggatgaagag ccaaagaaaagcactacttcagcttctacttcagaagaagaaaaaaagaagaagtctagc cgttcaaaagaaaggtccaagaaaaggagaaagaaaaaatcatcgaaaagaaaacataag aagtattctgaagatagcgacagtgactctgattctgaaacagactccagtgaggctgaa gaaccatcagatttaattggcccagaggctccaaaaacacttacctctcaagatgataaa cctttgaactatggccatgctctgttacctggtgaaggtgcagctatggctgaatatgta aaagctggaaaacgtatcccacgaagaggtgaaattggcttgacaagtgaagaaattgca tcatttgaatgctcaggttatgtaatgagtggtagcaggcatcgccgaatggaggctgtg cgactgcgaaaagagaaccagatctacagtgctgatgagaagagagcccttgcatccttt aaccaagaagagagacgaaagagagagaacaagattctggccagttttcgagaaatggtt tacagaaagaccaaagggaaggatgacaaataa >gi568815575f:119771912_119973394|GENSCAN_predicted_peptide_6|144_aa MSELPFRIASKRIKYLGIQLTRDVKDLFKQNYKPLLNEIKEDTNKWKNILCSWIGRINIV KMAILPKVIYRFNVIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKTILSQKNKAGGITLC DLKLYYKATVTKTAWYWYQNRDID >gi568815575f:119771912_119973394|GENSCAN_predicted_CDS_6|435_bp atgagtgaactcccattcagaattgcttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaagcagaactacaaaccactgctcaacgaaatcaaa gaggacacaaacaaatggaagaacattctatgctcatggataggaagaatcaatattgtg aaaatggccatactgcccaaggtaatttatagattcaatgtcatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagg gcccacattgctaagacaatcctaagccaaaagaacaaagctggaggcatcacgctatgt gacctcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagactaa >gi568815575f:119771912_119973394|GENSCAN_predicted_peptide_7|45_aa MRTVVEEAYLAFCEEGTNCINLSRPESVNTRTLNTFKRGRLPFPE >gi568815575f:119771912_119973394|GENSCAN_predicted_CDS_7|135_bp atgaggactgttgtagaagaagcatacttggccttttgtgaagagggtacaaattgcatc aacctctcaaggcctgaaagtgtgaacacaaggacacttaataccttcaagagaggtcga ttacctttcccagag