GENSCAN 1.0    Date run: 4-Nov-116    Time: 18:02:26

Sequence gi568815575r:119770585_119971613 : 201029 bp : 44.57% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  21147  21269  123  1  0   67   88    49 0.015   3.36
 1.02 Term +  36920  37008   89  0  2   62   39   128 0.634   3.12
 1.03 PlyA +  39379  39384    6                               1.05

 2.11 PlyA -  42969  42964    6                               1.05
 2.10 Term -  52459  52369   91  0  1   84   43   130 0.692   5.29
 2.09 Intr -  64065  63975   91  1  1    6   69    87 0.079  -2.35
 2.08 Intr -  64443  64329  115  0  1   87   12   141 0.124   6.42
 2.07 Intr -  67467  67173  295  2  1   78   48   408 0.989  32.81
 2.06 Intr -  67943  67783  161  2  2   41   78   198 0.999  12.89
 2.05 Intr -  70100  70062   39  2  0  112   73     9 0.536   0.22
 2.04 Intr -  70674  70492  183  0  0   70   70   121 0.985   8.48
 2.03 Intr -  71194  71151   44  2  2   53   99    35 0.968  -0.94
 2.02 Intr -  81017  80911  107  1  2   81  113    39 0.920   5.56
 2.01 Init -  82344  81995  350  0  2   93   60   202 0.413  14.57
 2.00 Prom -  93028  92989   40                              -1.06

 3.02 PlyA -  94675  94670    6                               1.05
 3.01 Sngl - 101029  99998 1032  1  0   57   39  1260 0.996 115.11
 3.00 Prom - 102693 102654   40                              -5.16

 4.00 Prom + 114404 114443   40                              -2.46
 4.01 Init + 120274 120434  161  0  2   87   46   137 0.101   7.02
 4.02 Intr + 132630 132808  179  0  2   84   67   191 0.272  16.16
 4.03 Intr + 132911 133002   92  0  2   94  123    65 0.998  10.31
 4.04 Intr + 144115 144294  180  0  0   59  131    70 0.693   8.46
 4.05 Intr + 149327 149379   53  1  2  110   95    15 0.985   2.11
 4.06 Term + 149924 150023  100  2  1  119   43    78 0.986   4.00
 4.07 PlyA + 150113 150118    6                               1.05

 5.07 PlyA - 151086 151081    6                               1.05
 5.06 Term - 154810 154636  175  0  1   21   43   223 0.994   8.33
 5.05 Intr - 159581 159432  150  1  0   72   91    95 0.892   7.48
 5.04 Intr - 161427 161352   76  1  1   50   74    61 0.787  -0.53
 5.03 Intr - 165847 165713  135  2  0   48   83    84 0.812   4.44
 5.02 Intr - 166098 166028   71  2  2   50   63    85 0.770   1.03
 5.01 Init - 173021 172636  386  2  2  100   81   191 0.861  14.86
 5.00 Prom - 173455 173416   40                              -2.66

 6.00 Prom + 184834 184873   40                              -4.16
 6.01 Sngl + 194994 195428  435  2  0   60   39   161 0.639   4.67
 6.02 PlyA + 195530 195535    6                               1.05

 7.00 Prom + 197629 197668   40                              -2.46
 7.01 Init + 200658 200792  135  2  0   75   83    71 0.225   5.44

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  64443  64294  150  0  0   87   44   139 0.865   7.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:119770585_119971613|GENSCAN_predicted_peptide_1|70_aa
XVRSRGAVGKRAGFCQMQLALLRFPKCVPPGARQCARGTLLHWCCHLATRIDLGFELLDG
KASSLDDPEV

>gi568815575r:119770585_119971613|GENSCAN_predicted_CDS_1|213_bp
ncggtgcgctcccgtggcgccgtgggaaagcgcgctggtttctgccaaatgcagttagct
ttgcttcgcttccctaagtgcgttcccccaggggcccggcagtgtgctaggggcaccctg
cttcattggtgctgccatcttgccaccagaatcgacctgggcttcgagctcctggatggt
aaagcttcatccctggatgatcctgaagtataa

>gi568815575r:119770585_119971613|GENSCAN_predicted_peptide_2|491_aa
MKEEKEHRPKEKRVTLLTPAGATGSGGGTSGDSSKGEDKQDRNKEKKEALSKVPGRPLAR
ERAGRVRSAGRMAGNGAAPFLSQVLLLSLRLNGSRIILVHPFTRNGESPSRPNPGWGLYP
HMYARAYINFKNQEDIILFRDRFDGYVFLDNKAKKTTPLLSFLKNKQRMREEKREERRRR
EIERKRQREEERRKWKEEEKRKRKDIEKLKKIDRIPERDKLKDEPKIKVHRFLLQAVNQK
NLLKKPEKGDEKELDKREKAKKLDKENLSDERASGQSCTLPKRSDSELKDEKPKRPEDES
GRDYREREREYERDQERILRERERLKRQEEERRRQKERYEKEKTFKRKEEEMKKEKDTLR
DKGKKAESTESIGSSEKTEKKEEVVKRDRIRNKDRPAMQLYQPGARSRNRLCPPDDSTKS
GDSAAERKQESVSEKGDGKDMHCNLQGELSSEDFITYDTREKTVHGCSVTIEELEAEVDS
GTLMTSLEIQQ

>gi568815575r:119770585_119971613|GENSCAN_predicted_CDS_2|1476_bp
atgaaggaagagaaggagcacaggcctaaggagaagcgagtaaccctgttaacccccgcc
ggggccacaggcagcggtggtgggacctcgggggacagctccaagggggaagataagcag
gatcgcaacaaggagaagaaagaagcgctgagcaaggtgcccggtcggccgctggcccgg
gagagggcgggacgagtccgcagcgcggggaggatggctgggaatggggctgcccccttt
ctttctcaggttctccttctatctcttcgcctgaatggaagtcgcatcatcctggttcat
ccattcacccgaaatggggagagcccctcccggccaaatcctgggtggggtttgtatcct
catatgtatgccagagcatacatcaactttaaaaaccaagaggacattattttgttcagg
gatcgctttgatggttatgtattccttgacaataaagctaaaaagacaaccccacttttg
agcttcctgaaaaacaagcagagaatgagagaagaaaagagagaagaaaggaggaggaga
gaaatagaaagaaaaagacaaagagaagaagagaggaggaaatggaaagaagaagagaaa
cgaaaaaggaaagatatagaaaagctaaagaagatagacagaattccagaaagggacaaa
ttaaaggatgaaccaaagattaaggtacacaggtttctgttacaagctgtgaatcagaaa
aatctgctcaagaagccagaaaaaggagatgaaaaagaattggacaaaagagaaaaagcc
aagaaattggacaaagagaatctcagtgatgaaagagccagtgggcaaagttgtacattg
cccaagcgttctgatagcgaacttaaagatgaaaaaccaaagagacctgaagatgagagc
ggcagagactatagggagagggaacgggaatatgaacgagatcaggagcgcatacttcga
gaaagagagaggctgaagcggcaagaagaagagcgccgtaggcagaaggagcgctatgag
aaagagaagacttttaagagaaaagaagaagaaatgaaaaaagagaaagacacacttcgg
gataaaggaaagaaggctgaaagtacagaatcaataggcagctcagaaaaaactgaaaag
aaagaagaagtggtcaagagagatcgaataagaaacaaggatcgtccagcgatgcagctt
taccaaccaggagctcgaagccgaaatcgactctgtccccctgatgacagcaccaagtct
ggagattcagcagcagaaaggaagcaggaaagtgtgtcagaaaaaggtgacggcaaggac
atgcattgcaatttgcagggggaattgtcaagtgaggacttcatcacatatgacacgaga
gaaaagactgtccatggatgcagtgttaccatcgaggagctcgaggccgaggtcgactct
ggcaccctgatgacaagtctggagattcagcagtag

>gi568815575r:119770585_119971613|GENSCAN_predicted_peptide_3|343_aa
MAEQLSPGKAVDQVCTFLFKKPGRKGAAGRRKRPACDPEPGESGSSSDEGCTVVRPEKKR
VTHNPMIQKTRDSGKQKAAYGDLSSEEEEENEPESLGVVYKSTRSAKPVGPEDMGATAVY
ELDTEKERDAQAIFERSQKIQEELRGKEDDKIYRGINNYQKYMKPKDTSMGNASSGMVRK
GPIRAPEHLRATVRWDYQPDICKDYKETGFCGFGDSCKFLHDRSDYKHGWQIERELDEGR
YGVYEDENYEVGSDDEEIPFKCFICRQSFQNPVVTKCRHYFCESCALQHFRTTPRCYVCD
QQTNGVFNPAKELIAKLEKHRATGEGGASDLPEDPDEDAIPIT

>gi568815575r:119770585_119971613|GENSCAN_predicted_CDS_3|1032_bp
atggcagagcagctttctccaggaaaggcggtggatcaggtgtgcaccttccttttcaaa
aagcctgggcggaaaggggctgctggacgcagaaagcgcccggcctgcgacccagagccc
ggagaaagcggcagcagtagcgacgaaggctgcactgtggttcgaccggaaaagaagcgg
gtgacccacaatccaatgatacagaagacccgtgacagtggtaaacagaaggcggcttac
ggcgacttgagcagcgaagaggaagaggaaaatgagcccgagagtctcggcgtggtttat
aaatccacccgttcggcgaaacccgtgggaccagaggatatgggagcgacagctgtctat
gagctggacacagagaaagagcgcgatgcacaagccatctttgagcgcagccagaagatc
caggaggagctgaggggcaaggaggatgacaagatctatcggggaatcaacaattatcag
aaatacatgaagcccaaggatacgtctatgggcaatgcctcttccgggatggtgaggaag
ggccccatccgagcgcccgagcatctacgtgccaccgtgcgctgggattaccagcccgac
atctgtaaggactacaaagagactggcttctgcggcttcggagacagctgcaaattcctc
catgaccgttcagattacaagcatgggtggcagatcgaacgtgagcttgatgagggtcgc
tatggtgtctatgaggatgaaaactatgaagtgggaagcgatgatgaggaaataccattc
aagtgtttcatctgtcgccagagcttccaaaacccagttgtcaccaagtgcaggcattat
ttctgcgagagctgtgcactgcagcatttccgcaccaccccgcgctgctatgtctgtgac
cagcagaccaatggcgtcttcaatccagcgaaagaattgattgctaaactagagaagcat
cgagctacaggagagggtggtgcttccgacttgccagaagaccccgatgaggatgcaatt
cccattacttag

>gi568815575r:119770585_119971613|GENSCAN_predicted_peptide_4|254_aa
MPEPPTLSVGSCAARASPTSAAPCSTALSPINHPRAEECGRTRRTGRQLHLQPWKKKMSE
TQNSTSQKAMDEDNKAASQTMPNTQDKNYEDELTQVALALVEDVINYAVKIVEEERNPLK
NIKWMTHGEFTVEKGLKQIDEYFSKCVSKKCWAHGVEFVERKDLIHSFLYIYYVHWSIST
ADLPVARISAGTYFTMKVSKTKPPDAPIVVSYVGDHQALVHRPGMVRFRENWQKNLTDAK
YSFMESFPFLFNRV

>gi568815575r:119770585_119971613|GENSCAN_predicted_CDS_4|765_bp
atgcctgagcctcccaccctctccgtgggctcctgtgcggcccgagcctctccgacgagc
gccgccccctgctccacggcgctcagtccgatcaaccacccaagggctgaggagtgcggg
cgcacgcgcaggactggcaggcagctccacctgcagccctggaaaaagaaaatgagtgag
actcaaaattcaacaagccagaaagcaatggatgaggataacaaagccgcaagccaaaca
atgccgaatacacaagacaagaactacgaggatgaattgactcaagtagctctagctctg
gttgaggatgtcatcaattatgctgttaagattgtggaagaggagcgaaaccctttgaaa
aacatcaagtggatgactcacggtgaattcactgtggaaaagggtcttaaacaaattgac
gaatatttttcgaagtgtgtttctaaaaaatgctgggcacatggcgtagagtttgtagag
aggaaagacttaattcacagcttcctctacatctactatgtacactggagtatctcaact
gctgacctacccgtagcacgaatctctgctggtacctacttcaccatgaaggtctccaaa
accaaaccaccggatgcacccattgttgtttcttatgtaggtgaccaccaagcattagtt
cacagaccaggaatggttcgctttcgagaaaactggcagaagaatcttactgatgccaaa
tatagtttcatggagtcattccccttcttattcaatcgtgtctga

>gi568815575r:119770585_119971613|GENSCAN_predicted_peptide_5|330_aa
MAPVSGSRSPDREASGSGGRRRSSSKSPKPSKSARSPRGRRSRSHSCSRSGDRNGLTHQL
GGLSQGSRNQSYRSRSRSRSRERPSAPRGIPFASASSSVYYGSYSRPYGSDKPWPSLLDK
EREESLRQNSDEHTPVEDEEPKKSTTSASTSEEEKKKKSSRSKERSKKRRKKKSSKRKHK
KYSEDSDSDSDSETDSSEAEEPSDLIGPEAPKTLTSQDDKPLNYGHALLPGEGAAMAEYV
KAGKRIPRRGEIGLTSEEIASFECSGYVMSGSRHRRMEAVRLRKENQIYSADEKRALASF
NQEERRKRENKILASFREMVYRKTKGKDDK

>gi568815575r:119770585_119971613|GENSCAN_predicted_CDS_5|993_bp
atggctccggtgtccggctcacgcagcccggatagggaggcctcgggctcggggggaaga
cgtcgcagttcgtcgaagagtccgaagcccagcaaatctgcccgctccccgcggggccgc
cgctctcgctcgcactcttgctctcggtccggggaccggaatggactcacccatcagctg
ggtggcctcagccaaggctcccgaaaccagtcctaccgctcacgctcgcggtcgcgttct
agagagcggccctctgcgccccggggcatccccttcgcttctgcctcctcgtcagtctat
tacggcagctactcgcgcccctacgggagcgacaagccttggcctagcctcctcgacaag
gagagggaggagagcctgcggcagaattctgatgaacatacaccagtggaggatgaagag
ccaaagaaaagcactacttcagcttctacttcagaagaagaaaaaaagaagaagtctagc
cgttcaaaagaaaggtccaagaaaaggagaaagaaaaaatcatcgaaaagaaaacataag
aagtattctgaagatagcgacagtgactctgattctgaaacagactccagtgaggctgaa
gaaccatcagatttaattggcccagaggctccaaaaacacttacctctcaagatgataaa
cctttgaactatggccatgctctgttacctggtgaaggtgcagctatggctgaatatgta
aaagctggaaaacgtatcccacgaagaggtgaaattggcttgacaagtgaagaaattgca
tcatttgaatgctcaggttatgtaatgagtggtagcaggcatcgccgaatggaggctgtg
cgactgcgaaaagagaaccagatctacagtgctgatgagaagagagcccttgcatccttt
aaccaagaagagagacgaaagagagagaacaagattctggccagttttcgagaaatggtt
tacagaaagaccaaagggaaggatgacaaataa

>gi568815575r:119770585_119971613|GENSCAN_predicted_peptide_6|144_aa
MSELPFRIASKRIKYLGIQLTRDVKDLFKQNYKPLLNEIKEDTNKWKNILCSWIGRINIV
KMAILPKVIYRFNVIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKTILSQKNKAGGITLC
DLKLYYKATVTKTAWYWYQNRDID

>gi568815575r:119770585_119971613|GENSCAN_predicted_CDS_6|435_bp
atgagtgaactcccattcagaattgcttcaaagagaataaaatacctaggaatccaactt
acaagggatgtgaaggacctcttcaagcagaactacaaaccactgctcaacgaaatcaaa
gaggacacaaacaaatggaagaacattctatgctcatggataggaagaatcaatattgtg
aaaatggccatactgcccaaggtaatttatagattcaatgtcatccccatcaagctacca
atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagg
gcccacattgctaagacaatcctaagccaaaagaacaaagctggaggcatcacgctatgt
gacctcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac
agagatatagactaa

>gi568815575r:119770585_119971613|GENSCAN_predicted_peptide_7|45_aa
MRTVVEEAYLAFCEEGTNCINLSRPESVNTRTLNTFKRGRLPFPE

>gi568815575r:119770585_119971613|GENSCAN_predicted_CDS_7|135_bp
atgaggactgttgtagaagaagcatacttggccttttgtgaagagggtacaaattgcatc
aacctctcaaggcctgaaagtgtgaacacaaggacacttaataccttcaagagaggtcga
ttacctttcccagag