GENSCAN 1.0    Date run: 4-Nov-116    Time: 11:02:00

Sequence gi568815593r:140145977_140400584 : 254608 bp : 44.95% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 Intr -    508    363  146  1  2   50   80   152 0.181   9.68
 1.03 Intr -   6703   6581  123  1  0   74   85    46 0.444   3.68
 1.02 Intr -  29808  29574  235  2  1   68   76    88 0.200   3.29
 1.01 Init -  37819  37737   83  1  2   54  115    40 0.676   3.74
 1.00 Prom -  41578  41539   40                              -3.86

 2.00 Prom +  44942  44981   40                              -4.26
 2.01 Init +  45458  45467   10  1  1   91   98     5 0.802   2.33
 2.02 Intr +  48470  48676  207  0  0  114   89   132 0.953  14.95
 2.03 Term +  93113  93276  164  0  2   85   54    87 0.129   3.10
 2.04 PlyA +  95456  95461    6                               1.05

 3.14 PlyA -  97994  97989    6                               1.05
 3.13 Term - 100081  99998   84  1  0   95   50   135 0.873   8.05
 3.12 Intr - 135557 135473   85  1  1  109  100    59 0.227   9.02
 3.11 Intr - 154606 154440  167  1  2   43  101   140 0.739   9.56
 3.10 Intr - 157135 157065   71  2  2   40   93    83 0.092   2.90
 3.09 Intr - 157387 157214  174  2  0   40   65    98 0.023   2.61
 3.08 Intr - 173943 173909   35  2  2  102  121    -5 0.086   2.07
 3.07 Intr - 179028 178999   30  2  0  130   75     4 0.071   0.55
 3.06 Intr - 190051 189896  156  0  0   58   95    93 0.938   6.13
 3.05 Intr - 193961 193825  137  2  2   91   77    59 0.369   4.47
 3.04 Intr - 196107 196095   13  2  1   99   59    26 0.347  -4.42
 3.03 Intr - 196836 196659  178  1  1   68  103   199 0.999  18.38
 3.02 Intr - 200108 199935  174  0  0   83   90   225 0.996  22.11
 3.01 Init - 200352 200307   46  0  1  100  110   114 0.632  14.46
 3.00 Prom - 201657 201618   40                              -2.06

 4.03 PlyA - 203363 203358    6                               1.05
 4.02 Term - 208396 208240  157  0  1   60   52   171 0.802   8.11
 4.01 Init - 208794 208775   20  0  2   49   69    14 0.213  -4.91
 4.00 Prom - 213881 213842   40                              -3.36

 5.00 Prom + 214179 214218   40                              -3.26
 5.01 Init + 214267 214490  224  0  2   61  111   155 0.564  13.13
 5.02 Intr + 214836 214996  161  0  2  105   57   141 0.849  12.33
 5.03 Intr + 215832 215887   56  1  2   93   72    14 0.826  -1.00
 5.04 Intr + 216041 216198  158  1  2   57   94    48 0.881   1.11
 5.05 Intr + 216469 216556   88  1  1  100  105    18 0.969   4.67
 5.06 Intr + 216936 217090  155  2  2   66   77   104 0.946   5.97
 5.07 Intr + 217463 217579  117  2  0  112   94    37 0.909   6.28
 5.08 Intr + 217806 217926  121  0  1    4   94   140 0.729   6.70
 5.09 Intr + 218078 218211  134  1  2   78   91    84 0.999   7.14
 5.10 Intr + 218387 218649  263  2  2  113   90   318 0.995  31.63
 5.11 Intr + 219544 219602   59  2  2   90   77   -12 0.829  -3.50
 5.12 Intr + 219858 220046  189  2  0  108   89    63 0.965   8.18
 5.13 Intr + 220175 220288  114  1  0   79  100   104 0.991  11.34
 5.14 Intr + 221444 221605  162  1  0   75  110   175 0.998  18.57
 5.15 Intr + 221744 221922  179  1  2   60   94    64 0.919   2.92
 5.16 Intr + 222611 222683   73  2  1  146   77    45 0.997   8.61
 5.17 Intr + 225119 225187   69  1  0   91   99    64 0.992   7.18
 5.18 Intr + 225475 225648  174  0  0  118  103    78 0.999  12.44
 5.19 Intr + 226266 226421  156  2  0   86   50   143 0.551  10.51

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 129505 129628  124  0  1   66   87    51 0.884   3.13
S.002 Term + 130021 130106   86  2  2   34   45   193 0.997   7.42
S.003 Init - 157097 157065   33  2  0  100   93    65 0.833   8.07

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:140145977_140400584|GENSCAN_predicted_peptide_1|196_aa
MYGPLQKLEDCLDAVFPSAFSLWKGPHSYFQSLVTSAPLPPAGLATCPLPPAGPARNRQS
SASEIAAQGEEIQKPPADVPRDGVWRAPLRARRLLVRREPQRRGDPVGPPLCPTKLPQKL
ERKKLWVSRQRSGESLRIWKSPLLPGLPHPEPEDAENGLRTARGCFGGVRGYSEKQSCVS
LVPSLNSGVLQSQCPS

>gi568815593r:140145977_140400584|GENSCAN_predicted_CDS_1|588_bp
atgtatggtcctctgcaaaagctagaggactgcctggatgcagtattcccctctgcattc
agcttatggaagggtccccacagttacttccagtcactagttacttccgctccgctccca
cccgcgggactcgcgacctgccccctgcccccggctggcccagctcggaataggcagagc
agcgcctcagaaatcgcagcccaaggcgaagaaatccagaagcctcccgcggacgtcccc
cgggacggggtgtggcgagccccgctccgagcccgccgactcctcgtgagacgagaaccg
cagcgacgcggagaccctgtggggccacccctctgccccaccaagttgccccagaagctg
gagaggaaaaaactctgggtctccagacaacggtctggagaatccttgaggatctggaag
tccccgctgctgcctggcctgcctcacccagagcccgaggacgcagagaacggcctgcgc
accgcccgcggctgcttcggaggggtgagggggtacagcgagaaacagagctgcgtctcc
ctcgtcccctccttaaattccggcgtcctccagtcacagtgtcccagn

>gi568815593r:140145977_140400584|GENSCAN_predicted_peptide_2|126_aa
MMRGALYRSPMNQENPPPYPGPGPTAPYPPYPPQPMGPGPMGGPYPPPQGYPYQGYPQYG
WQGGPQEPPKTTALLGWDWYRHLQKQEGPNKAQLPHQGTRFIIYKVPAMQMKVSDARMGL
RIGAFT

>gi568815593r:140145977_140400584|GENSCAN_predicted_CDS_2|381_bp
atgatgaggggtgcactttacaggtccccgatgaaccaagagaaccctccaccatatcca
ggccctggtccaacggccccatacccaccttatccaccacaaccaatgggtccaggacct
atggggggaccctacccacctcctcaagggtacccctaccaaggatacccacagtacggc
tggcagggtggacctcaggagcctcctaaaaccacagctctcttgggttgggactggtac
agacacttgcaaaaacaagaaggacctaataaggctcaattaccacatcaaggcactcga
ttcatcatttacaaagtcccagccatgcagatgaaggtttcagatgccagaatggggctg
agaattggagccttcacctga

>gi568815593r:140145977_140400584|GENSCAN_predicted_peptide_3|449_aa
MKLLPSVVLKLFLAAVLSALVTGESLERLRRGLAAGTSNPDPPTVSTDQLLPLGGGRDRK
VRDLQEADLDLLRVTLSSKPQALATPNKEEHGKRKKKGKGLGKKRDPCLRKYKDFCIHGE
CKYVKELRAPSCIKDEEGRKQQEDRNPGLGECDICARSSGDLCGSCLTPDPGTTSQGVSR
LLCCHPGYHGERCHGLSLPVENRLYTYDHTTILAVVAVVLSSVCLLVIVGLLMFSILAIT
NQQGMCLHRYRTDRSEAHIKLAGKLLKATWEICALGYVWKIRRSKPDLTGQVVKTRNPKP
KYFRSQFSEKDKKPVQLPLRRRLQCTKMAAPVDLELKKAFTELQAKVIDTQQKVKLADIQ
IEQLNRTKKHAHLTDTEIMTLVDETNMYEGVGRMFILQSKEAIHSQLLEKQKIAEEKIKE
LEQKKSYLERSVKEAEDNIREMLMARRAQ

>gi568815593r:140145977_140400584|GENSCAN_predicted_CDS_3|1350_bp
atgaagctgctgccgtcggtggtgctgaagctctttctggctgcagttctctcggcactg
gtgactggcgagagcctggagcggcttcggagagggctagctgctggaaccagcaacccg
gaccctcccactgtatccacggaccagctgctacccctaggaggcggccgggaccggaaa
gtccgtgacttgcaagaggcagatctggaccttttgagagtcactttatcctccaagcca
caagcactggccacaccaaacaaggaggagcacgggaaaagaaagaagaaaggcaagggg
ctagggaagaagagggacccatgtcttcggaaatacaaggacttctgcatccatggagaa
tgcaaatatgtgaaggagctccgggctccctcctgcatcaaggacgaggagggcagaaag
cagcaagaagacagaaatcctggtttgggggaatgtgacatctgtgcacgttcatctggg
gatctttgtggctcttgtttgactccagacccaggaaccactagccagggtgtgtccagg
ctgctgtgctgccacccgggttaccatggagagaggtgtcatgggctgagcctcccagtg
gaaaatcgcttatatacctatgaccacacaaccatcctggccgtggtggctgtggtgctg
tcatctgtctgtctgctggtcatcgtggggcttctcatgtttagcatcttggctattact
aaccaacagggcatgtgtttacacagatacaggacagacagaagtgaagcacatattaag
ctagctggcaaactgcttaaagccacatgggaaatctgcgcgttgggctatgtctggaag
atacggaggagcaagccagacctgaccggtcaggtagtgaagacacggaatccgaaaccc
aaatatttcaggagtcaattctcagagaaagataagaagcccgtccaactcccactgcgc
aggcgcttacagtgcaccaagatggccgcccccgtggatctagagctgaagaaggccttc
acagagcttcaagccaaagttattgacactcaacagaaggtgaagctcgcagacatacag
attgaacagctaaacagaacgaaaaagcatgcacatcttacagatacagagatcatgact
ttggtagatgagactaacatgtatgaaggtgtaggaagaatgtttattcttcagtccaag
gaagcaattcacagtcagctgttagagaagcagaaaatagcagaagaaaaaattaaagaa
ctagaacagaaaaagtcctacctggagcgaagcgttaaggaagctgaggacaacatccgg
gagatgctgatggcacgaagggcccagtag

>gi568815593r:140145977_140400584|GENSCAN_predicted_peptide_4|58_aa
MRITNPRPPRSNQDLEWALIAGICFDSSQCPHSFTKVKTVEGKEDEVSETSSLQDETT

>gi568815593r:140145977_140400584|GENSCAN_predicted_CDS_4|177_bp
atgagaattaccaatccaaggccaccaagatccaaccaagacttggagtgggctctgata
gctggaatctgcttcgactcctcacagtgcccacactccttcacaaaggtgaagaccgtt
gaggggaaagaggatgaggtgtcagagacatctagtctgcaggatgagaccacctga

>gi568815593r:140145977_140400584|GENSCAN_predicted_peptide_5|884_aa
MKLPGQEGFEASSAPRNIPSGELDSNPDPGTGPSPDGPSDTESKELGVPKDPLLFIQLNE
LLGWPQALEWRETGRWVLFEEKLEVAAGRWSAPHVPTLALPSLQKLRSLLAEGLVLLDCP
AQSLLELVGSTHPRKASDNEEAPLREQCQNPLRQKLPPGAEAGTVLAGELGFLAQPLGAF
VRLRNPVVLGSLTEVSLPSRFFCLLLGPCMLGKGYHEMGRAAAVLLSDPQFQWSVRRASN
LHDLLAALDAFLEEVTVLPPGRWDPTARIPPPKCLPSQHKRLPSQQREIRGPAVPRLTSA
EDRHRHGPHAHSPELQRTGSDFLDALHLQCFSAVLYIYLATVTNAITFGGLLGDATDGAQ
GVLESFLGTAVAGAAFCLMAGQPLTILSSTGPVLVFERLLFSFSRDYSLDYLPFRLWVGI
WVATFCLVLVATEASVLVRYFTRFTEEGFCALISLIFIYDAVGKMLNLTHTYPIQKPGSS
AYGCLCQYPGPGGNESQWIRTRPKDRDDIVSMDLGLINASLLPPPECTRQGGHPRGPGCH
TVPDIAFFSLLLFLTSFFFAMALKCVKTSRFFPSVVRKGLSDFSSVLAILLGCGLDAFLG
LATPKLMVPREFKPTLPGRGWLVSPFGANPWWWSVAAALPALLLSILIFMDQQITAVILN
RMEYRLQKGAGFHLDLFCVAVLMLLTSALGLPWYVSATVISLAHMDSLRRESRACAPGER
PNFLGIREQRLTGLVVFILTGASIFLAPVLKFIPMPVLYGIFLYMGVAALSSIQFTNRVK
LLLMPAKHQPDLLLLRHVPLTRVHLFTAIQLACLGLLWIIKSTPAAIIFPLMLLGLVGVR
KALERVFSPQELLWLDELMPEEERSIPEKGLEPEHSFSGSDSED

>gi568815593r:140145977_140400584|GENSCAN_predicted_CDS_5|2652_bp
atgaagctgccaggccaggaagggtttgaagcctccagtgctcctagaaatattccttca
ggggagctggacagcaaccctgaccctggcaccggccccagccctgatggcccctcagac
acagagagcaaggaactgggagtacccaaagaccctctgctcttcattcagctgaatgag
ctgctgggctggccccaggcgctggagtggagagagacaggcaggtgggtactgtttgag
gagaagttggaggtggctgcaggccggtggagtgccccccacgtgcccaccctggcactg
cccagcctccagaagctccgcagcctgctggccgagggccttgtactgctggactgccca
gctcagagcctcctggagctcgtgggctctactcatccaagaaaggcttctgacaatgag
gaagcccccctgagggaacagtgtcagaaccccctgagacagaagctacctccaggagct
gaggcagggactgtgctggcaggggagctgggcttcctggcacagccactgggagccttt
gttcgactgcggaaccctgtggtactggggtcccttactgaggtgtccctcccaagcagg
tttttctgccttctcctgggcccctgtatgctgggaaagggctaccatgagatgggacgg
gcagcagctgtcctcctcagtgacccgcaattccagtggtcagttcgtcgggccagcaac
cttcatgaccttctggcagccctggatgcattcctagaggaggtgacagtgcttccccca
ggtcggtgggacccaacagcccggattcccccgcccaaatgtctgccatctcagcacaaa
aggcttccctcgcaacagcgggagatcagaggtcccgccgtcccgcgcctgacctcggct
gaggacaggcaccgccatgggccacacgcacacagcccggagttgcagcggaccggcagc
gatttcttggacgccctgcatctccagtgcttctcggccgtactctacatttacctggcc
actgtcactaatgccatcacttttgggggtctgctgggagatgccactgatggtgcccag
ggagtgctggaaagtttcctgggcacagcagtggctggagctgccttctgcctgatggca
ggccagcccctcaccattctgagcagcacggggccagtgctggtctttgagcgcctgctc
ttctctttcagcagagattacagcctggactacctgcccttccgcctatgggtgggcatc
tgggtggctaccttttgcctggtgctggtggccacagaggccagtgtgctggtgcgctac
ttcacccgcttcactgaggaaggtttctgtgccctcatcagcctcatcttcatctacgat
gctgtgggcaaaatgctgaacttgacccatacctatcctatccagaagcctgggtcctct
gcctacgggtgcctctgccaatacccaggcccaggaggaaatgagtctcaatggataagg
acaaggccaaaagacagagacgacattgtaagcatggacttaggcctgatcaatgcatcc
ttgctgccgccacctgagtgcacccggcagggaggccaccctcgtggccctggctgtcat
acagtcccagacattgccttcttctcccttctcctcttccttacttctttcttctttgct
atggccctcaagtgtgtaaagaccagccgcttcttcccctctgtggtgcgcaaagggctc
agcgacttctcctcagtcctggccatcctgctcggctgtggccttgatgctttcctgggc
ctagccacaccaaagctcatggtacccagagagttcaagcccacactccctgggcgtggc
tggctggtgtcaccttttggagccaacccctggtggtggagtgtggcagctgccctgcct
gccctgctgctgtctatcctcatcttcatggaccaacagatcacagcagtcatcctcaac
cgcatggaatacagactgcagaagggagctggcttccacctggacctcttctgtgtggct
gtgctgatgctactcacatcagcgcttggactgccttggtatgtctcagccactgtcatc
tccctggctcacatggacagtcttcggagagagagcagagcctgtgcccccggggagcgc
cccaacttcctgggtatcagggaacagaggctgacaggcctggtggtgttcatccttaca
ggagcctccatcttcctggcacctgtgctcaagttcattccaatgcctgtgctctatggc
atcttcctgtatatgggggtggcagcgctcagcagcattcagttcactaatagggtgaag
ctgttgttgatgccagcaaaacaccagccagacctgctactcttgcggcatgtgcctctg
accagggtccacctcttcacagccatccagcttgcctgtctggggctgctttggataatc
aagtctacccctgcagccatcatcttccccctcatgttgctgggccttgtgggggtccga
aaggccctggagagggtcttctcaccacaggaactcctctggctggatgagctgatgcca
gaggaggagagaagcatccctgagaaggggctggagccagaacactcattcagtggaagt
gacagtgaagat