GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:02:55 Sequence gi568815575r:111146580_111363936 : 217357 bp : 38.98% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1158 1311 154 2 1 110 94 112 0.922 12.72 1.02 Intr + 5831 5868 38 0 2 105 123 -2 0.802 2.06 1.03 Intr + 16336 16467 132 0 0 54 89 224 0.997 19.02 1.04 Intr + 16983 17148 166 2 1 64 78 182 0.794 13.41 1.05 Intr + 26439 26502 64 1 1 89 110 36 0.658 2.76 1.06 Intr + 35182 35294 113 1 2 90 70 60 0.457 3.50 1.07 Intr + 35589 35644 56 1 2 121 64 21 0.267 0.78 1.08 Intr + 45927 46039 113 2 2 75 97 58 0.856 3.66 1.09 Intr + 47722 47839 118 1 1 86 99 152 0.989 15.65 1.10 Intr + 49888 50061 174 0 0 71 110 123 0.707 12.01 1.11 Intr + 69842 69979 138 1 0 70 67 157 0.809 11.54 1.12 Term + 73779 73868 90 2 0 113 47 34 0.610 -1.46 1.13 PlyA + 73915 73920 6 1.05 2.04 PlyA - 74041 74036 6 1.05 2.03 Term - 76663 76533 131 2 2 97 42 60 0.268 -0.34 2.02 Intr - 84759 84655 105 1 0 51 106 38 0.065 1.17 2.01 Init - 96823 96601 223 1 1 66 39 174 0.172 9.06 2.00 Prom - 97067 97028 40 -6.65 3.13 PlyA - 97344 97339 6 1.05 3.12 Term - 100180 99998 183 1 0 144 35 216 0.989 18.46 3.11 Intr - 100925 100789 137 0 2 68 110 102 0.998 9.77 3.10 Intr - 101413 101292 122 0 2 91 69 163 0.999 13.92 3.09 Intr - 102192 101990 203 0 2 97 31 144 0.976 6.76 3.08 Intr - 102478 102356 123 1 0 101 93 148 0.999 16.46 3.07 Intr - 104524 104338 187 0 1 95 36 238 0.999 18.07 3.06 Intr - 104707 104630 78 0 0 81 73 123 0.996 7.85 3.05 Intr - 105163 104970 194 1 2 86 82 247 0.983 21.27 3.04 Intr - 105920 105728 193 1 1 56 68 219 0.635 15.27 3.03 Intr - 106637 106429 209 2 2 91 78 111 0.999 7.35 3.02 Intr - 107824 107693 132 1 0 83 62 80 0.965 4.82 3.01 Init - 117357 117193 165 0 0 61 93 135 0.973 10.98 3.00 Prom - 121595 121556 40 -5.35 4.04 PlyA - 121896 121891 6 1.05 4.03 Term - 123861 123699 163 2 1 13 33 272 0.440 10.63 4.02 Intr - 124374 124202 173 0 2 70 69 78 0.287 1.92 4.01 Init - 126512 126426 87 2 0 70 86 53 0.605 3.99 4.00 Prom - 136807 136768 40 -3.75 5.08 PlyA - 137077 137072 6 1.05 5.07 Term - 150782 150647 136 1 1 58 47 119 0.036 1.31 5.06 Intr - 155164 155051 114 0 0 111 42 68 0.028 3.14 5.05 Intr - 165840 165696 145 1 1 77 26 62 0.001 -2.68 5.04 Intr - 184462 184325 138 2 0 95 113 181 0.956 20.81 5.03 Intr - 186574 186472 103 1 1 97 103 129 0.999 14.13 5.02 Intr - 193873 193728 146 2 2 74 53 103 0.005 4.48 5.01 Init - 205887 205749 139 0 1 62 34 111 0.606 3.45 5.00 Prom - 208480 208441 40 -3.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:111146580_111363936|GENSCAN_predicted_peptide_1|451_aa GIPEQWARLLQTSNITKLEQKKNPQAVLDVLKFYDSKETVNNQKYMSFTSGDKSAHGYIA AHPSSTKTASEPPLAPPVSEEEDEEEEEEEDENEPPPVIAPRPEHTKSIYTRSVVESIAS PAVPNKEVTPPSAENANSSTLYRNTDRQRKKSKMTDEEILEKLRSIVSVGDPKKKYTRFE KIGQGKLWARVVSSVLIEVILWVGPPTPKVFASSSDLFWATQGLLNPMAHTVKQLLRFNV GVAIKQMNLQQQPKKELIINEILVMRENKNPNIVNYLDSYLVGDELWVVMEYLAGGSLTD VVTETCMDEGQIAAVCREITPEQSKRSTMVGTPYWMAPEVVTRKAYGPKVDIWSLGIMAI EMVEGEPPYLNENPLRALYLIATNGTPELQNPERLSAVFRDFLNRCLEMDVDRRGSAKEL LQHPFLKLAKPLSSLTPLIIAAKEAIKNSSR >gi568815575r:111146580_111363936|GENSCAN_predicted_CDS_1|1356_bp ggaattccagagcaatgggcacgattactccaaacttccaacataacaaaattggaacag aagaagaacccacaagctgttctagatgttctcaaattctatgattccaaagaaacagtc aacaaccagaaatacatgagctttacatcaggagataaaagtgcacatggatacatagca gcccatccttcgagtacaaaaacagcatctgagcctccattggcccctcctgtgtctgaa gaagaagatgaagaggaagaagaagaagaagatgaaaatgagccaccaccagttatcgca ccaagaccagagcatacaaaatcaatctatactcgttctgtggttgaatccattgcttca ccagcagtaccaaataaagaggtcacaccaccctctgctgaaaatgccaattccagtact ttgtacaggaacacagatcggcaaagaaaaaaatccaagatgacagatgaggagatctta gagaagctaagaagcattgtgagtgttggggacccaaagaaaaaatacacaagatttgaa aaaattggtcaagggaaactatgggcaagagtggtgtctagtgtgcttatagaagtgatt ctttgggtaggaccacccaccccaaaagtatttgcttcttcatcggatctattttgggcc acacaaggcctcttgaatccaatggcacacactgtgaaacagctgcttagatttaatgtt ggagtggccataaagcagatgaaccttcaacagcaacccaagaaggaattaattattaat gaaattctggtcatgagggaaaataagaaccctaatattgttaattatttagatagctac ttggtgggtgatgaactatgggtagtcatggaatacttggctggtggctctctgactgat gtggtcacagagacctgtatggatgaaggacagatagcagctgtctgcagagagatcact cctgagcaaagtaaacgaagcactatggtgggaaccccatattggatggcacctgaggtg gtgactcgaaaagcttatggtccgaaagttgatatctggtctcttggaattatggcaatt gaaatggtggaaggtgaacccccttaccttaatgaaaatccactcagggcattgtatctg atagccactaatggaactccagagctccagaatcctgagagactgtcagctgtattccgt gactttttaaatcgctgtcttgagatggatgtggataggcgaggatctgccaaggagctt ttgcagcatccatttttaaaattagccaagcctctctccagcctgactcctctgattatc gctgcaaaggaagcaattaagaacagcagccgctaa >gi568815575r:111146580_111363936|GENSCAN_predicted_peptide_2|152_aa MKTLEENLGNTIQGIGMGKDFMTKTPKAMATKAKIDKRDLIKLNSVFTAKETIISMNRQP TELEKFFAIYLSDKGRKMSHPITVGHSVYSSQQEIKEKSPGHHPYVCQQRHFGINIERCA LVISGHTSNIKDLDRLTCVKMSLKIQGKRINC >gi568815575r:111146580_111363936|GENSCAN_predicted_CDS_2|459_bp atgaaaaccctagaagaaaacctaggcaataccattcagggcataggcatgggcaaagat ttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaacgggatcta attaaactaaacagcgtcttcacagcaaaagaaactatcatcagcatgaacaggcaacct acagaattggagaaattttttgcaatctatctatctgacaaaggaagaaaaatgagtcac cctatcacagtgggccacagtgtatactctagtcaacaggaaattaaagaaaagtcacca ggccaccatccatatgtttgccagcaaagacattttggaatcaacattgaaagatgtgcc ctcgttatctcagggcacactagcaacatcaaagatttggatagactcacctgtgtcaag atgagcttaaaaatccagggaaagagaataaactgttaa >gi568815575r:111146580_111363936|GENSCAN_predicted_peptide_3|641_aa MGPPLKLFKNQKYQELKQECIKDSRLFCDPTFLPENDSLFYNRLLPGKVVWKRPQDICDD PHLIVGNISNHQLTQGRLGHKPMVSAFSCLAVQESHWTKTIPNHKEQEWDPQKTEKYAGI FHFRFWHFGEWTEVVIDDLLPTINGDLVFSFSTSMNEFWNALLEKAYAKLLGCYEALDGL TITDIIVDFTGTLAETVDMQKGRYTELVEEKYKLFGELYKTFTKGGLICCSIESPNQEEQ EVETDWGLLKGHTYTMTDIRKIRLGERLVEVFSAEKVYMVRLRNPLGRQEWSGPWSEISE EWQQLTASDRKNLGLVMSDDGEFWMSLEDFCRNFHKLNVCRNVNNPIFGRKELESVLGCW TVDDDPLMNRSGGCYNNRDTFLQNPQYIFTVPEDGHKVIMSLQQKDLRTYRRMGRPDNYI IGFELFKVEMNRKFRLHHLYIQERAGTSTYIDTRTVFLSKYLKKGNYVLVPTMFQHGRTS EFLLRIFSEVPVQLRELTLDMPKMSCWNLARGYPKVVTQITVHSAEDLEKKYANETVNPY LVIKCGKEEVRSPVQKNTVHAIFDTQAIFYRRTTDIPIIVQVWNSRKFCDQFLGQVTLDA DPSDCRDLKSLYLRKKGGPTAKVKQGHISFKVISSDDLTEL >gi568815575r:111146580_111363936|GENSCAN_predicted_CDS_3|1926_bp atgggtcctcctctgaagctcttcaaaaaccagaaataccaggaactgaagcaggaatgc atcaaagacagcagacttttctgtgatccaacatttctgcctgagaatgattctcttttc tacaaccgactgcttcctggaaaggtggtgtggaaacgtccccaggacatctgtgatgac ccccatctgattgtgggcaacattagcaaccaccagctgacccaagggagactggggcac aagccaatggtttctgcattttcctgtttggctgttcaggagtctcattggacaaagaca attcccaaccataaggaacaggaatgggaccctcaaaaaacagaaaaatacgctgggata tttcactttcgtttctggcattttggagaatggactgaagtggtgattgatgacttgttg cccaccattaacggagatctggtcttctctttctccacttccatgaatgagttttggaat gctctgctggaaaaagcttatgcaaagctgctaggctgttatgaggccctggatggtttg accatcactgatattattgtggacttcacgggcacattggctgaaactgttgacatgcag aaaggaagatacactgagcttgttgaggagaagtacaagctattcggagaactgtacaaa acatttaccaaaggtggtctgatctgctgttccattgagtctcccaatcaggaggagcaa gaagttgaaactgattggggtctgctgaagggccatacctataccatgactgatattcgc aaaattcgtcttggagagagacttgtggaagtcttcagtgctgagaaggtgtatatggtt cgcctgagaaaccccttgggaagacaggaatggagtggcccctggagtgaaatttctgaa gagtggcagcaactgactgcatcagatcgcaagaacctggggcttgttatgtctgatgat ggagagttttggatgagcttggaggacttttgccgcaactttcacaaactgaatgtctgc cgcaatgtgaacaaccctatttttggccgaaaggagctggaatcggtgttgggatgctgg actgtggatgatgatcccctgatgaaccgctcaggaggctgctataacaaccgtgatacc ttcctgcagaatccccagtacatcttcactgtgcctgaggatgggcacaaggtcattatg tcactgcagcagaaggacctgcgcacttaccgccgaatgggaagacctgacaattacatc attggctttgagctcttcaaggtggagatgaaccgcaaattccgcctccaccacctctac atccaggagcgtgctgggacttccacctatattgacacccgcacagtgtttctgagcaag tacctgaagaagggcaactatgtgcttgtcccaaccatgttccagcatggtcgcaccagc gagtttctcctgagaatcttctctgaagtgcctgtccagctcagggaactgactctggac atgcccaaaatgtcctgctggaacctggctcgtggctacccgaaagtagttactcagatc actgttcacagtgctgaggacctggagaagaagtatgccaatgaaactgtaaacccatat ttggtcatcaaatgtggaaaggaggaagtccgttctcctgtccagaagaatacagttcat gccatttttgacacccaggccattttctacagaaggaccactgacattcctattatagta caggtctggaacagccgaaaattctgtgatcagttcttggggcaggttactctggatgct gaccccagcgactgccgtgatctgaagtctctgtacctgcgtaagaagggtggtccaact gccaaagtcaagcaaggccacatcagcttcaaggttatttccagcgatgatctcactgag ctctaa >gi568815575r:111146580_111363936|GENSCAN_predicted_peptide_4|140_aa MAFILGKSVECTVRQIVCEALERERKSLGSLPEVLITITDLPTRAMSDQGHVCSLAHINF ANTSPFFSACPLSSNVHVCDINPGTENSSGNGSSSSSSSSSSSRAPGITQVSREGIRKLT LEFYFLDNLASKKDEETSTL >gi568815575r:111146580_111363936|GENSCAN_predicted_CDS_4|423_bp atggcatttatccttggcaaaagtgtagaatgcactgttaggcagatcgtttgtgaggct ttagaaagggagcggaaatcactagggtcacttcctgaagtcctcatcactatcacagat ctgcctaccagggccatgtctgaccagggccatgtctgcagcctagcccatattaacttt gccaacacttcaccatttttctctgcatgtcctctctctagtaatgtgcatgtctgtgat attaaccctgggacagaaaacagcagcggcaacggcagcagcagcagcagcagcagcagc agcagcagcagggctcctgggataactcaggtgagtagagagggaattcgcaaacttacc ctggagttttatttcctggataacttagcgtctaagaaagatgaagaaacttcaactttg tag >gi568815575r:111146580_111363936|GENSCAN_predicted_peptide_5|306_aa MNNSNSSNTLGNEINSALAQKPAEMELVLGVVSLGFIGGSNKAIVRELPGRKAKSAGLQR LQTPLLLQVQAHGDQSSAPEPLTRVNKVPAGNPHPVTCLHDFFGDDDVFIACGPEKFRYA QDDFSLDENECRVMKGNPSATAGPKASPTPQKTSAKSPGPMRRSKSPADSGNDQDALLPK GFQKVSKANEVTKPCFGTWVEHLSPQFRDIAKGVFRALYVTRHQTCTCLCPWMTRTRLVI PCKGGESAQSPEYKSKPIIVVGTREQDAYIPKMSSRTLMGMIPKITPPQKRLCQETSPDR NTGTVV >gi568815575r:111146580_111363936|GENSCAN_predicted_CDS_5|921_bp atgaacaatagtaacagcagcaacaccctgggaaatgagataaattctgccttagcccag aagcctgctgagatggagctggttcttggagtggtcagcctgggattcataggtggaagc aataaagccatcgtcagagaactaccaggaagaaaggctaagtctgctggtctacagaga ctgcagacccccttactcctacaggttcaggcccacggagatcagagttctgcccctgag cctctgactagagttaataaagttcctgcagggaatccccacccagtaacttgtctccat gatttctttggtgatgatgatgtgtttattgcctgtggtcctgaaaaatttcgctatgct caggatgatttttctctggatgaaaatgaatgccgagtcatgaagggaaacccatcagcc acagctggcccaaaggcatccccaacacctcagaagacttcagccaagagccctggtcct atgcgccgaagcaagtctccagctgactcaggtaacgaccaagacgcacttcttccaaag ggtttccaaaaggtctccaaagctaatgaggtaaccaaaccatgctttgggacatgggtt gaacacttgtctccccagttcagagacatagcaaagggggtgttcagggcactttatgtc accaggcaccagacctgtacctgcctctgtccttggatgactcggactcgcttggtgatt ccatgtaaaggaggggagagtgctcagagtccagagtacaaatccaagcctatcattgta gtagggacaagagaacaggatgcctatatccccaaaatgagctccaggacactgatggga atgatcccaaagatcaccccacctcagaaacgtctgtgccaagagacttccccagataga aacactgggacagtggtttga