GENSCAN 1.0    Date run: 7-Nov-116    Time: 04:46:51

Sequence gi568815596r:121124344_121385109 : 260766 bp : 47.43% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   1037   1294  258  1  0  -86   55   551 0.978  29.95
 1.02 PlyA +   2976   2981    6                               1.05

 2.22 PlyA -   3971   3966    6                               1.05
 2.21 Term -   7972   7889   84  1  0  116   47    77 0.566   4.05
 2.20 Intr -  16381  16282  100  0  1   42   84    85 0.339   3.61
 2.19 Intr -  29376  29233  144  2  0   66   81   107 0.950   7.20
 2.18 Intr -  32105  31965  141  1  0  124   44    59 0.286   4.57
 2.17 Intr -  71606  71582   25  0  1  100   99    -4 0.074  -1.02
 2.16 Intr -  77910  77836   75  1  0   65  115    26 0.462   2.49
 2.15 Intr -  78299  78195  105  0  0   98   47    49 0.430   1.99
 2.14 Intr -  84712  84550  163  1  1   76   94    65 0.227   5.45
 2.13 Intr - 107625 107483  143  1  2   73   78   253 0.659  22.87
 2.12 Intr - 109851 109748  104  2  2   88  105   142 0.999  15.72
 2.11 Intr - 110968 110878   91  2  1   97   94   150 0.999  15.55
 2.10 Intr - 113373 113280   94  0  1   87   99    98 0.998  10.34
 2.09 Intr - 113507 113459   49  1  1  128  105    50 0.999   9.38
 2.08 Intr - 115306 115215   92  1  2  114   83   113 0.999  12.19
 2.07 Intr - 118126 118016  111  1  0   68   83   163 0.899  14.38
 2.06 Intr - 122627 122475  153  2  0  131   97   245 0.989  29.97
 2.05 Intr - 123927 123821  107  1  2   78   80    26 0.999   0.73
 2.04 Intr - 124744 124639  106  1  1   93  109   197 0.999  22.09
 2.03 Intr - 125304 125228   77  1  2   47  121    74 0.845   5.83
 2.02 Intr - 156928 156777  152  0  2  127   67   178 0.695  19.41
 2.01 Init - 160883 160705  179  2  2   28   91   282 0.995  19.23
 2.00 Prom - 164286 164247   40                              -6.56

 3.04 PlyA - 165674 165669    6                               1.05
 3.03 Term - 168746 168615  132  2  0  123   43    28 0.195  -0.11
 3.02 Intr - 172308 172222   87  0  0   97   53    34 0.484   0.97
 3.01 Init - 174538 174248  291  1  0   53   32   202 0.677   8.25
 3.00 Prom - 200271 200232   40                              -3.36

 4.09 PlyA - 200908 200903    6                               1.05
 4.08 Term - 216604 216518   87  1  0  123   39    64 0.558   2.66
 4.07 Intr - 222811 222695  117  1  0  101   95    74 0.995  10.06
 4.06 Intr - 224375 224169  207  2  0   36   96   306 0.927  25.47
 4.05 Intr - 228108 227776  333  0  0   36   69   135 0.527   2.17
 4.04 Intr - 238957 238829  129  1  0   94  105   137 0.998  16.89
 4.03 Intr - 240941 240751  191  0  2   65   49   332 0.999  26.30
 4.02 Intr - 243488 243245  244  2  1   28   24   283 0.430  12.97
 4.01 Init - 253305 253156  150  0  0   45   87   181 0.864  13.74

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:121124344_121385109|GENSCAN_predicted_peptide_1|85_aa
RKKKKEKEKKEKEKEKKKEDEEEKGKGKEKRKRKKEKEKKEGGEEEEEEEEEEEEEEEEE
EGEEEEEEEEKEEEEEEEEEEGGGG

>gi568815596r:121124344_121385109|GENSCAN_predicted_CDS_1|258_bp
aggaagaagaagaaggagaaggagaaaaaggagaaggagaaggagaaaaaaaaggaggac
gaggaggagaaggggaaggggaaggagaagaggaagaggaagaaggagaaggagaagaag
gaaggaggagaagaggaagaggaagaggaggaggaggaagaagaggaagaggaagaagaa
gaaggagaagaggaagaggaagaggaggagaaggaagaagaggaagaggaagaagaagaa
gaaggaggaggaggatga

>gi568815596r:121124344_121385109|GENSCAN_predicted_peptide_2|764_aa
MRRAGRHRGFGARSRRWVLGAPRVCACCRAPVGALGVPAMLFWHTQPEHYNQHNSGSYLR
DVLALPIFKQEEPQLSPENEARLPPLQYVLCAATSPAVKLHEETLTYLNQGQSYEIRLLE
NRKLGDFQDLNTKYVKSIIRVVFHDRRLQYTEHQQLEGWRWSRPGDRILDIDIPLSVGIL
DPRASPTQLNAVEFLWDPAKRASAFIQVHCISTEFTPRKHGGEKGVPFRVQIDTFKQNEN
GEYTEHLHSASCQIKVFKPKGADRKQKTDREKMEKRTAQEKEKYQPSYETTILTECSPWP
DVAYQVNSAPSPSYNGSPNSFGLGEGNASPTHPVEALPVGSDHLLPSASIQDAQQWLHRN
RFSQFCRLFASFSGADLLKMSRDDLVQICGPADGIRLFNAIKGRNVRPKMTIYVCQELEQ
NRVPLQQKRDGSGDSNLSVYHAIFLEELTTLELIEKIANLYSISPQHIHRVYRQGPTGIH
VVVSNELQLSPLLALLLPSGMWALGKEACPRVPADAALPLGCRGLLGSRVLPQYSGHLLS
DASNATLTVLPSGFHSSVLSFLPQGTLLTSSEPKWENPHDGFARWVLLHSFSDEEAEEKE
VFSSVKWNWLFSTREPFPFSWGWVKPELGSSGPRALHRDMMMIMMHEDDPKTVVTWTPSL
TSSNATCVNTVPLPSCPCAFEWTITGQSNDPRADLHKELAQSFWPWDKIWAGLLNDDRGV
AKSLIVVPFVSQPATRHLINAIQQGLLLFPTTDGIAACWELPPI

>gi568815596r:121124344_121385109|GENSCAN_predicted_CDS_2|2295_bp
atgcgccgagccggccggcaccgcgggttcggagcgcgaagccgccgctgggtcctcggc
gcgccccgcgtctgcgcttgctgccgcgccccggtcggcgcgctgggagttccagccatg
ctcttctggcacacgcagcccgagcactacaaccagcacaactccggcagctacctgcgt
gatgtgctcgctctgcccatcttcaagcaggaggaaccccagctgtcccccgagaacgag
gcccgcctgccacccctgcaatatgtgttgtgtgctgccacgtccccagccgtgaagctg
catgaagagacgctgacctacctcaaccaaggtcagtcttatgaaatccgactactggag
aatcggaagctgggagactttcaagatctgaacacaaaatatgtcaagagcatcatccgt
gtggtcttccatgaccgccggctgcagtatacggagcaccagcagctggagggctggcgg
tggagtcggccaggggaccggatcctggacatcgatattccactgtctgttggtatcttg
gaccccagggccagcccgacccagctgaatgcagtcgagtttttgtgggaccctgcgaag
agagcttctgcattcattcaggtacactgcatcagcacagaattcacccccaggaagcac
gggggcgagaagggagtgccctttcgagtccagattgacacgtttaagcagaacgagaat
ggggagtacacggagcacctgcactcagccagctgccagatcaaggtgttcaagccgaag
ggagccgatcggaaacagaagactgaccgggagaagatggagaaaagaactgcccaagag
aaggagaaataccagccgtcctatgaaaccaccatcctcacagagtgctctccatggccc
gacgtggcctaccaggtgaacagcgccccgtccccaagctacaatggttctccaaacagc
tttggcctcggcgaaggcaacgcctctccgacccacccggtggaggccctgcccgtgggc
agtgaccacctgctcccatcagcttcgatccaggatgcccagcagtggcttcaccgcaac
aggttctcgcagttctgccggctctttgccagcttctcaggtgctgacttgctgaagatg
tcccgagatgatttggtccagatctgtggtcccgcagatgggatccggctcttcaacgcc
atcaaaggccggaatgtgaggccaaagatgaccatttatgtctgtcaggagctggagcag
aatcgagtgcccctgcagcagaagcgggacggcagtggagacagcaacctgtctgtgtac
cacgccatcttcctggaagagctgaccaccttggagctgattgagaagatcgccaacctg
tacagcatctccccccagcacatccaccgagtctaccggcagggccccacgggcatccat
gtggtggtgagcaacgagctgcaactgtcccctctgctggccctgctgctgccctctgga
atgtgggccctaggaaaggaagcctgtccccgggtacctgcagatgcagctcttcctctg
ggctgccgtgggctgctgggctccagggttctgccccaatactcaggccacctcctctca
gatgcatccaatgccaccttgactgtgctgccttctggctttcattcttcagttctctcg
ttcctgcctcagggtaccctgctgacctcctccgagcccaagtgggaaaaccctcatgat
ggctttgcaaggtgggtgttattacactcattttcagatgaggaagcagaggagaaggaa
gtgttctcatctgtaaaatggaactggctctttagtacaagagagcctttcccattcagc
tggggctgggtcaagcctgaacttgggtcctcaggtcctcgtgcactacatagagacatg
atgatgatcatgatgcatgaagatgatcccaaaacagtagtgacatggaccccctcactg
acctcctccaatgccacctgtgtcaacaccgtccctttgccctcatgcccctgtgccttt
gagtggacaataaccggacagtccaacgaccccagagctgatcttcacaaggaattggcc
cagtctttctggccatgggacaaaatctgggctggcctgctgaatgatgacagaggagtg
gccaagtcactcattgttgtcccatttgtcagccagccagccaccagacatctcatcaac
gccatccagcagggtctccttctcttccccacgacggatggcattgccgcttgctgggag
ctccctccaatttaa

>gi568815596r:121124344_121385109|GENSCAN_predicted_peptide_3|169_aa
MLLTVGDASAIWQFGNVVDIACTHMKSLPKTKRASALKECRGGISGLEGNPGPTVKHSLQ
KCCNANAHQGPAHRLWCQTSNTGDSEYKRDPEDSAINAQKCLLLLSGLFSLPGPAVILEQ
SCGCAQDWKFPEASVDAKKKMLCFLYSLQNHEPIKPLFFINYLISGISL

>gi568815596r:121124344_121385109|GENSCAN_predicted_CDS_3|510_bp
atgcttctgacagttggtgatgcctctgccatttggcagtttggtaatgtggttgacatt
gcctgcacacacatgaaatctctgcctaagaccaagagagcatctgcattgaaagagtgc
agagggggcatcagcggcttggaaggaaatcctgggccaacggtgaagcactctcttcaa
aaatgctgcaacgccaatgctcatcagggcccagcgcaccgcctatggtgccaaaccagc
aacacaggcgattctgagtataaaagggatccagaagactctgcaataaatgctcagaag
tgcctgctcctgctctcaggcctcttctcactcccggggcctgctgtgattctggagcaa
agttgtggctgtgcccaggattggaagtttcctgaggcctcagtggacgccaagaagaag
atgttatgcttcctgtacagcctgcagaaccatgagccaattaaacctcttttctttata
aattacctaatctcaggtatttctttatag

>gi568815596r:121124344_121385109|GENSCAN_predicted_peptide_4|485_aa
MLDYDTENLNSEEIYSSLRGVTEAIEKFSFRSQEDLNEPIKRDGKKECDIVSRDGGAASP
ATEGRGGSEVEGGRTALDNKTSLLNTQPPRAFPGPRARDYNPYPYSDAINTYDKTALKEA
VFDDDMEQLRDVPIDHSDLVADLLKELSNHNERVEERKGALLELLKITREDSLGVWEEHF
KTILLLLLETLGDKDHSIRALALRVLREILRNQPARFKNYAELTIMKTLEAHKDSHKEVT
CVERATFGRSLNWQEKEITVGFLQIGSFLYIGGMHGVNAVHVESDALAGKGKACSLRCPD
EAEAGQAGCAWPRALELEEGPQNTKEAFCTYPCSFLLPSNPGHPQEVHQVVRAAEEAAST
LASSIHPEQCIKVLCPIIQTADYPINLAAIKMQTKVVERIAKESLLQLLVDIIPGLLQGY
DNTESSVRKASVFCLVAIYSVIGEDLKPHLAQLTGSKMKLLNLYIKRAQTTNSNSSSSSD
VSTHS

>gi568815596r:121124344_121385109|GENSCAN_predicted_CDS_4|1458_bp
atgctggactatgatacagagaacctgaactctgaagaaatctatagttctctacgtgga
gttacagaagccattgaaaagtttagttttcgaagccaagaagatctgaatgagccaatt
aaacgagatggcaaaaaggagtgtgatattgtgtcccgcgatgggggcgctgcctcccct
gccactgagggccgggggggtagtgaagtagaaggaggccggacagctctggataacaag
acctcactactcaacacccagcctccgcgcgccttcccggggccgcgggcgcgagactac
aacccgtacccctactcagatgccatcaacacctacgacaagaccgccctgaaagaggct
gtgttcgatgacgacatggagcagcttcgagacgtgcccatcgaccattctgacctggtg
gctgaccttctgaaagagctgtccaaccacaatgagcgagtggaggaacggaagggagcc
ctgctggagctgctcaagatcacgcgggaagacagccttggtgtctgggaggagcacttc
aagaccattctgctcctgctgctggagacccttggagacaaagaccattcaattcgagca
ctggcgttaagagttttgagggaaattctgagaaatcaaccagcaagatttaaaaactac
gccgagctgacgattatgaagactctggaagcccacaaagactcccataaggaggtgacc
tgtgtggaaagagctacctttggcaggagtttgaattggcaagaaaaagaaatcacagtg
ggcttccttcaaataggctccttcctgtacattggtggcatgcatggagtcaatgccgtc
catgttgagtccgacgctttggctggcaaggggaaggcctgctcactgaggtgcccggac
gaggcagaggcagggcaggccggctgtgcatggccgagggcccttgagctggaggaaggc
ccgcagaacaccaaagaagccttctgcacatatccatgctccttcctcctcccatctaat
cctggccacccacaagaagtgcatcaggtggtgagagcggctgaggaggctgcgtccaca
ctggccagttccatccacccggagcagtgcatcaaggtgctctgccccatcatccagacg
gccgactaccccatcaaccttgctgccatcaagatgcagaccaaagtcgtcgagaggatc
gcaaaggagtcattgctgcagctccttgtcgacatcatcccaggcttgctgcagggttat
gacaacaccgaaagtagtgtgcgtaaggccagcgtgttttgcttagtggcaatttattcc
gtaatcggagaagacctgaaacctcaccttgcacagctcacagggagcaagatgaagcta
ctaaacttatacataaagagggcccagaccaccaacagcaacagcagctcctcctccgat
gtctccacgcacagctaa