GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:59:22 Sequence gi568815575f:14144268_14344887 : 200620 bp : 37.77% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 709 704 6 1.05 1.02 Term - 2519 2415 105 2 0 104 48 111 0.885 6.13 1.01 Init - 11804 11799 6 2 0 92 103 0 0.765 2.87 1.00 Prom - 12299 12260 40 -5.65 2.00 Prom + 12990 13029 40 -3.65 2.01 Init + 15556 15561 6 0 0 76 84 0 0.174 -0.50 2.02 Term + 29678 29788 111 1 0 119 42 122 0.498 8.28 2.03 PlyA + 29948 29953 6 1.05 3.03 PlyA - 30325 30320 6 1.05 3.02 Term - 49003 48840 164 2 2 83 46 85 0.518 0.92 3.01 Init - 51526 51487 40 1 1 83 94 33 0.544 3.90 3.00 Prom - 54240 54201 40 -3.65 4.02 PlyA - 54645 54640 6 1.05 4.01 Sngl - 59458 58694 765 1 0 62 48 317 0.413 20.94 4.00 Prom - 60132 60093 40 -9.35 5.02 PlyA - 60298 60293 6 1.05 5.01 Sngl - 61271 60975 297 2 0 74 43 304 0.705 20.00 5.00 Prom - 62926 62887 40 -4.35 6.04 PlyA - 63023 63018 6 1.05 6.03 Term - 64104 63916 189 0 0 36 49 82 0.045 -4.43 6.02 Intr - 70877 70739 139 1 1 76 84 127 0.643 10.65 6.01 Init - 77551 77499 53 1 2 59 101 32 0.533 2.28 6.00 Prom - 88512 88473 40 -5.85 7.03 PlyA - 89242 89237 6 1.05 7.02 Term - 94294 94174 121 0 1 98 44 130 0.917 6.57 7.01 Init - 94996 94863 134 1 2 1 116 108 0.900 4.56 7.00 Prom - 96334 96295 40 -7.05 8.00 Prom + 98095 98134 40 -7.55 8.01 Sngl + 100001 100624 624 1 0 77 31 468 0.791 35.94 8.02 PlyA + 101140 101145 6 1.05 9.02 PlyA - 102582 102577 6 1.05 9.01 Sngl - 116064 115726 339 0 0 71 44 223 0.893 11.98 9.00 Prom - 116118 116079 40 -5.65 10.04 PlyA - 118162 118157 6 1.05 10.03 Term - 119699 118996 704 0 2 33 37 268 0.103 8.90 10.02 Intr - 124953 124867 87 1 0 88 78 61 0.537 4.12 10.01 Init - 139909 139867 43 1 1 57 116 28 0.458 3.13 10.00 Prom - 143293 143254 40 -3.55 11.05 PlyA - 143589 143584 6 1.05 11.04 Term - 144374 143683 692 0 2 67 47 251 0.123 11.66 11.03 Intr - 186403 186292 112 1 1 -10 105 129 0.015 3.83 11.02 Intr - 188617 188534 84 1 0 77 27 118 0.017 3.70 11.01 Init - 191379 191203 177 0 0 83 9 145 0.035 5.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 188617 188489 129 1 0 77 43 153 0.846 6.90 S.002 Init - 192348 192051 298 0 1 61 87 167 0.840 11.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:14144268_14344887|GENSCAN_predicted_peptide_1|36_aa MHETFTVLPEGAAPIVTTWDVIARSHPKSSSMSSTF >gi568815575f:14144268_14344887|GENSCAN_predicted_CDS_1|111_bp atgcatgagaccttcacagtgctgccagagggagcagctccaattgtgaccacctgggat gttatcgccaggtctcatcctaaaagcagctctatgtcaagcacattttga >gi568815575f:14144268_14344887|GENSCAN_predicted_peptide_2|38_aa MGAEMPHPMAAPSQAHEEYCQAPRDVPLRPKSSSVSWW >gi568815575f:14144268_14344887|GENSCAN_predicted_CDS_2|117_bp atgggggcagagatgcctcaccccatggctgcaccatcacaggcccatgaggagtattgc caggctccccgtgatgttcccttaaggcccaagtcctcttcagtcagctggtggtga >gi568815575f:14144268_14344887|GENSCAN_predicted_peptide_3|67_aa MAEGEAGAYYMAAGVTRYLIQEHSYWHQVGALLEQSFQKKGQAAILAVLCSAASTGDTFR CQRNPAK >gi568815575f:14144268_14344887|GENSCAN_predicted_CDS_3|204_bp atggcagaaggagaagcaggcgcatattacatggcagcaggggtcaccagataccttata caggagcactcctattggcatcaggttggtgcccttcttgaacagagcttccagaaaaag gggcaggcagccatcttagctgttctgtgttctgcagcctccactggtgacaccttcagg tgccagaggaacccagctaaatag >gi568815575f:14144268_14344887|GENSCAN_predicted_peptide_4|254_aa MQEISKIRAELKEIESQKTLQKINESRSWFFEKINKIDGLLARLIKKKREKNQIDTIKND KGDITTDPTEIQTTIREYYKHLYTNKLENLEEMDKFLDTYAFPRLNQEEVESLNRPITAS DTEAIINSLPTKKSLGPDGFTVEFYQSYKEELVPFLLKLFQSIEKEGILPKSFYEASIIL IPKPGRDTTKKENFRLISLMNIDAKILNKLLANQIQQHIKKLIHHEQVGFIPGMQGWFNI HKSINIIHHINRTQ >gi568815575f:14144268_14344887|GENSCAN_predicted_CDS_4|765_bp atgcaagaaataagtaagatcagagcagaactgaaggagatagagtcacaaaaaaccctt caaaaaatcaatgaatccaggagctggttttttgaaaagatcaacaaaattgatggactg ctagcaagactaataaagaagaaaagagagaagaatcaaatagacacaataaaaaatgat aaaggggatatcaccaccgatcccacagaaatacaaactaccatcagagaatactataaa cacctctacacaaataaactagaaaatctagaagaaatggataaattcctggacacatac gctttcccaagactcaaccaggaagaagttgaatccctgaatagaccaataacagcctct gacactgaggcaataattaatagtttaccaaccaaaaaaagtctaggaccagatggattc acagtcgaattctaccagagttacaaagaggagctggttccattccttctgaaactattc caatcaatagaaaaagagggaatcctccctaagtcattttatgaggccagcatcatcctg ataccaaagcctggcagagacacaacaaaaaaagagaattttagactaatatccctgatg aacattgatgcaaaaatcctcaataaattactggcaaaccaaatccagcagcacatcaaa aagcttatccaccacgaacaagttggcttcatccctgggatgcaaggctggttcaatata cacaaatcaataaacataatccatcatataaacagaacccaatga >gi568815575f:14144268_14344887|GENSCAN_predicted_peptide_5|98_aa MELKTMARELRDECTSFSGRFDQLEERVSVIEDQMNEMKQEEKFREKKVKRNEQSLQEIW DYVKRPNLHLIGVPESDGENGTNLENTLQDIIRRISPI >gi568815575f:14144268_14344887|GENSCAN_predicted_CDS_5|297_bp atggagctgaaaaccatggcacgagaactacgtgatgaatgcacaagcttcagtggccga ttcgatcaactggaagaaagggtatcagtgattgaagatcaaatgaatgaaatgaagcaa gaagagaagtttagagaaaaaaaagtaaaaagaaatgaacaaagcctccaagaaatatgg gactatgtgaaaagaccaaatctacatctgattggtgtacctgaaagtgacggggagaat ggaaccaacttggaaaacactctgcaggatatcatccggaggatttccccaatctag >gi568815575f:14144268_14344887|GENSCAN_predicted_peptide_6|126_aa MHVKQKEKTEKVLGAMKILPVLMKQAAKLERIMFKEVKAASSQQPGTESFNPTTYETINS ANSHIPHKHTPQFFPTLATTRDQQVPGELQDPWRPSSRHGLPLGEGKHNQLKPHLGQRKY GYSDDH >gi568815575f:14144268_14344887|GENSCAN_predicted_CDS_6|381_bp atgcatgttaaacaaaaagaaaagactgaaaaagtactaggagcaatgaaaatcttgcct gttttgatgaagcaagctgccaaattggagaggattatgtttaaggaagtgaaggcagct tcttcccaacagccgggaactgagtccttcaatccaacaacctatgagacaataaattct gccaacagccatatcccccacaaacacactccacaattctttccaactttagcaaccaca agggaccagcaggtccctggggaactacaggatccctggagacctagctctcggcatgga cttcccctaggggaaggaaagcataaccaactaaaaccccacttgggacaaaggaaatat gggtacagtgatgatcactga >gi568815575f:14144268_14344887|GENSCAN_predicted_peptide_7|84_aa MKFAQVKKSEGVIQVAETARIKEQSRCEHVMQCGQLRVNQARGGRGCPTGNWICIPTGNC EEPETSRIDVTHQLPCSLTGLQNG >gi568815575f:14144268_14344887|GENSCAN_predicted_CDS_7|255_bp atgaaatttgcacaagtgaaaaagagtgaaggggtaatccaggtagcagaaactgcaagg ataaaggaacaaagcaggtgtgagcatgttatgcaatgtggacagttgagagtcaaccag gcaagaggaggaagaggatgccccacaggcaactggatttgcatccccacaggcaactgt gaagagccagaaacatctaggattgatgttacccaccaactgccctgctccctgacaggc ctgcaaaatggttga >gi568815575f:14144268_14344887|GENSCAN_predicted_peptide_8|207_aa MSSDRQRSDDESPSTSSGSSDADQRDPAAPEPEEQEERKPSATQQKKNTKLSSKTTAKLS TSAKKIQKELAEITLDPPPNCSAGPKGDNICEWRSTILGPLGSVYEGGVFFLDITFSSDY PFKPPKVTFRTRIDHCNINSQGVICLDILKDNWSPALTVSKVLLSICSLLMDCNPVDPLV GSIATQYLTNRAEHDRIARQRTKRYAT >gi568815575f:14144268_14344887|GENSCAN_predicted_CDS_8|624_bp atgtccagtgataggcaaaggtccgatgatgagagccccagcaccagcagtggcagttca gatgcggaccagcgagacccagccgctccagagcctgaagaacaagaggaaagaaaacct tctgccacccagcagaagaaaaacaccaaactctctagcaaaaccactgctaagttatcc actagtgctaaaaaaattcagaaggagctagctgaaataacccttgatcctcctcctaat tgcagtgctgggcctaaaggagataacatttgtgaatggagatcaactatacttggtcca ctgggttctgtatatgaaggtggtgtgttttttctggatatcacattttcatcagattat ccatttaagccaccaaaggttactttccgcaccagaatcgatcactgcaacatcaacagt cagggagtcatctgtctggacatccttaaagacaactggagtcccgctttgactgtttca aaggttttgctgtctatttgttcccttttaatggactgcaaccctgtggatcctctggtt ggaagcatagccactcagtatttgaccaacagagcagaacacgacaggatagccagacag aggaccaagagatacgcaacgtaa >gi568815575f:14144268_14344887|GENSCAN_predicted_peptide_9|112_aa MGKDFMTKTPKAMATKAKIDKRDLIKLKSFCTAKETVIRVNRQPAEWEKNFAIYPSDKGL ISRIYKELQQICKKKTNKPIKKWVTYMNRHSSKEDIYVANKHMKKKLIIAGH >gi568815575f:14144268_14344887|GENSCAN_predicted_CDS_9|339_bp atgggcaaagacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaacgggatctaattaaactaaagagcttctgcacagcaaaagaaactgtcatcagagtg aacaggcaacctgcagaatgggagaaaaattttgcaatctatccatctgacaaagggcta atatccagaatctacaaggaacttcaacaaatttgcaagaaaaaaacaaacaagcccatc aaaaagtgggtgacgtatatgaacagacactcctcaaaagaagacatttatgtggccaac aaacatatgaaaaaaaagctcatcatcgctggtcattag >gi568815575f:14144268_14344887|GENSCAN_predicted_peptide_10|277_aa MIQNVPDRSWPEIEGLRKKARPLYVYRDEKIFNKNAKVTLCYRDSEKASDKIQQPFMLKT LNKLGIDGTYLKIIRAVYDKPTANIIPNGQKLEAAPLKTSTRQGCPLSPLLFNIVLEVLA RAIRKEKAIKDIQIGREEVKLSLFANDIIAYLENPIVSAQNLLKLISNFSKVSGYKINVQ KSQAFLYTNNRQTESQIMSELPFTIATKKIKYLGIQLTRDVKELFKKNYKPLLKELREDT NKWKNIPCSWIGRINIVKLAILPKVTYRFNAIPIKLP >gi568815575f:14144268_14344887|GENSCAN_predicted_CDS_10|834_bp atgatacaaaacgttccagatcgaagttggcccgagattgaaggtttaagaaaaaaagca cgacccttgtatgtctatcgggatgaaaagatattcaataagaatgccaaagttaccttg tgctatagagattcagaaaaggcctctgataaaattcaacagcccttcatgctaaaaact ctcaataaactaggaattgatggaacgtatctgaaaataataagagctgtttatgacaaa cccacagccaatatcataccgaatgggcaaaaactagaagcagcccctttgaaaaccagc acaagacaaggatgccctctctcaccactcctattcaacatagtgttggaagttctggcc agggcaatcaggaaagagaaagcaataaaggatattcaaataggaagagaggaagtcaaa ttgtctctgtttgcaaatgacataattgcatatttagaaaatcccatcgtctcagcccaa aatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaa aaatcacaggcattcctatacaccaataatagacaaacagagagccaaatcatgagtgaa ctcccattcacaattgctacaaagaaaataaaatacctaggaatacaacttacaagggat gtgaaggaactcttcaagaagaattacaaaccactgctcaaggaattaagagaggacaca aacaaatggaaaaacattccatgctcatggataggaagaatcaatatcgtgaaattggcc atactgcccaaagtaacttatagattcaatgctatccccatcaaactaccataa >gi568815575f:14144268_14344887|GENSCAN_predicted_peptide_11|354_aa MHLEKPQTLNASPRKQPEGELYPAKPQRGAAQGHGSPPLASACPRYETWSQRRSSCNFKE RFELGADKDPVENDIWGKKEESKEVYKDHSSACVEELLVQAQGTEFTGPGGLTAADGTLS AIDPLLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVHLENPFISAQNLLKLISNFS KVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYK PLLNEIKEDTKKWKNIPCSWVGRMNIMKMAILPKVIDTFNAIPIKLPMTFFTELEKTTLK FIRHQKRVCIAKTILSQKNKAGGITLPDFKLYYKAIVTKTAWYWYQNRDIDQWN >gi568815575f:14144268_14344887|GENSCAN_predicted_CDS_11|1065_bp atgcatcttgaaaagccacagacactcaatgccagcccaagaaaacagccagaaggggag ctgtaccctgcaaagccacagaggggagctgcccaaggccatgggagcccaccccttgca tcagcatgccctcgatatgagacatggagtcaaagaagatcatcttgtaactttaaggag agatttgagctcggtgctgacaaagatccagtagaaaatgacatttgggggaaaaaggaa gaatccaaagaagtgtataaggatcactcctctgcctgtgtagaagagttgctggtccaa gctcagggcactgaattcactggtcctggcggactcactgcagcagatggaacactctca gctattgaccccctgttggaagttctggccagggcaatcaggcaggagaaggaaataaag ggcattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgta catctagaaaaccccttcatctcagcccaaaatctccttaagctgataagcaacttcagc aaagtctcagggtacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataac agacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaata aaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaa ccactgctcaatgaaataaaagaggatacaaagaaatggaagaacattccatgctcgtgg gtaggaagaatgaatatcatgaaaatggccatactgcccaaggtaattgatacattcaat gccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaag ttcatacggcaccaaaaaagagtctgcattgccaagacaatcctaagccaaaagaacaaa gctggaggcatcacgctacctgacttcaaactatactacaaggctatagtaaccaaaaca gcatggtactggtaccaaaacagagatatagatcaatggaactga