GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:33:37

Sequence gi568815596f:119267585_119472315 : 204731 bp : 43.80% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.00  Prom +   1238   1277   40                              -1.96
1.01  Init +  23571  23658   88  2  1   62   70   102 0.704   6.60
1.02  Term +  29093  29229  137  0  2   70   43    75 0.148  -0.62
1.03  PlyA +  30644  30649    6                               1.05

2.05  PlyA -  30800  30795    6                               1.05
2.04  Term -  34964  34888   77  0  2  100   48    46 0.310  -0.20
2.03  Intr -  43900  43802   99  2  0   63   91    46 0.751   2.48
2.02  Intr -  44119  44038   82  1  1   96   85    91 0.932   8.81
2.01  Init -  63951  63904   48  0  0   80   65    55 0.276   3.35
2.00  Prom -  64161  64122   40                              -4.86

3.00  Prom +  79223  79262   40                              -2.96
3.01  Init +  79449  79491   43  2  1   35  110    29 0.281   0.38
3.02  Intr +  91258  91449  192  2  0  -26   82   134 0.031   0.86
3.03  Intr +  99354  99476  123  1  0   36   92   105 0.150   6.26
3.04  Intr + 100223 100347  125  0  2   52   22    31 0.080  -6.80
3.05  Intr + 100604 100721  118  1  1   98   85   155 0.950  16.24
3.06  Intr + 103156 103218   63  2  0   71   86    93 0.678   6.09
3.07  Term + 104661 104734   74  1  2   96   54   133 0.888   8.77
3.08  PlyA + 106243 106248    6                               1.05

4.02  PlyA - 106356 106351    6                               1.05
4.01  Sngl - 147258 146821  438  0  0   29   35   303 0.996  15.36
4.00  Prom - 150965 150926   40                              -2.16

5.00  Prom + 154632 154671   40                              -7.46
5.01  Init + 159480 159588  109  2  1   96   55    55 0.721   3.46
5.02  Intr + 163901 164125  225  0  0   82    3   169 0.596   5.76
5.03  Intr + 164221 164340  120  2  0   76   68    85 0.767   5.77
5.04  Intr + 168940 169070  131  2  2   43   77    32 0.285  -1.99
5.05  Term + 169305 169856  552  2  0   91   47   655 0.977  56.01
5.06  PlyA + 170120 170125    6                              -0.45

6.10  PlyA - 171133 171128    6                               1.05
6.09  Term - 172673 172533  141  2  0  101   54   132 0.999   9.03
6.08  Intr - 174015 173974   42  0  0  118  105    96 0.974  12.74
6.07  Intr - 179301 179175  127  2  1   55  111   162 0.671  15.88
6.06  Intr - 181196 181105   92  2  2  114   97   -11 0.993   1.19
6.05  Intr - 184495 184426   70  0  1  120  105    89 0.990  12.98
6.04  Intr - 185763 185703   61  1  1  126   94     8 0.842   2.99
6.03  Intr - 194416 194263  154  1  1  102   63   192 0.888  17.75
6.02  Intr - 196671 196539  133  2  1  124   52   279 0.994  28.55
6.01  Intr - 198302 198205   98  2  2   82   97   197 0.996  18.81

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:119267585_119472315|GENSCAN_predicted_peptide_1|74_aa
MAGKVSYRGEAARQDVKMNGWALIGWCLAGRKRRKRKPVCYGKEVPQSHMKPGPTRGTRA
QVFLWQRQVLFKGE

>gi568815596f:119267585_119472315|GENSCAN_predicted_CDS_1|225_bp
atggcaggaaaggtgtcataccgaggagaagctgctcggcaagatgtaaagatgaacggc
tgggcattaatcggctggtgcctggcaggcagaaaaagaaggaaaagaaaacccgtttgc
tacggcaaagaagttcctcaaagtcacatgaaacctggcccaactcgtgggaccagggcc
caagtgttcctttggcaaaggcaagttctcttcaaaggagaatag

>gi568815596f:119267585_119472315|GENSCAN_predicted_peptide_2|101_aa
MGSSEDLQRIFEDMQSTNELVLSLEDDERLLLKEDSTLKAAGIEEEDENDVHSDLLAEDS
RCLLSKYAECKWLFSEASETEIAFFCEEDYKNYKANPISSW

>gi568815596f:119267585_119472315|GENSCAN_predicted_CDS_2|306_bp
atggggagctctgaagatttgcagaggatctttgaagatatgcagagtacaaatgaactt
gtgttgagtttggaagatgacgaaagactcctgctgaaagaagacagcactctgaaagca
gctggaatcgaggaggaggatgagaatgatgtgcattccgatctcttggctgaggacagt
aggtgtctgctctccaaatatgcagaatgcaaatggctgttttcagaagccagtgaaact
gaaattgcattcttctgtgaagaagattataagaactacaaagctaatcccatttcatcc
tggtga

>gi568815596f:119267585_119472315|GENSCAN_predicted_peptide_3|245_aa
MSVGDDHKAVLKPSGADVEAAPSYPEDLAKIDEAGYTKQQIFNTDKTALYWKMKPSRICI
AREKSTPGFKASKDRLTVGLGRVDRASKGACQCNLGDRFLVLASSAVSLEFLQVGQDVSV
PGPVVARQLAALLLPQRGPPLGAKAWPALFTAVFPDLMPAFAEFEKAAEEVRHLKTKPSD
EEMLFIYGHYKQATVGDINTERPGMLDFTGKAKWDAWNELKGTSKEDAMKAYINKVEELK
KKYGI

>gi568815596f:119267585_119472315|GENSCAN_predicted_CDS_3|738_bp
atgagtgtaggtgatgatcataaagcggtgttaaaaccatcaggtgctgatgtagaagct
gcaccaagttatcctgaagatctagctaagattgatgaagctggctacactaaacaacag
attttcaacacagacaaaacagccttatattggaagatgaagccatctaggatctgcata
gctagggagaagtcaacgcctggttttaaagcttcaaaggacaggctgactgtggggttg
gggcgagtggaccgcgcctctaaaggcgcttgccagtgcaatctgggcgatcgcttcctg
gtcctcgcctcctccgctgtctccctggagttcttgcaagtcggccaggatgtctcagtg
cctggcccggtggtggccaggcagttggccgcgctgcttctcccgcagaggggaccccca
ctgggggcgaaggcttggcctgccctcttcactgctgtatttccagacctgatgcctgcg
tttgctgagtttgagaaagctgcagaggaggttaggcaccttaagaccaagccatcggat
gaggagatgctgttcatctatggccactacaaacaagcaactgtgggcgacataaataca
gaacggcccgggatgttggacttcacgggcaaggccaagtgggatgcctggaatgagctg
aaagggacttccaaggaagatgccatgaaagcttacatcaacaaagtagaagagctaaag
aaaaaatacgggatatga

>gi568815596f:119267585_119472315|GENSCAN_predicted_peptide_4|145_aa
MAIIYDLKKQKDKLLRLYTESDEQKKLMKHRKTLHKAKNEDPNCVLKEWIYQHCHEHTPL
NGMLIMIQAKMCHNELKIKGNCKYSTDSLQKCKKRHNITFLKISGDETSADHKAVEEFTD
EFAKVIADENLMPGRVYNADEASLF

>gi568815596f:119267585_119472315|GENSCAN_predicted_CDS_4|438_bp
atggccattatatatgacctgaagaaacagaaggataaactgttgaggctctacactgag
agtgatgaacagaagaagttaatgaaacatagaaaaacactgcataaagctaaaaatgaa
gatcccaattgtgtattgaaagagtggatctatcagcattgccatgaacacacgcccctt
aatggtatgctgatcatgatacaagcaaagatgtgtcacaatgaactaaaaattaaaggg
aactgtaaatattcaacagattctttgcagaaatgtaagaaaagacataacattacattt
ttaaagatttctggtgatgaaacatctgctgatcacaaagcagtggaggaattcactgat
gagtttgccaaggtcattgctgatgaaaatctgatgccaggacgagtctataatgctgat
gaagcatcattgttttag

>gi568815596f:119267585_119472315|GENSCAN_predicted_peptide_5|378_aa
MAWGLGEEEDLVGKGGAAGGEEREEAMGHRQGKTWREAGLSKLEQESNPQALVAVPALEE
SAPQAKLSSEVRERRSDFPILSPGTRGIPPPKCRVLPAAPQGLLRPPSLTQPAAPPLRRS
PGLAPAATAEQLERSRLQRGRRAQHDCRRRAETEAGCMLKVMGCRHDRFSELRTDSDLPV
GLDFSPTLKITKERKAQRPLGQRQPRRSFFESFIRTLIITCVALAVVLSSVSICDGHWLL
AEDRLFGLWHFCTTTNQTICFRDLGQAHVPGLAVGMGLVRSVGALAVVAAIFGLEFLMVS
QLCEDKHSQCKWVMGSILLLVSFVLSSGGLLGFVILLRNQVTLIGFTLMFWCEFTASFLL
FLNAISGLHINSITHPWE

>gi568815596f:119267585_119472315|GENSCAN_predicted_CDS_5|1137_bp
atggcgtgggggctgggagaagaggaagacctggttggtaaaggcggggcagcaggagga
gaggagagggaagaggccatgggccacagacaaggcaagacctggagagaggctggactc
agcaagctggaacaggaatcgaaccctcaggccctcgtcgccgtcccagccctcgaggaa
tctgcgccccaggcgaagctgtcctcggaggttcgggagcgtcggagtgacttcccgatc
ctttcccctgggacccgagggatccctccccccaagtgccgggtcctccccgcggctccc
caggggctcctccggccgccctcgctgactcagccagccgccccgcccctgcggagaagt
cccgggctggcgccggcggccacagcggagcagctggagcgatcgaggctgcagcgcggc
cgccgggcgcagcatgactgccgtcggcgtgcagaaaccgaggcaggctgcatgctcaag
gtcatgggatgcaggcacgacaggttttctgaactcagaactgactcagatttgccagtc
ggtttggacttctcacccactctgaagatcacaaaagaaagaaaggcccagaggcctttg
ggccaaaggcagccccgccggtccttctttgaatccttcatccggaccctcatcatcacg
tgtgtggccctggctgtggtcctgtcctcggtctccatttgtgatgggcactggctcctg
gctgaggaccgcctcttcgggctctggcacttctgcaccaccaccaaccagacgatctgc
ttcagagacctgggccaggcccatgtgcccgggctggccgtgggcatgggcctggtacgc
agcgtgggcgccttggccgtggtggccgccatttttggcctggagttcctcatggtgtcc
cagttgtgcgaggacaaacactcacagtgcaagtgggtcatgggttccatcctcctcctg
gtgtctttcgtcctctcctccggcgggctcctgggttttgtgatcctcctcaggaaccaa
gtcacactcatcggcttcaccctaatgttttggtgcgaattcactgcctccttcctcctc
ttcctgaacgccatcagcggccttcacatcaacagcatcacccatccctgggaatga

>gi568815596f:119267585_119472315|GENSCAN_predicted_peptide_6|305_aa
HSYLLKLKVMYTVGYSSSLVMLLVALGILCAFRRLHCTRNYIHMHLFVSFILRALSNFIK
DAVLFSSDDVTYCDAHRAGCKLVMVLFQYCIMANYSWLLVEGLYLHTLLAISFFSERKYL
QGFVAFGWGSPAIFVALWAIARHFLEDVGCWDINANASIWWIIRGPVILSILINFILFIN
ILRILMRKLRTQETRGNEVSHYKRLARSTLLLIPLFGIHYIVFAFSPEDAMEIQLFFELA
LGSFQGLVVAVLYCFLNGEVQLEVQKKWQQWHLREFPLHPVASFSNSTKASHLEQSQGTC
RTSII

>gi568815596f:119267585_119472315|GENSCAN_predicted_CDS_6|918_bp
cactcctacctgctgaagctgaaagtcatgtacaccgtgggctacagctcctccctggtc
atgctcctggtcgcccttggcatcctctgtgctttccggaggctccactgcactcgcaac
tacatccacatgcacctgttcgtgtccttcatccttcgtgccctgtccaacttcatcaag
gacgccgtgctcttctcctcagatgatgtcacctactgcgatgcccacagggcgggctgc
aagctggtcatggtgctgttccagtactgcatcatggccaactactcctggctgctggtg
gaaggcctctaccttcacacactcctcgccatctccttcttctctgaaagaaagtacctc
cagggatttgtggcattcggatggggttctccagccatttttgttgctttgtgggctatt
gccagacactttctggaagatgttgggtgctgggacatcaatgccaacgcatccatctgg
tggatcattcgtggtcctgtgatcctctccatcctgattaatttcatccttttcataaac
attctaagaatcctgatgagaaaacttagaacccaagaaacaagaggaaatgaagtcagc
cattataagcgcctggccaggtccactctcctgctgatccccctctttggcatccactac
atcgtcttcgccttctccccagaggacgctatggagatccagctgttttttgaactagcc
cttggctcattccagggactggtggtggccgtcctctactgcttcctcaatggggaggtg
cagctggaggttcagaagaagtggcagcaatggcacctccgtgagttcccactgcacccc
gtggcctccttcagcaacagcaccaaggccagccacttggagcagagccagggcacctgc
aggaccagcatcatctga