GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:37:08 Sequence gi568815596f:119336886_119537437 : 200552 bp : 45.54% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 9922 9961 40 -2.96 1.01 Init + 10148 10190 43 1 1 35 110 29 0.283 0.38 1.02 Intr + 21957 22148 192 1 0 -26 82 134 0.031 0.86 1.03 Intr + 30053 30175 123 0 0 36 92 105 0.150 6.26 1.04 Intr + 30922 31046 125 2 2 52 22 31 0.080 -6.80 1.05 Intr + 31303 31420 118 0 1 98 85 155 0.950 16.24 1.06 Intr + 33855 33917 63 1 0 71 86 93 0.678 6.09 1.07 Term + 35360 35433 74 0 2 96 54 133 0.888 8.77 1.08 PlyA + 36942 36947 6 1.05 2.02 PlyA - 37055 37050 6 1.05 2.01 Sngl - 77957 77520 438 2 0 29 35 303 0.996 15.36 2.00 Prom - 81664 81625 40 -2.16 3.00 Prom + 85331 85370 40 -7.46 3.01 Init + 90179 90287 109 1 1 96 55 55 0.721 3.46 3.02 Intr + 94600 94824 225 2 0 82 3 169 0.596 5.76 3.03 Intr + 94920 95039 120 1 0 76 68 85 0.767 5.77 3.04 Intr + 99639 99769 131 1 2 43 77 32 0.285 -1.99 3.05 Term + 100004 100555 552 1 0 91 47 655 0.977 56.01 3.06 PlyA + 100819 100824 6 -0.45 4.21 PlyA - 101832 101827 6 1.05 4.20 Term - 103372 103232 141 1 0 101 54 132 0.999 9.03 4.19 Intr - 104714 104673 42 2 0 118 105 96 0.974 12.74 4.18 Intr - 110000 109874 127 1 1 55 111 162 0.671 15.88 4.17 Intr - 111895 111804 92 1 2 114 97 -11 0.993 1.19 4.16 Intr - 115194 115125 70 2 1 120 105 89 0.990 12.98 4.15 Intr - 116462 116402 61 0 1 126 94 8 0.842 2.99 4.14 Intr - 125115 124962 154 0 1 102 63 192 0.888 17.75 4.13 Intr - 127370 127238 133 1 1 124 52 279 0.994 28.55 4.12 Intr - 129001 128904 98 1 2 82 97 197 0.995 18.81 4.11 Intr - 136671 136568 104 1 2 92 102 127 0.836 14.39 4.10 Intr - 142033 141926 108 2 0 110 46 32 0.579 1.46 4.09 Intr - 145296 145227 70 0 1 103 94 34 0.943 4.25 4.08 Intr - 146311 146132 180 1 0 38 61 102 0.035 2.56 4.07 Intr - 151463 151384 80 0 2 58 94 66 0.042 3.47 4.06 Intr - 157663 157543 121 1 1 56 80 86 0.120 4.67 4.05 Intr - 158698 158589 110 2 2 56 94 43 0.050 1.70 4.04 Intr - 159286 159203 84 2 0 63 94 72 0.013 5.09 4.03 Intr - 178898 178688 211 2 1 72 89 29 0.408 -0.11 4.02 Intr - 183781 183653 129 1 0 85 53 80 0.755 5.09 4.01 Init - 187341 187270 72 0 0 97 109 156 0.642 17.70 4.00 Prom - 197240 197201 40 -1.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:119336886_119537437|GENSCAN_predicted_peptide_1|245_aa MSVGDDHKAVLKPSGADVEAAPSYPEDLAKIDEAGYTKQQIFNTDKTALYWKMKPSRICI AREKSTPGFKASKDRLTVGLGRVDRASKGACQCNLGDRFLVLASSAVSLEFLQVGQDVSV PGPVVARQLAALLLPQRGPPLGAKAWPALFTAVFPDLMPAFAEFEKAAEEVRHLKTKPSD EEMLFIYGHYKQATVGDINTERPGMLDFTGKAKWDAWNELKGTSKEDAMKAYINKVEELK KKYGI >gi568815596f:119336886_119537437|GENSCAN_predicted_CDS_1|738_bp atgagtgtaggtgatgatcataaagcggtgttaaaaccatcaggtgctgatgtagaagct gcaccaagttatcctgaagatctagctaagattgatgaagctggctacactaaacaacag attttcaacacagacaaaacagccttatattggaagatgaagccatctaggatctgcata gctagggagaagtcaacgcctggttttaaagcttcaaaggacaggctgactgtggggttg gggcgagtggaccgcgcctctaaaggcgcttgccagtgcaatctgggcgatcgcttcctg gtcctcgcctcctccgctgtctccctggagttcttgcaagtcggccaggatgtctcagtg cctggcccggtggtggccaggcagttggccgcgctgcttctcccgcagaggggaccccca ctgggggcgaaggcttggcctgccctcttcactgctgtatttccagacctgatgcctgcg tttgctgagtttgagaaagctgcagaggaggttaggcaccttaagaccaagccatcggat gaggagatgctgttcatctatggccactacaaacaagcaactgtgggcgacataaataca gaacggcccgggatgttggacttcacgggcaaggccaagtgggatgcctggaatgagctg aaagggacttccaaggaagatgccatgaaagcttacatcaacaaagtagaagagctaaag aaaaaatacgggatatga >gi568815596f:119336886_119537437|GENSCAN_predicted_peptide_2|145_aa MAIIYDLKKQKDKLLRLYTESDEQKKLMKHRKTLHKAKNEDPNCVLKEWIYQHCHEHTPL NGMLIMIQAKMCHNELKIKGNCKYSTDSLQKCKKRHNITFLKISGDETSADHKAVEEFTD EFAKVIADENLMPGRVYNADEASLF >gi568815596f:119336886_119537437|GENSCAN_predicted_CDS_2|438_bp atggccattatatatgacctgaagaaacagaaggataaactgttgaggctctacactgag agtgatgaacagaagaagttaatgaaacatagaaaaacactgcataaagctaaaaatgaa gatcccaattgtgtattgaaagagtggatctatcagcattgccatgaacacacgcccctt aatggtatgctgatcatgatacaagcaaagatgtgtcacaatgaactaaaaattaaaggg aactgtaaatattcaacagattctttgcagaaatgtaagaaaagacataacattacattt ttaaagatttctggtgatgaaacatctgctgatcacaaagcagtggaggaattcactgat gagtttgccaaggtcattgctgatgaaaatctgatgccaggacgagtctataatgctgat gaagcatcattgttttag >gi568815596f:119336886_119537437|GENSCAN_predicted_peptide_3|378_aa MAWGLGEEEDLVGKGGAAGGEEREEAMGHRQGKTWREAGLSKLEQESNPQALVAVPALEE SAPQAKLSSEVRERRSDFPILSPGTRGIPPPKCRVLPAAPQGLLRPPSLTQPAAPPLRRS PGLAPAATAEQLERSRLQRGRRAQHDCRRRAETEAGCMLKVMGCRHDRFSELRTDSDLPV GLDFSPTLKITKERKAQRPLGQRQPRRSFFESFIRTLIITCVALAVVLSSVSICDGHWLL AEDRLFGLWHFCTTTNQTICFRDLGQAHVPGLAVGMGLVRSVGALAVVAAIFGLEFLMVS QLCEDKHSQCKWVMGSILLLVSFVLSSGGLLGFVILLRNQVTLIGFTLMFWCEFTASFLL FLNAISGLHINSITHPWE >gi568815596f:119336886_119537437|GENSCAN_predicted_CDS_3|1137_bp atggcgtgggggctgggagaagaggaagacctggttggtaaaggcggggcagcaggagga gaggagagggaagaggccatgggccacagacaaggcaagacctggagagaggctggactc agcaagctggaacaggaatcgaaccctcaggccctcgtcgccgtcccagccctcgaggaa tctgcgccccaggcgaagctgtcctcggaggttcgggagcgtcggagtgacttcccgatc ctttcccctgggacccgagggatccctccccccaagtgccgggtcctccccgcggctccc caggggctcctccggccgccctcgctgactcagccagccgccccgcccctgcggagaagt cccgggctggcgccggcggccacagcggagcagctggagcgatcgaggctgcagcgcggc cgccgggcgcagcatgactgccgtcggcgtgcagaaaccgaggcaggctgcatgctcaag gtcatgggatgcaggcacgacaggttttctgaactcagaactgactcagatttgccagtc ggtttggacttctcacccactctgaagatcacaaaagaaagaaaggcccagaggcctttg ggccaaaggcagccccgccggtccttctttgaatccttcatccggaccctcatcatcacg tgtgtggccctggctgtggtcctgtcctcggtctccatttgtgatgggcactggctcctg gctgaggaccgcctcttcgggctctggcacttctgcaccaccaccaaccagacgatctgc ttcagagacctgggccaggcccatgtgcccgggctggccgtgggcatgggcctggtacgc agcgtgggcgccttggccgtggtggccgccatttttggcctggagttcctcatggtgtcc cagttgtgcgaggacaaacactcacagtgcaagtgggtcatgggttccatcctcctcctg gtgtctttcgtcctctcctccggcgggctcctgggttttgtgatcctcctcaggaaccaa gtcacactcatcggcttcaccctaatgttttggtgcgaattcactgcctccttcctcctc ttcctgaacgccatcagcggccttcacatcaacagcatcacccatccctgggaatga >gi568815596f:119336886_119537437|GENSCAN_predicted_peptide_4|728_aa MRPHLSPPLQQLLLPVLLACAAHSLMAKKYKNSKYRLWHMEAFSTWWSLPCPVMHLDPEA DKPCGCTQHGSGQRGSESDRLRIREDDGGTQSEARVLGTMGGPLVQVLKSKAVEPGILMF KDRRRVSQFQERERKRESGFFHSSSSTSSTEKPEGARDYGLRQEPELLITWANLECGFVS RVFSPCSIMLEFPETMGFSKNQTGALPRLCDVLQVLWEEQDQCLQELSREQTGDLGTEQP VPEKGSPGFARAVVSGQALAAEPRRASVQASLGALPDVTQVRKPLRRAGTVHICLANQPG IVQPHGLEPAAGTLADLASPASTESVFPQFPEPGLNKEGTWQLGPVSPGDWSGCEGMWDN ISCWPSSVPGRMVEVECPRFLRMLTSRNGSLFRNCTQDGWSETFPRPNLACGVNVNDSSN EKRHSYLLKLKVMYTVGYSSSLVMLLVALGILCAFRRLHCTRNYIHMHLFVSFILRALSN FIKDAVLFSSDDVTYCDAHRAGCKLVMVLFQYCIMANYSWLLVEGLYLHTLLAISFFSER KYLQGFVAFGWGSPAIFVALWAIARHFLEDVGCWDINANASIWWIIRGPVILSILINFIL FINILRILMRKLRTQETRGNEVSHYKRLARSTLLLIPLFGIHYIVFAFSPEDAMEIQLFF ELALGSFQGLVVAVLYCFLNGEVQLEVQKKWQQWHLREFPLHPVASFSNSTKASHLEQSQ GTCRTSII >gi568815596f:119336886_119537437|GENSCAN_predicted_CDS_4|2187_bp atgcgtccccacctgtcgccgccgctgcagcagctactactgccggtgctgctcgcctgc gccgcgcactcgttgatggctaagaaatataaaaattccaagtacaggctctggcacatg gaagccttcagtacatggtggtcgctgccctgtccggtcatgcatctggatccagaagct gacaagccctgtggctgcacacaacatggctcaggccaacgtggttctgagtctgaccgc ctcagaatcagggaagatgatggtggaactcagtcagaagctagagtcctgggaaccatg gggggaccacttgtgcaagttctaaaatccaaggctgtagaacctgggattctgatgttc aaggacaggagaagggtgtcccagttccaggaaagagagagaaagagagagagtggattt ttccactcaagctcttccacttcctccactgagaagccagagggtgccagagactatggg cttcggcaagaaccagaacttttgattacttgggccaatttggaatgtggctttgtctcc agggtcttctctccctgctccattatgttggaatttccagagactatgggcttcagcaag aaccagactggagcccttccccgactatgtgacgtgctacaagtgctgtgggaagagcaa gaccagtgcctgcaggaactctccagagagcagacaggagacctgggcacggagcagcca gtgccagaaaagggctcccctggctttgcaagagctgtggtgtctggccaggccctggcg gctgaaccaaggagggccagcgtgcaggcatccctgggggccctgcctgacgtgacccaa gttaggaagcctctcagacgtgctggcaccgtccacatctgtctggccaaccagccaggc attgtccagcctcatggcctggagcctgcggctggtaccctggctgacctggccagtcca gccagcactgaatctgtttttccacaattcccagagcctggcttgaacaaggagggcact tggcagctggggccggtgtcacctggagactggtcaggttgtgaggggatgtgggacaac ataagctgctggccctcttctgtgccgggccggatggtggaggtggaatgcccgagattc ctccggatgctcaccagcagaaatggttccttgttccgaaactgcacacaggatggctgg tcagaaaccttccccaggcctaatctggcctgtggcgttaatgtgaacgactcttccaac gagaagcggcactcctacctgctgaagctgaaagtcatgtacaccgtgggctacagctcc tccctggtcatgctcctggtcgcccttggcatcctctgtgctttccggaggctccactgc actcgcaactacatccacatgcacctgttcgtgtccttcatccttcgtgccctgtccaac ttcatcaaggacgccgtgctcttctcctcagatgatgtcacctactgcgatgcccacagg gcgggctgcaagctggtcatggtgctgttccagtactgcatcatggccaactactcctgg ctgctggtggaaggcctctaccttcacacactcctcgccatctccttcttctctgaaaga aagtacctccagggatttgtggcattcggatggggttctccagccatttttgttgctttg tgggctattgccagacactttctggaagatgttgggtgctgggacatcaatgccaacgca tccatctggtggatcattcgtggtcctgtgatcctctccatcctgattaatttcatcctt ttcataaacattctaagaatcctgatgagaaaacttagaacccaagaaacaagaggaaat gaagtcagccattataagcgcctggccaggtccactctcctgctgatccccctctttggc atccactacatcgtcttcgccttctccccagaggacgctatggagatccagctgtttttt gaactagcccttggctcattccagggactggtggtggccgtcctctactgcttcctcaat ggggaggtgcagctggaggttcagaagaagtggcagcaatggcacctccgtgagttccca ctgcaccccgtggcctccttcagcaacagcaccaaggccagccacttggagcagagccag ggcacctgcaggaccagcatcatctga