GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:14:51 Sequence gi568815585f:49396002_49628662 : 232661 bp : 40.76% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12044 12104 61 1 1 38 88 87 0.967 5.16 1.02 Intr + 16868 16950 83 0 2 73 92 76 0.538 4.94 1.03 Intr + 21802 21949 148 0 1 53 58 72 0.288 -0.41 1.04 Term + 25771 25883 113 2 2 69 47 76 0.132 -0.66 1.05 PlyA + 26034 26039 6 1.05 2.05 PlyA - 26286 26281 6 1.05 2.04 Term - 27222 27101 122 1 2 86 42 115 0.579 4.36 2.03 Intr - 38222 38085 138 0 0 80 94 84 0.203 7.71 2.02 Intr - 44911 44866 46 1 1 47 89 58 0.104 -1.14 2.01 Init - 56668 56609 60 1 0 90 98 23 0.039 4.80 2.00 Prom - 57904 57865 40 -5.65 3.00 Prom + 59510 59549 40 -9.65 3.01 Init + 59863 59944 82 0 1 61 92 77 0.907 6.58 3.02 Intr + 64106 64231 126 0 0 46 86 115 0.925 6.83 3.03 Term + 68020 68066 47 2 2 94 54 36 0.416 -2.81 3.04 PlyA + 68763 68768 6 1.05 4.00 Prom + 76105 76144 40 -1.05 4.01 Init + 80596 81038 443 0 2 53 84 199 0.846 11.60 4.02 Intr + 84218 84334 117 2 0 85 82 25 0.416 0.26 4.03 Intr + 84946 85115 170 1 2 74 103 47 0.770 3.47 4.04 Intr + 86736 86961 226 1 1 99 75 112 0.917 7.12 4.05 Intr + 87463 87562 100 1 1 84 53 36 0.729 -1.11 4.06 Intr + 92291 92629 339 1 0 -11 82 291 0.029 13.34 4.07 Intr + 96351 96439 89 2 2 22 84 31 0.202 -6.05 4.08 Intr + 99898 100094 197 1 2 30 82 136 0.100 5.34 4.09 Intr + 110619 110755 137 1 2 51 111 40 0.006 1.97 4.10 Intr + 117058 117165 108 0 0 70 111 141 0.994 14.16 4.11 Intr + 119842 119872 31 0 1 81 103 10 0.708 -1.41 4.12 Intr + 128084 128215 132 0 0 78 84 75 0.544 5.80 4.13 Term + 128912 129015 104 0 2 51 54 95 0.580 -0.14 4.14 PlyA + 129708 129713 6 1.05 5.00 Prom + 132252 132291 40 -3.75 5.01 Init + 135284 135384 101 1 2 97 23 111 0.460 5.28 5.02 Term + 138037 138226 190 1 1 67 38 135 0.416 2.34 5.03 PlyA + 138278 138283 6 1.05 6.14 PlyA - 138449 138444 6 1.05 6.13 Term - 141223 141114 110 2 2 77 43 95 0.795 1.59 6.12 Intr - 144212 144083 130 2 1 68 75 58 0.769 1.85 6.11 Intr - 145005 144875 131 1 2 98 78 51 0.953 4.59 6.10 Intr - 145826 145675 152 1 2 85 52 165 0.984 11.49 6.09 Intr - 148862 148736 127 0 1 83 68 160 0.995 12.32 6.08 Intr - 150617 150560 58 2 1 97 101 11 0.804 0.84 6.07 Intr - 150890 150836 55 1 1 14 105 39 0.257 -3.74 6.06 Intr - 155467 155325 143 1 2 89 88 94 0.915 7.73 6.05 Intr - 156284 156177 108 2 0 111 98 102 0.990 13.06 6.04 Intr - 159672 159514 159 0 0 136 78 148 0.999 17.86 6.03 Intr - 164083 163917 167 2 2 80 107 124 0.999 12.26 6.02 Intr - 170767 170617 151 1 1 54 116 126 0.996 10.81 6.01 Init - 171278 171153 126 2 0 78 94 120 0.877 11.83 6.00 Prom - 184149 184110 40 -2.35 7.04 PlyA - 184630 184625 6 1.05 7.03 Term - 189906 189734 173 1 2 92 53 106 0.249 4.51 7.02 Intr - 191343 191215 129 1 0 114 70 16 0.160 2.15 7.01 Init - 200291 200120 172 2 1 26 66 127 0.372 3.95 7.00 Prom - 206165 206126 40 -6.95 8.04 PlyA - 206667 206662 6 1.05 8.03 Term - 224788 224330 459 1 0 106 53 261 0.961 18.50 8.02 Intr - 225986 225835 152 0 2 61 44 116 0.512 3.46 8.01 Init - 229552 229447 106 1 1 87 39 58 0.270 1.24 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 54681 54490 192 0 0 74 48 154 0.884 4.89 S.002 Intr + 92289 92629 341 1 2 27 82 295 0.884 17.07 S.003 Init + 116630 116635 6 1 0 87 94 0 0.847 1.53 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:49396002_49628662|GENSCAN_predicted_peptide_1|134_aa MNVNLAEESKGKETVGDEIKGGGIQMVMLASQPLTSYCAARFLTGHGQKLKSELKKTLQV IMLNIQKVRIPTRDGYNEKDILSTGKNVEKLELSNIAGGQLRTGREPSPSGGNEATLSTP SSSVGGCLLKHRFK >gi568815585f:49396002_49628662|GENSCAN_predicted_CDS_1|405_bp atgaatgtgaatttggcagaagaatcaaagggcaaagaaacagtaggtgatgaaattaaa ggaggcggaattcagatggtaatgcttgccagccagccactcacctcctactgtgctgcc cggttcctaacaggccatggacagaaattaaaatctgagttaaaaaaaaccctgcaagtc attatgctaaatattcagaaagtgagaatacccactagggatggctataatgaaaaagac atactaagcactggcaaaaatgtagagaaactggaactctcaaacattgctggtggtcaa ctgagaacagggagagaaccaagcccctctggtggtaatgaggcaaccctgtctaccccc agcagtagtgttggagggtgcctgctaaaacacagatttaagtaa >gi568815585f:49396002_49628662|GENSCAN_predicted_peptide_2|121_aa MTSKWNVVSWMGSWNGKRHQKTLKTPPKALGTDNFRYPSNIEKKEYQEQSVLSCCSERKD ANPKSVVCSFFMQEQCTKGEKHLFNLKFCGKDPVICFTCRPGDPDDTNDRASMIYGIVLH E >gi568815585f:49396002_49628662|GENSCAN_predicted_CDS_2|366_bp atgacaagtaaatggaatgtggtgtcctggatgggatcttggaatggaaaaagacatcag aaaaccctaaagactccaccaaaggctcttggaactgataacttcagatatccatcaaat attgagaagaaagaatatcaggagcaaagtgttctaagttgctgttcagaacgtaaagat gcgaaccccaaatcagtggtttgttcattcttcatgcaagagcaatgcactaaaggagag aagcatttatttaatctgaaattctgcggaaaggacccagtaatctgtttcacctgccgt ccaggtgatcctgatgatactaatgatagagcatcaatgatctatggtatagtccttcat gagtag >gi568815585f:49396002_49628662|GENSCAN_predicted_peptide_3|84_aa MAVQMIVITYVVSLKVYLDEIGGEDHNSDAKTFWMELEDDGKVDFIFEQVQNVLQSLKQK IKDGSATNKGASQKEVNAQSSGEI >gi568815585f:49396002_49628662|GENSCAN_predicted_CDS_3|255_bp atggctgtacaaatgattgtaataacttatgttgtttcattaaaggtgtacttagatgaa attggtggtgaagatcacaatagcgatgcaaaaactttctggatggagctagaagatgat ggaaaagtggacttcatttttgaacaagtacaaaatgtgctgcagtcactgaaacaaaag atcaaagatgggtctgccaccaataaaggagcatcacagaaagaagtgaatgcccaaagc agtggtgagatttga >gi568815585f:49396002_49628662|GENSCAN_predicted_peptide_4|730_aa MPLNLKGENPLQLPIKCHFQRRHAKTNSHSSALHVSYKTPCGRSLRNVEEVFRYLLETEC NFLFTDNFSFNTYVQLARNYPKQKEVVSDVDISNGVESVPISFCNEIDSRKLPQFKYRKT VWPRAYNLTNFSSMFTDSCDCSEGCIDITKCACLQLTARNAKTSPLSSDKITTGYKYKRL QRQIPTGIYECSLLCKCNRQLCQNRVVQHGPQVRLQVFKTEQKGWGVRCLDDIDRGTFVC IYSGRLLSRANTEKSYGIDENGRDENTMKNIFSKKRKLEVACSDCEVEVLPLGLETHPRT AKTEKCPPKFSNNPKELTVETKYDNISRIQYHSVIRDPESKTAIFQHNGKKMDSSSNHVD EFEDNLLIESDVIDITKYREETPPRSRCNQATTLDNQNIKKAIEVQIQKPQEGRSTACQR QQVFCDEELLSETKNTSSDSLTKFNKGNVFLLDATKEGNVGRFLNYFNTCWRSNIWIDKT LSHQPFEWGEGEALLDSKRQPRFKKTKPSAAGALPGSRYPAATRSCSTVMAQASPPRPER VLGASSPEARPAQEALLLPTGILLIGVFQVAEKMEKRTCALCPKDVEYNVLYFAQSENIA AHENCLLYSSGLVECEDQDPLNPDRSFDVESVKKEIQRGRKLKDKTQLLTLAYATVKVPF LKKCKEAGLLNYLLEEILDKVHSIPEKLMDETTSESEVSNRLATKRMCHSEIDSVTYAPL PPPCIPVAKI >gi568815585f:49396002_49628662|GENSCAN_predicted_CDS_4|2193_bp atgccactgaacttgaagggagaaaaccctctgcagctgccaatcaaatgtcacttccaa agacgacatgcaaagacaaactctcattcttcagcactccacgtgagttataaaacccct tgtggaaggagtctacgaaacgtggaggaagtttttcgttacctgcttgagacagagtgt aactttttatttacagataacttttctttcaatacctatgttcagttggctcggaattac ccaaagcaaaaagaagttgtttctgatgtggatattagcaatggagtggaatcagtgccc atttctttctgtaatgaaattgacagtagaaagctcccacagtttaagtacagaaagact gtgtggcctcgagcatataatctaaccaacttttccagcatgtttactgattcctgtgac tgctctgagggctgcatagacataacaaaatgtgcatgtcttcaactgacagcaaggaat gccaaaacttcccccttgtcaagtgacaaaataaccactggatataaatataaaagacta cagagacagattcctactggcatttatgaatgcagccttttgtgcaaatgtaatcgacaa ttgtgtcaaaaccgagttgtccaacatggtcctcaagtgaggttacaggtgttcaaaact gagcagaagggatggggtgtacgctgtctagatgacattgacagagggacatttgtttgc atttattcaggaagattactaagcagagctaacactgaaaaatcttatggtattgatgaa aacgggagagatgagaatactatgaaaaatatattttcaaaaaagaggaaattagaagtt gcatgttcagattgtgaagttgaagttctcccattaggattggaaacacatcctagaact gctaaaactgagaaatgtccaccaaagttcagtaataatcccaaggagcttactgtggaa acgaaatatgataatatttcaagaattcaatatcattcagttattagagatcctgaatcc aagacagccatttttcaacacaatgggaaaaaaatggactcaagttcaaaccatgttgat gagtttgaagataatctgctgattgaatcagatgtgatagatataactaaatatagagaa gaaactccaccaaggagcagatgtaaccaggcgaccacattggataatcagaatattaaa aaggcaattgaggttcaaattcagaaaccccaagagggacgatctacagcatgtcaaaga cagcaggtattttgtgatgaagagttgctaagtgaaaccaagaatacttcatctgattct ctaacaaagttcaataaagggaatgtgtttttattggatgccacaaaagaaggaaatgtc ggccgcttccttaattattttaacacttgttggagaagcaatatctggatcgataaaaca ctgtcccatcaaccatttgagtggggagagggagaagctcttcttgactcaaagcgacag cccagatttaagaaaacgaaacctagtgcagctggggcacttccgggatctcgctatccg gccgccacccgcagctgcagcacagtcatggcccaggcgtcgccgccccggcccgagagg gtgctcggcgccagcagcccggaggcccggcccgcgcaggaggcgctcctccttcccacc gggatattacttataggtgtctttcaggttgcagaaaagatggaaaaaaggacatgtgca ctctgccccaaagatgtcgaatataatgtcctatactttgcacaatcagagaatatagct gctcatgagaattgtttgctgtattcttcaggacttgtggaatgtgaggatcaggatcca cttaatcctgatagaagttttgatgtggaatcagtaaagaaagaaatccagagaggaagg aagttgaaagataaaacccaactccttactctggcatatgcaactgtgaaagttcctttt cttaagaaatgcaaggaagcaggacttcttaattacttacttgaagaaatattagacaaa gttcattcaattccagaaaaactcatggatgagactacttcagaatcagaggtgtctaac aggttggccactaagagaatgtgccattcagagattgattcggtcacatatgctcccctg ccaccgccctgcattcctgttgctaagatctga >gi568815585f:49396002_49628662|GENSCAN_predicted_peptide_5|96_aa MHLKQQSPTFLAPGTGFVEDNFSTDHGDGAVGQGITSPVEHKLDTSSTVPQSTHTEPSSL ALQFLKAPHLLALAMNSFSRGPSICQNAAVCVTSVK >gi568815585f:49396002_49628662|GENSCAN_predicted_CDS_5|291_bp atgcacctaaagcagcagtccccaacctttttggcaccagggactggtttcgtggaagac aatttttccacggaccatggggatggggcggttggtcagggaatcacatcacccgtagag cacaaactggacacatcctcaacagtgccccagagcactcacacagaacccagcagcctt gcgcttcagttcttaaaggctccacatttactggctttagcaatgaattcctttagcaga gggccatccatttgccaaaatgctgcagtctgtgtaacttctgtcaaatga >gi568815585f:49396002_49628662|GENSCAN_predicted_peptide_6|538_aa MVDVGKWPIFTLLSPQEIASIRKACVFGTSASEALYVTDNDEVFVFGLNYSNCLGTGDNQ STLVPKKLEGLCGKKIKSLSYGSGPHVLLSTEDGVVYAWGHNGYSQLGNGTTNQGIAPVQ VCTNLLIKQVVEVACGSHHSMALAADGEVFAWGYNNCGQVGSGSTANQPTPRKVTNCLHI KRVVGIACGQTSSMAVLDNGEVYGWGYNGNGQLGLGNNGNQLTPVRVAALHSVCVNQIVC GYAHTLALTDEGLLYAWGANTYGQLGTGNKNNLLSPAHIMVEKERPYACTGPWPVINWVA RQEMGPSSCRKTSSGLPLILHYEHEDFLTVAESLKKEFDSPETADLKFRIDGKYIHVHKA VLKIRCEHFRSMFQSYWNEDMKEVIEIDQFSYPVYRAFLQYLYTDTVDLPPEDAIGLLDL ATSYCENRLKKLCQHIIKRGITVENAFSLFSAAVRYDAECLAHIRHLINVELDEDSTLKE ALRMISELKAGMVGFTDSERIKVTLTTGWTELGVVRDLVSSGMCLVGSCDAEAPAHPG >gi568815585f:49396002_49628662|GENSCAN_predicted_CDS_6|1617_bp atggtggatgtcggaaagtggcccatcttcactctactctcccctcaagagatcgcgtct attcggaaggcgtgtgtcttcggcacctcagccagtgaagcactgtacgttactgacaat gatgaggtctttgtatttggactgaactatagtaactgtctaggaactggagataaccag agtacacttgtacccaaaaagctagaaggcttatgtggaaagaagattaaaagcctcagt tacgggagtggaccacatgttcttctcagcaccgaagatggagtggtttatgcctggggc cacaatggatatagccagcttgggaatgggacgaccaaccaaggcattgctcccgtccag gtctgtaccaatctcttgatcaagcaagtggtggaagtagcttgtggctcacatcattca atggctctggcagctgatggagaggtgtttgcttggggttataacaactgtggccaagtg ggatcaggttctacagcaaatcaaccaactcctcgaaaagttacaaactgtttacatatt aagagggtagttggcattgcctgtggtcagacttcatccatggctgttctggacaatggc gaggtatatggctggggttacaatggcaacggtcagctgggcctgggaaacaatggcaac cagctgacccctgtgagagtggcagctttgcacagcgtgtgtgtgaaccagattgtctgc ggttacgcacatactctagcactaacagatgagggcttgctgtatgcctggggagctaac acatatgggcagctgggaactggcaataaaaataacctgctaagcccagcacacatcatg gtggagaaagaaaggccatatgcctgtactggtccgtggcctgttattaactgggttgca cggcaggagatggggccgtccagttgtaggaaaacaagctcaggactcccactgattcta cattatgagcatgaagactttttaacagttgcagagtcactgaagaaagaatttgatagt ccagaaactgctgatctgaagtttcgaattgatggaaaatatattcatgtccataaagct gttttgaaaatcaggtgtgagcattttcgatccatgttccagtcgtattggaatgaagac atgaaggaagtgatagaaatcgatcagttttcttacccagtgtatcgtgcctttctccag tacctctacacagacacagtcgacctgccgccagaagatgctataggtcttctggatttg gcgacatcttactgtgaaaacagactgaaaaaactttgtcagcacattatcaagagagga attactgtggagaatgccttttcgctattctctgctgcagtcagatatgatgcagagtgc ctggcacatattaggcacctgataaatgttgaactggatgaagatagtaccttaaaggaa gcattaaggatgatcagtgagctgaaggcgggtatggttggtttcacagattcagagaga ataaaagttactttaacaacaggttggacagagttaggtgtggttcgtgatcttgtgtca tctggaatgtgtcttgtggggagctgtgatgccgaggcacctgctcaccctggttag >gi568815585f:49396002_49628662|GENSCAN_predicted_peptide_7|157_aa MKTLAVFPGGNVSLLETKTFDPVGYICEDENINSLTGSLGLIASGDTAAPTPWIHLPDMF TQVLLIGLQPKETDFSFFEQNRPLLVNSLNCHYFGQATSFSLWGGRPTLPTAKPSRGAEQ EPVPRAFLVPQTQKPLSPGSSHRISWGHSSGRASGRA >gi568815585f:49396002_49628662|GENSCAN_predicted_CDS_7|474_bp atgaagactcttgctgtgttccctggtggaaatgtttccctgttggaaaccaagaccttc gaccctgtgggctacatttgtgaggatgaaaacataaattctttaactgggtcactggga ctgatagcaagtggggacactgctgctcccacaccatggattcatctcccagacatgttt acacaagttcttttgatcgggttgcaaccaaaagaaactgattttagtttctttgagcaa aacaggcctcttctagtaaattcactcaactgccactactttgggcaagccacttcattc agtctctggggtggtaggccgaccctccccacagccaagccatctcggggagcagagcag gagcccgtgcctcgcgcgttcctggttcctcagacacaaaagcctctaagtcccggcagc agccaccggatttcatggggacactccagtggcagggcctcggggcgggcctga >gi568815585f:49396002_49628662|GENSCAN_predicted_peptide_8|238_aa MARLQAASGGSCVALTTESETGTAIKGPAGSQGWIWLISRGGHSWVEETEEKGPEVVVPG GMTKKDLKSAVIHGDSQRGQHLASESVFGFSGILLICNPPQNSLEQKSVCLCFPRTLTTN KAFMIENESIGNILQEIFIQQMACGSCFPVFLSPRLRAPLASLAVSGQRDVFQVRLLTVG AGSWVWASCHGLLLGVQPQWGSFASLPPPTPLHLAAWTILAPPSSLSGEKTTHLRATR >gi568815585f:49396002_49628662|GENSCAN_predicted_CDS_8|717_bp atggccaggctgcaggctgcctctggaggtagctgcgtggcactgaccacagaatctgag acaggcaccgccatcaaggggccggctggcagtcaggggtggatctggcttatcagtaga ggaggacatagctgggtagaggaaactgaagagaagggaccagaagtggtggtgccaggt gggatgaccaagaaagaccttaaatcagcagtcatccatggtgacagccagaggggccag catcttgcctctgaatcagtatttggtttcagtggaatattgctgatctgtaaccctcca cagaactccctagaacagaaatctgtctgcctctgctttcctaggacgctgaccacaaat aaagctttcatgatagagaatgaatccataggaaacatcctgcaagaaatatttattcag caaatggcatgtggaagctgtttccctgtgttcctaagtcccaggctgcgagcaccactc gcctccctcgcggtgtctggacagcgggatgtcttccaggttcgtctcctcactgttggc gccgggtcctgggtgtgggcctcctgccacggactcctcctcggggtgcagccacagtgg ggatcctttgcatctttgccgcccccaacgcctctgcacttggctgcctggactattttg gccccacccagcagccttagtggagagaaaacaacgcacctgcgtgccacacgatga