GENSCAN 1.0 Date run: 4-Nov-116 Time: 13:58:19 Sequence gi568815585r:49434125_49667279 : 233155 bp : 42.25% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 21740 21821 82 1 1 61 92 77 0.907 6.58 1.02 Intr + 25983 26108 126 1 0 46 86 115 0.925 6.83 1.03 Term + 29897 29943 47 0 2 94 54 36 0.416 -2.81 1.04 PlyA + 30640 30645 6 1.05 2.00 Prom + 37982 38021 40 -1.05 2.01 Init + 42473 42915 443 1 2 53 84 199 0.846 11.60 2.02 Intr + 46095 46211 117 0 0 85 82 25 0.416 0.26 2.03 Intr + 46823 46992 170 2 2 74 103 47 0.770 3.47 2.04 Intr + 48613 48838 226 2 1 99 75 112 0.917 7.12 2.05 Intr + 49340 49439 100 2 1 84 53 36 0.729 -1.11 2.06 Intr + 54168 54506 339 2 0 -11 82 291 0.029 13.34 2.07 Intr + 58228 58316 89 0 2 22 84 31 0.202 -6.05 2.08 Intr + 61775 61971 197 2 2 30 82 136 0.100 5.34 2.09 Intr + 72496 72632 137 2 2 51 111 40 0.006 1.97 2.10 Intr + 78935 79042 108 1 0 70 111 141 0.994 14.16 2.11 Intr + 81719 81749 31 1 1 81 103 10 0.708 -1.41 2.12 Intr + 89961 90092 132 1 0 78 84 75 0.544 5.80 2.13 Term + 90789 90892 104 1 2 51 54 95 0.580 -0.14 2.14 PlyA + 91585 91590 6 1.05 3.00 Prom + 94129 94168 40 -3.75 3.01 Init + 97161 97261 101 2 2 97 23 111 0.460 5.28 3.02 Term + 99914 100103 190 2 1 67 38 135 0.416 2.34 3.03 PlyA + 100155 100160 6 1.05 4.14 PlyA - 100326 100321 6 1.05 4.13 Term - 103100 102991 110 0 2 77 43 95 0.795 1.59 4.12 Intr - 106089 105960 130 0 1 68 75 58 0.769 1.85 4.11 Intr - 106882 106752 131 2 2 98 78 51 0.953 4.59 4.10 Intr - 107703 107552 152 2 2 85 52 165 0.984 11.49 4.09 Intr - 110739 110613 127 1 1 83 68 160 0.995 12.32 4.08 Intr - 112494 112437 58 0 1 97 101 11 0.804 0.84 4.07 Intr - 112767 112713 55 2 1 14 105 39 0.257 -3.74 4.06 Intr - 117344 117202 143 2 2 89 88 94 0.915 7.73 4.05 Intr - 118161 118054 108 0 0 111 98 102 0.990 13.06 4.04 Intr - 121549 121391 159 1 0 136 78 148 0.999 17.86 4.03 Intr - 125960 125794 167 0 2 80 107 124 0.999 12.26 4.02 Intr - 132644 132494 151 2 1 54 116 126 0.996 10.81 4.01 Init - 133155 133030 126 0 0 78 94 120 0.877 11.83 4.00 Prom - 146026 145987 40 -2.35 5.04 PlyA - 146507 146502 6 1.05 5.03 Term - 151783 151611 173 2 2 92 53 106 0.249 4.51 5.02 Intr - 153220 153092 129 2 0 114 70 16 0.160 2.15 5.01 Init - 162168 161997 172 0 1 26 66 127 0.372 3.95 5.00 Prom - 168042 168003 40 -6.95 6.04 PlyA - 168544 168539 6 1.05 6.03 Term - 186665 186207 459 2 0 106 53 261 0.959 18.50 6.02 Intr - 187863 187712 152 1 2 61 44 116 0.340 3.46 6.01 Init - 191429 191324 106 2 1 87 39 58 0.080 1.24 6.00 Prom - 193827 193788 40 -9.75 7.00 Prom + 195282 195321 40 -5.05 7.01 Sngl + 196324 196914 591 0 0 113 42 604 0.834 54.24 7.02 PlyA + 198083 198088 6 1.05 8.00 Prom + 198793 198832 40 -8.05 8.01 Init + 199562 199661 100 1 1 62 79 57 0.586 2.67 8.02 Intr + 200527 200649 123 2 0 85 66 31 0.372 0.24 8.03 Intr + 201948 201990 43 1 1 89 109 66 0.752 5.18 8.04 Intr + 206888 206973 86 2 2 100 119 114 0.886 14.24 8.05 Intr + 218759 218947 189 0 0 69 86 75 0.077 4.04 8.06 Term + 223811 224082 272 0 2 -5 42 168 0.041 -2.44 8.07 PlyA + 224289 224294 6 1.05 9.04 PlyA - 224722 224717 6 1.05 9.03 Term - 225661 225427 235 0 1 111 43 129 0.796 5.51 9.02 Intr - 229071 228933 139 1 1 73 94 149 0.977 12.60 9.01 Init - 230375 230273 103 2 1 73 68 73 0.930 4.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 16558 16367 192 1 0 74 48 154 0.894 4.89 S.002 Intr + 54166 54506 341 2 2 27 82 295 0.884 17.07 S.003 Init + 78507 78512 6 2 0 87 94 0 0.847 1.53 S.004 Term + 208041 208117 77 1 2 106 44 58 0.903 0.22 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:49434125_49667279|GENSCAN_predicted_peptide_1|84_aa MAVQMIVITYVVSLKVYLDEIGGEDHNSDAKTFWMELEDDGKVDFIFEQVQNVLQSLKQK IKDGSATNKGASQKEVNAQSSGEI >gi568815585r:49434125_49667279|GENSCAN_predicted_CDS_1|255_bp atggctgtacaaatgattgtaataacttatgttgtttcattaaaggtgtacttagatgaa attggtggtgaagatcacaatagcgatgcaaaaactttctggatggagctagaagatgat ggaaaagtggacttcatttttgaacaagtacaaaatgtgctgcagtcactgaaacaaaag atcaaagatgggtctgccaccaataaaggagcatcacagaaagaagtgaatgcccaaagc agtggtgagatttga >gi568815585r:49434125_49667279|GENSCAN_predicted_peptide_2|730_aa MPLNLKGENPLQLPIKCHFQRRHAKTNSHSSALHVSYKTPCGRSLRNVEEVFRYLLETEC NFLFTDNFSFNTYVQLARNYPKQKEVVSDVDISNGVESVPISFCNEIDSRKLPQFKYRKT VWPRAYNLTNFSSMFTDSCDCSEGCIDITKCACLQLTARNAKTSPLSSDKITTGYKYKRL QRQIPTGIYECSLLCKCNRQLCQNRVVQHGPQVRLQVFKTEQKGWGVRCLDDIDRGTFVC IYSGRLLSRANTEKSYGIDENGRDENTMKNIFSKKRKLEVACSDCEVEVLPLGLETHPRT AKTEKCPPKFSNNPKELTVETKYDNISRIQYHSVIRDPESKTAIFQHNGKKMDSSSNHVD EFEDNLLIESDVIDITKYREETPPRSRCNQATTLDNQNIKKAIEVQIQKPQEGRSTACQR QQVFCDEELLSETKNTSSDSLTKFNKGNVFLLDATKEGNVGRFLNYFNTCWRSNIWIDKT LSHQPFEWGEGEALLDSKRQPRFKKTKPSAAGALPGSRYPAATRSCSTVMAQASPPRPER VLGASSPEARPAQEALLLPTGILLIGVFQVAEKMEKRTCALCPKDVEYNVLYFAQSENIA AHENCLLYSSGLVECEDQDPLNPDRSFDVESVKKEIQRGRKLKDKTQLLTLAYATVKVPF LKKCKEAGLLNYLLEEILDKVHSIPEKLMDETTSESEVSNRLATKRMCHSEIDSVTYAPL PPPCIPVAKI >gi568815585r:49434125_49667279|GENSCAN_predicted_CDS_2|2193_bp atgccactgaacttgaagggagaaaaccctctgcagctgccaatcaaatgtcacttccaa agacgacatgcaaagacaaactctcattcttcagcactccacgtgagttataaaacccct tgtggaaggagtctacgaaacgtggaggaagtttttcgttacctgcttgagacagagtgt aactttttatttacagataacttttctttcaatacctatgttcagttggctcggaattac ccaaagcaaaaagaagttgtttctgatgtggatattagcaatggagtggaatcagtgccc atttctttctgtaatgaaattgacagtagaaagctcccacagtttaagtacagaaagact gtgtggcctcgagcatataatctaaccaacttttccagcatgtttactgattcctgtgac tgctctgagggctgcatagacataacaaaatgtgcatgtcttcaactgacagcaaggaat gccaaaacttcccccttgtcaagtgacaaaataaccactggatataaatataaaagacta cagagacagattcctactggcatttatgaatgcagccttttgtgcaaatgtaatcgacaa ttgtgtcaaaaccgagttgtccaacatggtcctcaagtgaggttacaggtgttcaaaact gagcagaagggatggggtgtacgctgtctagatgacattgacagagggacatttgtttgc atttattcaggaagattactaagcagagctaacactgaaaaatcttatggtattgatgaa aacgggagagatgagaatactatgaaaaatatattttcaaaaaagaggaaattagaagtt gcatgttcagattgtgaagttgaagttctcccattaggattggaaacacatcctagaact gctaaaactgagaaatgtccaccaaagttcagtaataatcccaaggagcttactgtggaa acgaaatatgataatatttcaagaattcaatatcattcagttattagagatcctgaatcc aagacagccatttttcaacacaatgggaaaaaaatggactcaagttcaaaccatgttgat gagtttgaagataatctgctgattgaatcagatgtgatagatataactaaatatagagaa gaaactccaccaaggagcagatgtaaccaggcgaccacattggataatcagaatattaaa aaggcaattgaggttcaaattcagaaaccccaagagggacgatctacagcatgtcaaaga cagcaggtattttgtgatgaagagttgctaagtgaaaccaagaatacttcatctgattct ctaacaaagttcaataaagggaatgtgtttttattggatgccacaaaagaaggaaatgtc ggccgcttccttaattattttaacacttgttggagaagcaatatctggatcgataaaaca ctgtcccatcaaccatttgagtggggagagggagaagctcttcttgactcaaagcgacag cccagatttaagaaaacgaaacctagtgcagctggggcacttccgggatctcgctatccg gccgccacccgcagctgcagcacagtcatggcccaggcgtcgccgccccggcccgagagg gtgctcggcgccagcagcccggaggcccggcccgcgcaggaggcgctcctccttcccacc gggatattacttataggtgtctttcaggttgcagaaaagatggaaaaaaggacatgtgca ctctgccccaaagatgtcgaatataatgtcctatactttgcacaatcagagaatatagct gctcatgagaattgtttgctgtattcttcaggacttgtggaatgtgaggatcaggatcca cttaatcctgatagaagttttgatgtggaatcagtaaagaaagaaatccagagaggaagg aagttgaaagataaaacccaactccttactctggcatatgcaactgtgaaagttcctttt cttaagaaatgcaaggaagcaggacttcttaattacttacttgaagaaatattagacaaa gttcattcaattccagaaaaactcatggatgagactacttcagaatcagaggtgtctaac aggttggccactaagagaatgtgccattcagagattgattcggtcacatatgctcccctg ccaccgccctgcattcctgttgctaagatctga >gi568815585r:49434125_49667279|GENSCAN_predicted_peptide_3|96_aa MHLKQQSPTFLAPGTGFVEDNFSTDHGDGAVGQGITSPVEHKLDTSSTVPQSTHTEPSSL ALQFLKAPHLLALAMNSFSRGPSICQNAAVCVTSVK >gi568815585r:49434125_49667279|GENSCAN_predicted_CDS_3|291_bp atgcacctaaagcagcagtccccaacctttttggcaccagggactggtttcgtggaagac aatttttccacggaccatggggatggggcggttggtcagggaatcacatcacccgtagag cacaaactggacacatcctcaacagtgccccagagcactcacacagaacccagcagcctt gcgcttcagttcttaaaggctccacatttactggctttagcaatgaattcctttagcaga gggccatccatttgccaaaatgctgcagtctgtgtaacttctgtcaaatga >gi568815585r:49434125_49667279|GENSCAN_predicted_peptide_4|538_aa MVDVGKWPIFTLLSPQEIASIRKACVFGTSASEALYVTDNDEVFVFGLNYSNCLGTGDNQ STLVPKKLEGLCGKKIKSLSYGSGPHVLLSTEDGVVYAWGHNGYSQLGNGTTNQGIAPVQ VCTNLLIKQVVEVACGSHHSMALAADGEVFAWGYNNCGQVGSGSTANQPTPRKVTNCLHI KRVVGIACGQTSSMAVLDNGEVYGWGYNGNGQLGLGNNGNQLTPVRVAALHSVCVNQIVC GYAHTLALTDEGLLYAWGANTYGQLGTGNKNNLLSPAHIMVEKERPYACTGPWPVINWVA RQEMGPSSCRKTSSGLPLILHYEHEDFLTVAESLKKEFDSPETADLKFRIDGKYIHVHKA VLKIRCEHFRSMFQSYWNEDMKEVIEIDQFSYPVYRAFLQYLYTDTVDLPPEDAIGLLDL ATSYCENRLKKLCQHIIKRGITVENAFSLFSAAVRYDAECLAHIRHLINVELDEDSTLKE ALRMISELKAGMVGFTDSERIKVTLTTGWTELGVVRDLVSSGMCLVGSCDAEAPAHPG >gi568815585r:49434125_49667279|GENSCAN_predicted_CDS_4|1617_bp atggtggatgtcggaaagtggcccatcttcactctactctcccctcaagagatcgcgtct attcggaaggcgtgtgtcttcggcacctcagccagtgaagcactgtacgttactgacaat gatgaggtctttgtatttggactgaactatagtaactgtctaggaactggagataaccag agtacacttgtacccaaaaagctagaaggcttatgtggaaagaagattaaaagcctcagt tacgggagtggaccacatgttcttctcagcaccgaagatggagtggtttatgcctggggc cacaatggatatagccagcttgggaatgggacgaccaaccaaggcattgctcccgtccag gtctgtaccaatctcttgatcaagcaagtggtggaagtagcttgtggctcacatcattca atggctctggcagctgatggagaggtgtttgcttggggttataacaactgtggccaagtg ggatcaggttctacagcaaatcaaccaactcctcgaaaagttacaaactgtttacatatt aagagggtagttggcattgcctgtggtcagacttcatccatggctgttctggacaatggc gaggtatatggctggggttacaatggcaacggtcagctgggcctgggaaacaatggcaac cagctgacccctgtgagagtggcagctttgcacagcgtgtgtgtgaaccagattgtctgc ggttacgcacatactctagcactaacagatgagggcttgctgtatgcctggggagctaac acatatgggcagctgggaactggcaataaaaataacctgctaagcccagcacacatcatg gtggagaaagaaaggccatatgcctgtactggtccgtggcctgttattaactgggttgca cggcaggagatggggccgtccagttgtaggaaaacaagctcaggactcccactgattcta cattatgagcatgaagactttttaacagttgcagagtcactgaagaaagaatttgatagt ccagaaactgctgatctgaagtttcgaattgatggaaaatatattcatgtccataaagct gttttgaaaatcaggtgtgagcattttcgatccatgttccagtcgtattggaatgaagac atgaaggaagtgatagaaatcgatcagttttcttacccagtgtatcgtgcctttctccag tacctctacacagacacagtcgacctgccgccagaagatgctataggtcttctggatttg gcgacatcttactgtgaaaacagactgaaaaaactttgtcagcacattatcaagagagga attactgtggagaatgccttttcgctattctctgctgcagtcagatatgatgcagagtgc ctggcacatattaggcacctgataaatgttgaactggatgaagatagtaccttaaaggaa gcattaaggatgatcagtgagctgaaggcgggtatggttggtttcacagattcagagaga ataaaagttactttaacaacaggttggacagagttaggtgtggttcgtgatcttgtgtca tctggaatgtgtcttgtggggagctgtgatgccgaggcacctgctcaccctggttag >gi568815585r:49434125_49667279|GENSCAN_predicted_peptide_5|157_aa MKTLAVFPGGNVSLLETKTFDPVGYICEDENINSLTGSLGLIASGDTAAPTPWIHLPDMF TQVLLIGLQPKETDFSFFEQNRPLLVNSLNCHYFGQATSFSLWGGRPTLPTAKPSRGAEQ EPVPRAFLVPQTQKPLSPGSSHRISWGHSSGRASGRA >gi568815585r:49434125_49667279|GENSCAN_predicted_CDS_5|474_bp atgaagactcttgctgtgttccctggtggaaatgtttccctgttggaaaccaagaccttc gaccctgtgggctacatttgtgaggatgaaaacataaattctttaactgggtcactggga ctgatagcaagtggggacactgctgctcccacaccatggattcatctcccagacatgttt acacaagttcttttgatcgggttgcaaccaaaagaaactgattttagtttctttgagcaa aacaggcctcttctagtaaattcactcaactgccactactttgggcaagccacttcattc agtctctggggtggtaggccgaccctccccacagccaagccatctcggggagcagagcag gagcccgtgcctcgcgcgttcctggttcctcagacacaaaagcctctaagtcccggcagc agccaccggatttcatggggacactccagtggcagggcctcggggcgggcctga >gi568815585r:49434125_49667279|GENSCAN_predicted_peptide_6|238_aa MARLQAASGGSCVALTTESETGTAIKGPAGSQGWIWLISRGGHSWVEETEEKGPEVVVPG GMTKKDLKSAVIHGDSQRGQHLASESVFGFSGILLICNPPQNSLEQKSVCLCFPRTLTTN KAFMIENESIGNILQEIFIQQMACGSCFPVFLSPRLRAPLASLAVSGQRDVFQVRLLTVG AGSWVWASCHGLLLGVQPQWGSFASLPPPTPLHLAAWTILAPPSSLSGEKTTHLRATR >gi568815585r:49434125_49667279|GENSCAN_predicted_CDS_6|717_bp atggccaggctgcaggctgcctctggaggtagctgcgtggcactgaccacagaatctgag acaggcaccgccatcaaggggccggctggcagtcaggggtggatctggcttatcagtaga ggaggacatagctgggtagaggaaactgaagagaagggaccagaagtggtggtgccaggt gggatgaccaagaaagaccttaaatcagcagtcatccatggtgacagccagaggggccag catcttgcctctgaatcagtatttggtttcagtggaatattgctgatctgtaaccctcca cagaactccctagaacagaaatctgtctgcctctgctttcctaggacgctgaccacaaat aaagctttcatgatagagaatgaatccataggaaacatcctgcaagaaatatttattcag caaatggcatgtggaagctgtttccctgtgttcctaagtcccaggctgcgagcaccactc gcctccctcgcggtgtctggacagcgggatgtcttccaggttcgtctcctcactgttggc gccgggtcctgggtgtgggcctcctgccacggactcctcctcggggtgcagccacagtgg ggatcctttgcatctttgccgcccccaacgcctctgcacttggctgcctggactattttg gccccacccagcagccttagtggagagaaaacaacgcacctgcgtgccacacgatga >gi568815585r:49434125_49667279|GENSCAN_predicted_peptide_7|196_aa MGSVNSRGHKAEAQVVMMGLDSAGKTTLLYKLKGHQLVETLPTVGFNVEPLKAPGHVSLT LWDVGGQAPLRASWKDYLEGTDILVYVLDSTDEARLPESAAELTEVLNDPNMAGVPFLVL ANKQEAPDALPLLKIRNRLSLERFQDHCWELRGCSALTGEGLPEALQSLWSLLKSRSCMC LQARAHGAERGDSKRS >gi568815585r:49434125_49667279|GENSCAN_predicted_CDS_7|591_bp atgggttctgtgaattccagaggtcacaaggcggaagcccaggtggtgatgatgggcctg gactcggcgggcaagaccacgctcctttacaagctgaagggccaccagctggtggagacc ctgcccactgttggtttcaacgtggagcctctgaaagctcctgggcacgtgtcactgact ctctgggacgttggggggcaggccccgctcagagccagctggaaggactatctggaaggc acagatatcctcgtgtacgtgctggacagcacagatgaagcccgcttacccgagtcggcg gctgagctcacagaagtcctgaacgaccccaacatggctggcgtccccttcttggtgctg gccaacaagcaggaggcacctgatgcacttccgctgcttaagatcagaaacaggctgagt ctagagagattccaggaccactgctgggagctccggggctgcagtgccctcactggggag gggctgcccgaggccctgcagagcctgtggagcctcctgaaatctcgcagctgcatgtgt ctgcaggcgagagcccatggggctgagcgcggagacagcaagagatcttga >gi568815585r:49434125_49667279|GENSCAN_predicted_peptide_8|270_aa MTEGLECHELEKVSAEEPGWDGAGTAGFSFCRVGALISAQSLEPKFGSSAISMGSRFPAY TYCFGCGFSLPFLPRGTEHQITRIELLSRPKLLQSEYWKAQHEPLHALRMHPPPAKAGPP DVSVKGIMDIGDQMSGQGLNVWRLFKPGALRGKPVKSVSCFQFLAIRSYPQGLKSVPLPK VTTRDWVIYKKRGLIGSGFYRLYRRHGWGKPQETYNHGGRQGEAGTSYMGGARGRERERD IATHFTTDLVRALSWDSTRGMVLNHEKPPP >gi568815585r:49434125_49667279|GENSCAN_predicted_CDS_8|813_bp atgacagaaggactggaatgccatgaactggagaaggtgagcgctgaagaaccaggatgg gacggggctggaacagctgggttcagcttttgcagggtgggtgctttaatttcagcccag tctctagagcctaaatttggctcctctgcaatttccatgggctctaggtttccagcttat acttactgctttggctgtggtttctctcttccatttctaccacgagggactgaacatcaa atcactcgcatcgaattactcagcagaccaaagctcttgcagtctgagtactggaaagcc cagcatgagcctctccacgcacttcgaatgcaccctcctccagcaaaggctgggccacca gatgtcagtgttaaagggatcatggatattggtgatcagatgtcaggacagggactgaat gtatggagactgttcaaaccaggggcgctcaggggcaaacctgtgaaatcggtttcctgc ttccagttcttagcaattagatcctacccccagggactgaaatcagtcccactcccaaaa gtaactacccgagactgggtaatttataagaaaagaggtttaattggctcagggttctac aggctgtataggaggcatggctgggggaagcctcaggaaacttacaatcatggcggaagg caaggtgaagccggcacttcctacatgggcggagcaagaggaagagagcgcgaaagggac atcgctacacactttacaacagatctcgtgagagctctatcatgggacagcactagggga atggtgctaaaccatgagaaaccacccccatga >gi568815585r:49434125_49667279|GENSCAN_predicted_peptide_9|158_aa MVFLRRNHSDNGSGWISESTLAPQAAQLRLCESHWKEYGKADARWVYFDPTIVSVEILTV ALDGSLALFLIYAIVKEKYYRQVRSWKCESLALLYQVSNSLGIKRQSWREQQVCGLSSPS GGLTGSAALGGGPFCGHPVVTPKSLAKHLYPVTRQFDS >gi568815585r:49434125_49667279|GENSCAN_predicted_CDS_9|477_bp atggtgttcctgagaagaaaccactctgataatgggtctggatggatctctgagagcaca ctggctccccaagcagcacagctgagactgtgtgagagccattggaaagaatatggcaaa gctgatgcaagatgggtttattttgatccaaccattgtgtctgtggaaattctgaccgtc gccctggatgggtctctggcattgttcctcatttatgccatagtcaaagaaaaatattac cgacaagtaaggtcatggaaatgtgagagtttggcattactataccaagtgtccaactcc ctgggtatcaagaggcagtcatggcgggagcagcaggtctgtggtctgagtagccccagt ggtggcctcactggtagtgctgcccttgggggtggtccattctgtggtcatcccgtagtc acccccaagagcttagccaagcatctatatcctgtaacccggcagtttgactcctag