GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:07:13

Sequence gi568815591f:139260262_139522337 : 262076 bp : 43.57% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    991   1480  490  1  1   86   86   278 0.808  20.41
 1.02 Intr +   6072   6142   71  2  2  106   83    21 0.935   1.38
 1.03 Intr +   9133   9262  130  1  1   44  110    77 0.902   6.20
 1.04 Intr +  12061  12236  176  0  2  104   39   103 0.377   5.74
 1.05 Intr +  13036  13149  114  1  0   62  110   156 0.990  14.76
 1.06 Intr +  13670  13813  144  2  0   40  116    53 0.854   2.70
 1.07 Intr +  19057  19099   43  1  1   54  115    28 0.714   0.34
 1.08 Intr +  22763  24167 1405  1  1   16   54   406 0.353  18.85
 1.09 Term +  61544  61608   65  0  2  130   47    80 0.057   6.15
 1.10 PlyA +  62032  62037    6                               1.05

 2.00 Prom +  62090  62129   40                              -3.06
 2.01 Init +  79929  80256  328  2  1   93   65   199 0.232  15.60
 2.02 Intr +  81161  81261  101  0  2   -1   63   194 0.105   7.83
 2.03 Intr +  85240  85392  153  0  0   82   60   117 0.891   8.57
 2.04 Intr +  99944 100061  118  1  1   89  127   116 0.103  15.64
 2.05 Intr + 107558 107628   71  0  2   60   67    21 0.004  -3.90
 2.06 Intr + 115801 115895   95  0  2   80   40   109 0.161   4.06
 2.07 Intr + 134015 134066   52  2  1   42   64    60 0.118  -2.09
 2.08 Intr + 138338 138436   99  1  0  121   42    62 0.568   5.21
 2.09 Intr + 141876 141986  111  2  0   66  116    92 0.749  10.38
 2.10 Intr + 145383 145526  144  2  0  119   84   125 0.999  15.68
 2.11 Intr + 146913 147089  177  2  0   63   83   120 0.992   9.12
 2.12 Term + 149302 149442  141  0  0   64   42    89 0.888  -0.17
 2.13 PlyA + 149622 149627    6                               1.05

 3.08 PlyA - 152815 152810    6                               1.05
 3.07 Term - 157434 157216  219  0  0   79   42   280 0.780  19.54
 3.06 Intr - 167301 167278   24  0  0  105   64    26 0.100   0.12
 3.05 Intr - 193446 193350   97  2  1   96   17   159 0.101   9.61
 3.04 Intr - 193953 193850  104  0  2  106   68     0 0.461  -1.23
 3.03 Intr - 219511 219366  146  2  2   85   94    25 0.791   2.80
 3.02 Intr - 219986 219885  102  0  0  105   80   131 0.839  14.15
 3.01 Init - 223381 222625  757  1  1   81  100   915 0.434  84.87
 3.00 Prom - 224496 224457   40                              -3.16

 4.03 PlyA - 228038 228033    6                               1.05
 4.02 Term - 229821 229649  173  1  2   34   36   148 0.734   2.19
 4.01 Init - 234780 234618  163  0  1   57   44   164 0.838   8.79
 4.00 Prom - 238025 237986   40                              -3.06

 5.00 Prom + 238215 238254   40                              -8.06
 5.01 Init + 242193 242376  184  2  1  100   55   153 0.941  12.48
 5.02 Intr + 242467 242655  189  2  0    0   -1   244 0.053   6.26
 5.03 Intr + 243030 243545  516  1  0  -53   38   336 0.007   7.43
 5.04 Term + 252945 253105  161  1  2  110   47    90 0.609   5.20
 5.05 PlyA + 254769 254774    6                               1.05

 6.02 PlyA - 254872 254867    6                               1.05
 6.01 Term - 258797 258680  118  1  1   53   47   145 0.809   4.91

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  99923  99456  468  2  0   72   42   231 0.815  13.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:139260262_139522337|GENSCAN_predicted_peptide_1|879_aa
XVVALNSHKSEKKKKRYKDSLSLAAMIRKFQKEKDALKKESNPKVPVTLSTPSLNKPPCA
AAALGNDVPDLNLSSGDPDLPIFVSTNEHELFQEAENALEMLDDFDFDRLLDAASDGSPL
SESGGENGTTTQPTYTSQVMPKVVPTLPEGLPVLLEKRIEDLRVAAKLFDEEGRKKFFTQ
DMNNILLDIELQLQELGPVIRSGVYSHLEAFVPCNKETLVKRLKKLHLNVQDDRLREPLQ
KLKLAVSNVMPEQLFKYQEDCQARSQAKCAKYVSLLFGLAFAFEKCMYVILQTDEEREKN
GSEEDDDEKPGKRVIGPRKKFHWDDTIRTLLCNLVEIKLGCYELEPNKSQSAEDYLKSFM
ETEVKPLWPKGWMQARAKKKVIPAPKPKVKECSPKKDQKTPTSLVASVSGPPTSSSTAAI
AAASSSSAPAQETICLDDSLDEDLSFHSPSLDLVSEALAVINNGNKGPPVGSRISMPTTK
PRPGLREEKLASIMSKLPLATPKKLDSTQTTHSSSLIAGHTGPVPKKPQDLAHTGISSGL
IAGSSIQNPKVSLEPLPARLLQQGLQRSSQIHTSSSSQTHVSSSSQAQIAASSHALGTSE
AQDASSLTQVTKVHQHSAVQQNYVSPLQATISKSQTNPVVKLSNNPQLSCSSSLIKTSDK
PLMYRLPLSTPSPGNGSQGSHPLVSRTVPSTTTSSNYLAKAMVSQISTQGFKSPFSMAAS
PKLAASPKPATSPKPLPSPKPSASPKPSLSAKPSVSTKLISKSNPTPKPTVSPSSSSPNA
LVAQGSHSSTNSPVHKQPSGMNISRQSPTLNLLPSSRTSGLPPTKNLQAPSKLTNSSSTG
TVGKNSLSGIAMNVPASRGQQQARDSSQQGRRRQYEKEK

>gi568815591f:139260262_139522337|GENSCAN_predicted_CDS_1|2640_bp
nnagttgtggctctaaattcacacaagtctgaaaaaaagaagaaacgttataaagattct
ctttctctagctgccatgattagaaaattccagaaagagaaggatgcattaaagaaggag
tctaaccccaaagtcccagtgaccttgtcaaccccttctctgaataaacccccatgtgct
gctgcagcactggggaatgacgtcccggacttaaatctgagcagcggtgatccagacctt
cccatttttgttagcacaaatgaacatgagctgtttcaggaagctgaaaatgccctagag
atgctagatgattttgacttcgacagattactggatgctgcttctgatggtagcccccta
tctgagtcggggggtgaaaatggaaccaccacccagccaacctacacttctcaggttatg
cccaaagtggtacctacactcccagagggtctacctgtacttcttgaaaaacgtatcgaa
gaccttcgtgtagctgccaaactttttgatgaagaaggaaggaaaaaattctttacacag
gatatgaataatattcttctggacattgagttacagctacaagaactaggccctgtcatt
cgcagtggtgtctactcccaccttgaagcttttgtgccatgcaataaagaaacactagta
aaacgtctgaagaagttacatctcaatgtccaggatgatcgtttaagagaacctctgcaa
aaactgaaactggctgttagcaatgtcatgcctgaacagctatttaaataccaggaggac
tgccaggctcgtagtcaagctaagtgtgccaagtatgtatctcttctatttggactggct
tttgcatttgagaagtgtatgtacgttatattgcagacagatgaagaacgagaaaaaaat
ggatctgaagaggatgatgatgagaaaccaggaaaacgtgtcataggaccaagaaagaaa
ttccactgggatgacactatcagaactttgttatgtaaccttgttgagatcaaattggga
tgctatgagttagaaccaaataaaagccagtctgctgaagattatcttaagtcttttatg
gagacagaagtgaagcccctgtggcctaagggctggatgcaggcaagggcaaagaaaaag
gtgattcctgcacctaaacccaaagtaaaggagtgtagtccaaaaaaggaccagaaaact
ccaacatccctggtggcttcggttagcggtcctccaacgagctccagcacagctgccatt
gctgcagctagctctagctctgcaccagcccaagaaaccatctgcctcgacgactcacta
gatgaagacctttctttccattcaccttcactggatcttgtttctgaagctttagcggtt
atcaacaatgggaacaagggccctccagttggctcaaggataagcatgccaaccacaaag
cctcgtccaggactgagagaagaaaaattagcaagtatcatgagtaagctgccactagct
actcccaaaaaactagattctactcagactacacattcttcaagtcttattgctggtcac
acagggccagtaccaaagaaaccccaggatttagctcatactggcatctcttcaggcctt
attgctggttcttccattcagaaccctaaagtttctttagaacctttgccagccaggcta
cttcaacaaggacttcagaggtcaagccagattcacacttcttcctcttcacagacccat
gtctcctcttcttcccaagcccaaattgctgcctcttctcatgctctgggaacatccgag
gcccaagatgcttcttcgttaacacaagtaacaaaggtgcaccagcattcagctgtccag
cagaactatgtgtctccattacaggccaccatcagtaaatcccagaccaaccccgtcgtg
aagttaagtaataatccccaactctcctgttcctcctcacttattaagacttcagataag
ccacttatgtaccgccttcccttatctaccccctcacctggaaatggttctcaagggtcc
caccccctggtttctaggacagtacctagcaccactacctccagtaactatttagccaag
gctatggtgtcacagatctccacgcagggtttcaaatctcccttctcgatggctgcctcc
ccaaaacttgccgcatctcccaagcctgccacatctcctaaacccctgccctcgcctaag
ccttctgcctcacccaagccctctctgtcagctaagccttcagtatcaactaaacttatt
tctaaatccaacccaactcccaagcctactgtatccccaagtagttccagtccaaatgca
ctagttgcccagggtagccactccagcactaacagcccagtccataaacagcccagtgga
atgaacatcagcagacagtctcccaccttgaatttattgccctctagtcgcacttcaggc
cttccacctacaaaaaatcttcaggccccctcaaagctaacaaactcatcatccactgga
actgttgggaagaacagcttgagtggaattgcaatgaatgtacctgccagcagaggacag
cagcaggccagggacagcagccagcaaggccggaggaggcagtatgagaaagagaaatga

>gi568815591f:139260262_139522337|GENSCAN_predicted_peptide_2|529_aa
MASTLQPEAEIDRGRRHLTAAGAKLLLEIRNTGKGPAASGKNPPELTFRGRVSKIRPICP
ARCAPRRRMPAFAQARGLRRRRGDYCAGKRVRRSVRAPVQRPEHRCSFEGLLRELRYLSA
ATGRPYRDTAAYRYLVKAFRAHRVTSEKLCRAQHELHFQAATYLCLLRSIRKHVALHQEF
HGKGERSVEESAGLPKRARSAPHPRPSARYAAAMSAQAQMRAMLDQLMGTSRDVLWSTGV
SVSAGFHQLVPQWECEEEIQLVNESNSVMTEYARVTFSTVVLMMSFLELHIVDKTEESLP
YGAHILRMDLGECLKVHDLALRADYEIASKEQDFFFELDAMDHLQSFIADCDRRTEVAKK
RLAETQEEISAEVAAKAERVHELNEEIGKLLAKVEQLGAEGNVEESQKVMDEVEKARAKK
REAEEVYRNSMPASSFQQQKLRVCEVCSAYLGLHDNDRRLADHFGGKLHLGFIEIREKLE
ELKRVVAEKQEKRNQERLKRREEREREEREKLRRYGVMGQVIEVESLWF

>gi568815591f:139260262_139522337|GENSCAN_predicted_CDS_2|1590_bp
atggcctcaacgcttcaaccggaagcagaaattgaccgcggcaggcgccatctaacggcc
gctggagcaaagctcctcctggaaattcgcaacaccggaaaaggtccggctgcttccggt
aaaaacccaccagagctgacgttcagagggcgagtctcgaagatccggccaatttgccca
gcgcgctgtgctccgcgacggcgcatgcccgcttttgcgcaggcgcggggactacggcgc
aggcgcggagactattgcgcaggcaagcgcgtacgcagaagcgtgcgcgcgcccgttcaa
cgtccggagcatcggtgcagtttcgagggacttctgcgggagttgcgctacctgagcgcg
gccaccggccgaccctatcgcgacaccgcggcctatcggtaccttgtgaaggctttccgt
gcacatcgggtcaccagtgaaaagttgtgcagagcccaacatgagcttcatttccaagct
gccacctatctctgcctcctgcgtagcatccggaaacatgtggccctacatcaggaattt
catggcaagggtgagcgctcggtggaggagtctgctggcttgcccaaaagggcccggtct
gcgccccacccccgcccgtccgcccgctacgccgccgccatgtcggcgcaggcccagatg
cgcgcgatgctggaccagttgatgggcacctcccgggacgtgctttggtccacgggggtc
agtgtttctgcaggatttcaccaactggttccacagtgggaatgtgaagaggagatacaa
ctcgtcaacgaatcaaattcagtgatgacagagtatgcaagagtcaccttctcaactgtt
gtcctcatgatgtcctttctggaactgcacatcgtagataaaacagaagaatccttgccc
tatggagctcacattttaagaatggatcttggagaatgtctgaaagtccatgacctggct
ttaagagcggattatgaaattgcatccaaagaacaagattttttctttgaacttgatgcc
atggatcatctgcagtcattcattgcagattgtgatcgtagaacagaagtggccaagaaa
agattagcagaaactcaagaagagattagtgctgaagtagcagcaaaggcagaacgtgtt
catgagttaaatgaagaaattggtaaattgttagccaaggtggaacaactaggagctgaa
gggaatgtggaggaatcccagaaagtaatggatgaagtagagaaagcacgggcaaagaaa
agagaagcagaggaagtttatcggaattctatgccagcttccagttttcagcagcagaaa
cttcgagtctgtgaagtctgctctgcctatttaggacttcatgataatgacagacgactg
gctgatcattttgggggtaaactgcacctgggatttattgaaataagagagaagcttgaa
gaattaaagagagtcgtagctgagaagcaggagaaaagaaaccaggaacggctgaaacga
agagaagagagagagagagaagaaagggagaagctgaggaggtatggagtaatgggccaa
gtaattgaagttgaatctctgtggttctaa

>gi568815591f:139260262_139522337|GENSCAN_predicted_peptide_3|482_aa
MEESWEAAPGGQAGAELPMEPVGSLVPTLEQPQVPAKVRQPEGPESSPSPAGAVEKAAGA
GLEPSSKKKPPSPRPGSPRVPPLSLGYGVCPEPPSPGPALVKLPRNGEAPGAEPAPSAWA
PMELQVDVRVKPVGAAGGSSTPSPRPSTRFLKVPVPESPAFSRHADPAHQLLLRAPSQGG
TWGRRSPLAAARTESGCDAEGRASPAEGSAGSPGSPTCCRCKELGLEKEDAALLPRAGLD
GDEKLPRAVTLTGLPMYVKSLYWALAFMAVLLAVSGVVIVVLASRAGARCQQCPPGWVLS
EEHCYYFSAEAQAWEASQAFCSAYHATLPLLSHTQDFLGRYPVSRHSWVGAWRGPQGWHW
IDEAPLPPQLLPEDGEDNLDINCGALEEGTLVAANCSTPRPWVANSIQRQNCADFSGGYG
CGYGCWSGTCGDGVSPEIWIEFSSCVHVTWSEIDVCDALWTWIYQTRFDFVSTFVITIFL
IY

>gi568815591f:139260262_139522337|GENSCAN_predicted_CDS_3|1449_bp
atggaggagtcttgggaggctgcgcccggaggccaagccggggcagagctcccaatggag
cccgtgggaagcctggtccccacgctggagcagccgcaggtgcccgcgaaggtgcgacaa
cctgaaggtcccgaaagcagcccaagtccggccggggccgtggagaaggcggcgggcgca
ggcctggagccctcgagcaagaaaaagccgccttcgcctcgccccgggtccccgcgcgtg
ccgccgctcagcctgggctacggggtctgccccgagccgccgtcaccgggccctgccttg
gtcaagctgccccggaatggcgaggcgcccggggctgagcctgcgcccagcgcctgggcg
cccatggagctgcaggtagatgtgcgcgtgaagcccgtgggcgcggccggtggcagcagc
acgccatcgcccaggccctccacgcgcttcctcaaggtgccggtgcccgagtcccctgcc
ttctcccgccacgcggacccggcgcaccagctcctgctgcgcgcaccatcccagggcggc
acgtggggccgccgctcgccgctggctgcagcccggacggagagcggctgcgacgcagag
ggccgggccagccccgcggaaggaagcgccggctccccgggctcccccacgtgctgccgc
tgcaaggagctggggctggagaaggaggatgcggcgctgttgccccgcgcggggttggac
ggcgacgagaagctgccccgggccgtaacgcttacggggctacccatgtacgtgaagtcc
ctgtactgggccctggcgttcatggctgtgctcctggcagtctctggggttgtcattgtg
gtcctggcctcaagagcaggagccagatgccagcagtgccccccaggctgggtgttgtcc
gaggagcactgttactacttctctgcagaagcgcaggcctgggaagccagccaggctttc
tgctcagcctaccacgctaccctccccctgctaagccacacccaggacttcctgggcaga
tacccagtctccaggcactcctgggtgggggcctggcgaggcccccagggctggcactgg
atcgacgaggccccactcccgccccagctactccctgaggacggcgaggacaatctggat
atcaactgtggggccctggaggaaggcacgctggtggctgcaaactgcagcactccaaga
ccctgggtagccaacagcatccagagacagaactgtgccgacttctctggtggctacggc
tgcggctacggctgctggagcgggacctgtggcgatggcgtttctcccgagatttggatc
gagttctcctcttgcgttcacgtgacatggagcgagatcgatgtctgcgatgctctctgg
acctggatctaccagacaagatttgattttgtttctacctttgtaattactatatttctc
atttattaa

>gi568815591f:139260262_139522337|GENSCAN_predicted_peptide_4|111_aa
MPVGGGPESVGRCNGCQCHIKGKGIYILNSERPVPGDYIYIRKKKQQNSDPQPKTHDLHG
LNQRETGDETFGEPGGLGEHKAAKGEKRCVWEPELPNDMLAFPLRSRAPLH

>gi568815591f:139260262_139522337|GENSCAN_predicted_CDS_4|336_bp
atgcctgtagggggtggccctgagagtgtgggcaggtgcaatggctgtcaatgccacata
aagggcaaggggatctacatcctaaacagtgaaagaccagtgcccggagactacatctac
atcaggaagaagaagcagcaaaattctgacccacagcccaagacccatgacctgcatgga
ctcaatcagcgggagactggggatgaaacattcggggaaccaggaggactaggggagcat
aaagcagcaaaaggggagaaaagatgtgtttgggagccggagctgcccaatgacatgctg
gctttcccacttaggagccgtgcacccctacactag

>gi568815591f:139260262_139522337|GENSCAN_predicted_peptide_5|349_aa
MDPRKVNELRAFVKTCKQDPSVPHAEKMHFLRERVESMAGKVPPATQKAKSEENTKEEKP
DSTDAPQEMGDENAEITEEMMDQANEKVAAIEALNDGELQTAIDLFPDAIKLNPHLAILY
AKTAAQPYKCREKAHRLSLQLDYDEDASAMLKEVQPGAQKIAEHRRKYERKREEQEIKER
IERVKKAQEEHERAQREEEARRQSGAQYCSFPSGFPAGVPGNCPRRMSGMGGGMAGMARI
PGLNEILSDPEILAAMQDPEIMLAFQDVAQDPANMSKYQRNTKTMHLISRLSAKFGEKSL
EQLRETFDHEKLSADSNHLMCVDQELTQPRLKSWAIISLTTFPTVGSGI

>gi568815591f:139260262_139522337|GENSCAN_predicted_CDS_5|1050_bp
atggacccccgcaaagtgaacgagcttcgggcctttgtgaaaacgtgtaagcaggatccg
agcgttccgcacgccgagaaaatgcatttcctaagggagagggtggagagcatggcgggt
aaagtaccacctgctactcagaaagctaaatcagaagaaaataccaaggaagaaaaacct
gatagcactgatgcccctcaagaaatgggagatgagaatgcagagataacagaggagatg
atggatcaggcaaatgagaaagtggctgctattgaagccctaaatgatggtgaactgcag
acagccattgacttgttcccagatgccatcaagctgaatcctcacttggccattttgtat
gccaagacggctgctcagccttacaagtgtcgagagaaagcacacagactttcccttcaa
ttggattatgatgaagatgctagtgcaatgctgaaagaagttcaacctggggcacagaaa
attgcagaacatcggagaaagtatgagcgaaaacgtgaagaacaagagatcaaagaaaga
atagaaagagttaagaaggctcaagaagagcatgagagagcccagagggaggaagaagcc
agaagacagtcaggagctcagtactgctcttttccaagtggctttcctgcgggagtgcct
ggtaattgtcccagaagaatgtctggaatgggagggggcatggctggaatggccagaatc
cccggactcaatgaaattcttagtgatccagagattcttgcagccatgcaggatccagaa
attatgttagccttccaggatgtggctcaggacccagcaaacatgtcaaaataccagcgc
aacacaaagactatgcatcttatcagtagattgtcagccaaatttggagaaaaatcattg
gagcagctgagggaaacatttgaccatgagaaattatctgctgattcaaaccacttgatg
tgcgtggaccaagaactcacccagccaaggctcaagtcctgggctataatatcactgacg
acatttcccacagttggctctgggatctga

>gi568815591f:139260262_139522337|GENSCAN_predicted_peptide_6|39_aa
XVCFGGNPKDNDPYGCERVHFLKKEVLMKMDKTHQDPPP

>gi568815591f:139260262_139522337|GENSCAN_predicted_CDS_6|120_bp
nnggtatgttttggggggaacccaaaagacaatgacccttacggctgtgaaagggtccac
ttcctcaagaaggaggtgctcatgaaaatggataaaacccatcaagaccccccgccctag