GENSCAN 1.0 Date run: 4-Nov-116 Time: 13:06:56 Sequence gi568815576f:45068178_45284652 : 216475 bp : 48.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1151 1234 84 1 0 68 115 62 0.804 7.72 1.02 Term + 12071 12343 273 1 0 58 37 1025 0.999 89.87 1.03 PlyA + 14579 14584 6 1.05 2.05 PlyA - 14982 14977 6 1.05 2.04 Term - 15768 15677 92 1 2 69 51 85 0.161 0.78 2.03 Intr - 40277 40219 59 1 2 95 85 83 0.748 7.23 2.02 Intr - 41200 41067 134 1 2 82 90 49 0.698 3.94 2.01 Init - 58139 58041 99 2 0 83 42 59 0.094 1.06 2.00 Prom - 69849 69810 40 -5.56 3.03 PlyA - 72014 72009 6 1.05 3.02 Term - 73292 73273 20 0 2 125 36 33 0.934 0.28 3.01 Init - 74382 74244 139 0 1 24 94 123 0.871 4.80 3.00 Prom - 77952 77913 40 -4.36 4.00 Prom + 86019 86058 40 -2.96 4.01 Init + 91969 91988 20 0 2 98 103 6 0.506 1.74 4.02 Intr + 95435 95626 192 2 0 28 63 173 0.386 7.41 4.03 Intr + 95918 96119 202 2 1 47 78 173 0.349 11.49 4.04 Intr + 103423 103506 84 0 0 98 81 49 0.973 5.22 4.05 Intr + 107717 107903 187 1 1 79 111 118 0.741 12.46 4.06 Intr + 110061 110723 663 1 0 60 115 271 0.987 18.74 4.07 Intr + 113109 113190 82 1 1 36 69 104 0.998 2.00 4.08 Intr + 115225 115343 119 1 2 112 80 67 0.991 8.41 4.09 Term + 116276 116478 203 0 2 60 47 241 0.856 14.75 4.10 PlyA + 116622 116627 6 1.05 5.00 Prom + 117222 117261 40 -6.06 5.01 Init + 117568 117613 46 0 1 93 93 28 0.758 4.65 5.02 Term + 123117 123403 287 1 2 19 38 200 0.773 3.67 5.03 PlyA + 124199 124204 6 1.05 6.22 PlyA - 126458 126453 6 1.05 6.21 Term - 128470 128254 217 0 1 75 55 188 0.931 10.82 6.20 Intr - 129460 129361 100 2 1 61 89 -23 0.901 -5.73 6.19 Intr - 129771 129613 159 1 0 88 95 131 0.950 13.76 6.18 Intr - 131858 131696 163 2 1 99 45 277 0.978 24.05 6.17 Intr - 135007 134813 195 1 0 99 90 305 0.996 31.41 6.16 Intr - 135808 135668 141 1 0 87 100 307 0.998 32.35 6.15 Intr - 137141 137040 102 2 0 89 83 188 0.999 18.77 6.14 Intr - 137530 137453 78 1 0 119 99 62 0.997 10.15 6.13 Intr - 137735 137616 120 2 0 103 86 215 0.999 23.49 6.12 Intr - 143930 143779 152 0 2 121 113 269 0.915 32.58 6.11 Intr - 144426 144335 92 2 2 88 77 30 0.457 1.54 6.10 Intr - 149762 149477 286 0 1 73 55 81 0.014 -0.30 6.09 Intr - 155713 155390 324 2 0 60 21 211 0.005 7.35 6.08 Intr - 158206 158134 73 1 1 117 49 67 0.038 4.68 6.07 Intr - 164850 164774 77 1 2 69 35 80 0.008 0.03 6.06 Intr - 172648 172463 186 2 0 -20 109 190 0.872 9.96 6.05 Intr - 174534 174499 36 1 0 97 66 33 0.421 0.23 6.04 Intr - 183144 183060 85 0 1 54 105 35 0.036 1.19 6.03 Intr - 183368 183235 134 0 2 48 64 73 0.027 1.26 6.02 Intr - 186830 186611 220 2 1 60 57 131 0.216 5.07 6.01 Init - 188219 188169 51 2 0 47 110 62 0.945 5.46 6.00 Prom - 191237 191198 40 -5.56 7.00 Prom + 192672 192711 40 -9.36 7.01 Init + 194778 194872 95 2 2 60 35 121 0.432 1.95 7.02 Intr + 194896 195116 221 1 2 34 89 102 0.361 2.85 7.03 Intr + 200242 200262 21 2 0 140 81 2 0.790 2.22 7.04 Intr + 200513 200689 177 0 0 98 53 98 0.480 7.19 7.05 Intr + 207393 207659 267 1 0 80 47 149 0.253 7.50 7.06 Term + 211890 211894 5 1 2 127 52 0 0.373 -2.63 7.07 PlyA + 213235 213240 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100069 69 1 0 99 83 86 0.955 10.25 S.002 Intr + 157831 157937 107 2 2 72 82 71 0.838 4.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:45068178_45284652|GENSCAN_predicted_peptide_1|118_aa MIQEYAGIPAKELAKVWDLCSAIQALCMTQTITIIIITIIIIIIITILITITIIITIITI IIIIIITIIIITIIIITIITIIITVITIITIIIIIIIIIIIITIIIITIIIIIITIMK >gi568815576f:45068178_45284652|GENSCAN_predicted_CDS_1|357_bp atgattcaggaatatgctggaatacctgcaaaggagctggcaaaggtatgggacctgtgc tcagctatccaggcactttgcatgacccaaaccatcaccatcatcatcatcaccatcatc atcatcatcatcatcaccatcctcatcaccatcaccatcatcatcaccatcatcaccatc atcatcatcattatcatcaccatcatcatcatcaccatcatcatcatcaccatcatcacc atcatcatcaccgtcatcaccatcatcaccatcatcatcatcatcatcatcatcatcatc atcatcaccatcatcatcatcaccatcatcatcatcatcataacaataatgaaataa >gi568815576f:45068178_45284652|GENSCAN_predicted_peptide_2|127_aa MAEGKKEQVTSHVDGSRQRVCAGKLCLRKPSDLGPAELLALGSAVGLSPIFLHGNVHVVL TLMGYHQVSQEAPTPTHGRTTVINHNKLMSDDDKHHKVMENKERGITVKVLKSIALGIAV DFRASTV >gi568815576f:45068178_45284652|GENSCAN_predicted_CDS_2|384_bp atggcagaaggcaagaaggagcaagtcacatctcacgtggatggcagcaggcaaagagtt tgtgcagggaaactctgccttaggaagccgtcggatcttggacctgcggagcttctggct cttggatcagctgtaggactttcaccaatatttctgcatgggaatgtccatgtggtcttg acactaatgggataccatcaagtatcccaggaagcacccacacccacccacgggagaacc acagtcatcaatcataacaagctcatgtcagacgatgataagcatcacaaagtcatggag aataaagagcgaggcatcactgtcaaagttttgaagagcattgctttgggtattgctgtt gactttcgtgcttctaccgtgtga >gi568815576f:45068178_45284652|GENSCAN_predicted_peptide_3|52_aa MRARARARVCVCVCVCVCENLACIFVFVVLPQTNMGSVFHVIPTQAEVTAVK >gi568815576f:45068178_45284652|GENSCAN_predicted_CDS_3|159_bp atgcgcgcgcgcgcgcgcgcgcgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgaaaat ctggcttgtatttttgtatttgtcgtgctgccccaaaccaatatgggcagcgtcttccat gtgatacccacccaggcagaggtaactgctgtcaagtaa >gi568815576f:45068178_45284652|GENSCAN_predicted_peptide_4|583_aa MDSFGISPAAPTRGHRGPRPCPLTDLPRRNSSRIAGPSAATSARTSWKTRCRPHVSRRPK ARLLAPATRSSVRSPPRDMVFESLGLPSTKSSEFRSAAPEAAERAQPGDPCGLQTPAPLR PGFRRNQDPASAAAAAEEVGTFSMASEEVLKNRAIKKAKRRNVGFESDTGGAFKGFKGLV VPSGGGRFSGFGSGAGGKPLEGLSNGNNITSAPPFASAKAAADPKVAFGSLAANGPTTLV DKVSNPKTNGDSQQPSSSGLASSKACVGNAYHKQLAALNCSVRDWIVKHVNTNPLCDLTP IFKDYEKYLANIEQQHGNSGRNSESESNKVAAETQSPSLFGSTKLQQESTFLFHGNKTED TPDKKMEVASEKKTDPSSLGATSASFNFGKKVDSSVLGSLSSVPLTGFSFSPGNSSLFGK DTTQSKPVSSPFPTKPLEGQAEGDSGECKGGDEEENDEPPKVVVTEVKEEDAFYSKKCKL FYKKDNEFKEKGIGTLHLKPTANQKTQLLVRADTNLGNILLNVLIPPNMPCTRTGKNNVL IVCVPNPPIDEKNATMPVTMLIRVKTSEDADELHKILLEKKDA >gi568815576f:45068178_45284652|GENSCAN_predicted_CDS_4|1752_bp atggactcatttggaatcagccctgcagctcccacacgcggacaccgcgggcctcggcca tgtccactgaccgaccttcctcgccggaactcctcgcgcatcgccggcccctcagcggcg actagcgcgcgaacatcctggaagacgcgttgccggccccacgtgagccgacgccccaaa gcgcggctgctggcgccggctacgagatcaagcgtgaggagcccccccagggatatggtg tttgagtctctgggcttgccgagcactaagtcctctgagttccgcagcgcagcaccggaa gcggccgagcgcgctcagcccggcgacccctgcgggctccagacccctgcgccgctgcgc cccgggtttcgccgcaaccaagacccagcgagtgcagcggcggccgccgaggaggtggga acattctccatggccagtgaggaagtcttgaagaatagagccataaagaaagcaaagcgc agaaatgttggatttgaatctgacactggaggagcctttaaaggttttaaaggtttggtg gtaccttctggaggaggacgcttttctggatttggtagtggcgctggagggaagcctttg gaaggactgtcgaatggaaacaacataaccagtgcccctcccttcgccagtgcaaaggca gcggcagatcccaaggtagcctttggttctcttgctgcaaatggccctaccaccttggtt gataaagtttcaaatcccaaaactaatggggacagtcagcagccctcctcctctggcctt gcttccagtaaagcttgtgtcggaaatgcctatcacaagcagttggccgccttgaactgc tccgtgcgggattggatagtgaagcacgtgaatacaaaccccctctgtgatctgacacct atctttaaagactatgagaaatatttagcaaacattgaacagcaacacgggaacagtggc aggaattctgaaagtgaatctaacaaagtggcagctgaaacacagtctccttcccttttt ggctcaacaaaattacagcaagagtcaacgtttttgtttcatggcaacaaaactgaagat acacctgacaagaagatggaggtggcatctgaaaagaaaacggacccatcatcactagga gcgacaagtgcctcatttaatttcggcaagaaagttgatagctctgttttgggctcatta agctctgtccccctgactggattttctttctcccctggaaactccagtttatttggcaaa gatactacccagagtaaaccagtctcttcaccatttcccactaaaccattggagggccaa gcagaaggtgacagtggtgaatgcaaaggtggagatgaagaagagaatgatgagccaccc aaagtagtagttaccgaagtaaaagaagaagatgctttttactccaaaaagtgtaaactg ttttacaagaaagacaatgagtttaaagagaaaggcataggtactctgcatttaaaacct acagcaaatcagaagacacagcttttggtgcgggcagacaccaatttaggcaacatattg ctgaacgttctgattccacccaatatgccatgtacgcgaacagggaagaataacgttctt atcgtctgtgttccaaatccaccaattgacgagaagaatgccaccatgccagtcaccatg ttgattcgggtaaaaaccagcgaggatgcagacgagttgcacaaaattttactggagaaa aaggatgcctga >gi568815576f:45068178_45284652|GENSCAN_predicted_peptide_5|110_aa MVIHTNIPVTSPEVSGKAHSVKGDDEAAPTYEGPAVADSEGSAGKCESLLEVPSPAELQD RRLLQTPHLPLDQPAYLEPAKPELGAPGLYSIFSISVRVKRFSEKAWEIL >gi568815576f:45068178_45284652|GENSCAN_predicted_CDS_5|333_bp atggttattcacacgaatatccctgtcacttctccagaggtgtcaggaaaagctcacagt gtgaaaggcgacgatgaggctgctcccacctacgagggccctgcagtagcagactcggaa ggatcagccggaaaatgcgagagcctgctcgaggtcccctcacctgctgagctgcaagac agacgcctgctgcaaacacctcatcttccgttagatcagccagcttacctggagcctgca aaacctgaacttggggctcctggcctctattccattttttccatttctgtgagggttaag aggttctctgaaaaagcctgggaaattttatag >gi568815576f:45068178_45284652|GENSCAN_predicted_peptide_6|996_aa MSSANNYMIWEKGSKSQTSTQDEGPKLYATMRHGLVHFVNRTAWPEGKKMSASSGSRISR AVMGGCRQDILGFSRKTVPADSQPTLGEGPRLKVYTTAWGFQMCPNFPIQPLSVAPMMHH CSIAHVGKGGTGWEQRLPRAQPPGFVALSKPLGISTPCKMLSKDRDAGPLEKRNTQADCG FLGARGGLRAGPPALPAAAAPEPAGGAEAEERSLQHMLRAIAEERGRLSLRREVCGLATC PWIHLSPNSMNLNVIDIDTWRGWDLMQRLGQSRPTELDTKPNPTSLPVLTLGAYLVLILG AHLVLILGAHLVLILGAVSVLTLGAYLVLILGAVSVLTLGAHLVLFLGASLVLTLGAGLM LTLGAGLVLTLGAYLVLILVAGSVLTLGAYLVLILVPGVTHLERALPVLEFEESDLQKQV TRLRQGRQLDHQNSWQISVFLVQAIVIIHGFTYSLKFICNPKISTHGTFGLMTDIHAVSK NSLPDTGFQLSLGWLEGGCPFKSGSRWDCRSQGEWVGVRDVGCFKDDRIVFWTWMFSTYF MEKWAPRQDDMLFYVRRKLAYSGSESGADGRKAAEPEVEVEVYRRDSKKLPGLGDPDIDW EESVCLNLILQKLDYMVTCAVCTRADGGDIHIHKKKSQQVFASPSKHPMDSKGEESKISY PNIFFMIDSFEEVFSDMTVGEGEMVCVELVASDKTNTFQGVIFQGSIRYEALKKVYDNRV SVAARMAQKMSFGFYKYSNMEFVRMKGPQGKGHAEMAVSRVSTGDTSPCGTEEDSSPASP MHERVTSFSTPPTPERNNRPAFFSPSLKRKVPRNRIAEMKKSHSANDSEEFFREDDGGAD LHNATNLRSRSLSGTGRSLVGSWLKLNRADGNFLLYAHLTYVTLPLHRILTDFGWSRSFP RAEVLPLFTFLAHLLSPLEWPQMPRRWEHEKFDGFALGWEYLTHAQFQKGLQLSRRPRSR QNGLLLPAKEHRQHAHLQPGTSHAGPRTEEPAEFGF >gi568815576f:45068178_45284652|GENSCAN_predicted_CDS_6|2991_bp atgagttctgccaacaactacatgatctgggaaaagggctccaagtctcagacttctact caggatgaagggcccaagctctatgccacgatgaggcatggattggtgcacttcgtgaac agaacagcatggccagaagggaagaaaatgtcagcaagcagcggcagcagaatcagcagg gctgttatgggggggtgcagacaggacatattgggcttcagcaggaagacagtgcctgct gacagccagcccaccctcggagagggcccacgattgaaggtctatacaacagcctggggc ttccagatgtgcccaaactttcccatccagcccttatccgtggctcccatgatgcaccac tgctccatcgcccacgtaggaaagggcggcactgggtgggaacagcgcttaccacgggca caaccccctggcttcgtggccttgagtaagccactcggcatctccactccctgcaaaatg ttgtcaaaagacagggatgcaggcccactggagaagagaaatacacaggcggactgtggc ttccttggtgctcgcggcgggctgcgggccgggccgcccgccctccccgccgccgccgcc cccgagcctgcgggcggcgccgaggccgaggagcgctcgctgcagcacatgctgcgtgcc atagcggaggagcgcggccgtcttagcctgcgccgcgaggtctgcggcctcgccacctgt ccttggatccacctgagccccaacagcatgaaccttaatgtcatcgacattgacacctgg cgggggtgggacctcatgcagcgcctggggcagagcaggcccacagaactggatacgaag cccaaccccacgagcttgccagtgctgaccctgggtgcttatctggtgctgatcctgggt gctcatctggtgctgatcctgggtgctcatctggtgctgatcctgggtgctgtctcagtg ctgaccctgggtgcttatctggtgctgatcctgggtgctgtctcagtgctgaccctgggt gctcatctggtgctgttcctgggtgctagcttagtgctgaccctgggtgctggcttaatg ctgaccctgggtgctggcttagtgctgaccttgggtgcttatctggtgctgatcctagtt gctggctcagtgctgaccctgggtgcttatctggtgctgatcctagtccctggagtcact cacctggaaagggctcttccagttctggagttcgaagagagtgacttacagaagcaggtc actcgcttgaggcaggggagacagctggatcaccagaactcctggcaaatatcagttttc ctggtccaggccattgtcattattcacggattcacctactcactgaaatttatttgtaac cctaaaatcagtactcatggcactttcgggttaatgactgacattcatgcagtgtcgaaa aattccttacctgacacaggtttccagctgagccttggctggctggagggcggctgtccc tttaagtcgggttcccgctgggactgccgcagccagggggagtgggtgggggtgagagat gttgggtgcttcaaggatgaccgcatcgtcttctggacttggatgttctccacctacttc atggagaaatgggctccccggcaggacgacatgcttttctatgtgcgccggaagctggcg tactccggcagcgaaagcggtgcagacgggaggaaggcagctgagcctgaggtggaggtg gaggtgtaccggcgggactccaagaagctgccaggcctgggagaccctgacatcgactgg gaggagagcgtctgcctgaatctcatcctgcagaagctggactacatggtgacctgtgcg gtgtgcacacgtgctgacggcggggacattcacatccataagaagaaatctcagcaagtg ttcgcgtcccccagtaaacaccccatggacagcaagggggaggagtccaagatcagctac cccaacatcttcttcatgattgacagcttcgaggaggtgttcagcgacatgaccgtaggg gaaggagagatggtctgtgtggagctggtggctagtgacaaaaccaacacgttccagggg gtcatctttcagggctccatccgctacgaggcgctcaagaaggtgtatgacaaccgggtg agcgtggccgcccgcatggcacagaagatgtcgtttggcttctacaagtacagcaacatg gagtttgtgcgcatgaagggcccccagggcaagggccacgccgagatggcggtcagccga gtgtctacaggtgacacatccccctgtgggactgaagaggactccagcccagcttcgccc atgcacgagcgggtgacctccttcagcacaccccccaccccagaacggaacaaccggcct gccttcttctccccatccctcaagaggaaggtgccccggaaccggatcgctgagatgaag aagtcgcactcggccaacgacagcgaggagttcttccgggaggacgacggtggagccgat ctgcacaatgcaaccaacctgcggtctcggtccctgtcgggcacaggacggtccctggtc gggtcctggctgaagctgaacagagcagatggaaacttccttctctatgcacacttaacc tacgtcacgttgccgctgcatcggattttaacagactttgggtggagccgtagctttccc agagctgaagtcctccctttgttcacctttctggcgcacctgctgtcccctcttgagtgg ccacagatgcccaggagatgggaacacgagaagtttgatggctttgccctgggctgggaa tacctcacccacgcccagttccagaaaggcctccagctgagcagacggccccgatcccgc cagaacggccttttgcttccagccaaagaacaccgccaacacgcacacctccaacctggg acatcccacgctgggcctcgcacggaggaacctgcagaatttggattctga >gi568815576f:45068178_45284652|GENSCAN_predicted_peptide_7|261_aa MWDVGQDSSLLRGAGGPGLQLWFGRLQLHLGGLREHRDAWVHSRGLGSCSGMQGAATPTQ NGRGSCLSPAPASSMEHAAPAVPPCCSWCDSSSRSRWPATATTETGPRTVPSAQESAVGL LTASLLKGDSGSTGFCPLLMSLAAPGTQQCSLGFCCTFERKRSSVSPAAPGTAAPELAAA LMLMPRKNQIAVYELLFKQGVMVAKEGVHLPRHPELADKNVPNLHIMKAMQSLKSQGHRE EQFAWRHFYWHLTREGIQSIQ >gi568815576f:45068178_45284652|GENSCAN_predicted_CDS_7|786_bp atgtgggatgtggggcaggactccagcctgctccgtggagcgggaggccccggtctgcag ctgtggtttgggcggctgcagctgcacctgggagggctccgagagcacagggatgcctgg gtccacagccgtggcttgggcagctgcagcggcatgcagggagctgccaccccgactcag aatgggagaggctcctgcctgtccccagctcctgccagttccatggaacatgcagcccca gccgtgcctccctgctgcagctggtgtgacagcagcagccgctccagatggcctgccact gccaccactgagacaggtccccgcacagtcccttctgcacaggagagcgccgtgggtttg ctgaccgcgtccctgctgaaaggcgactccgggagcacaggcttctgccctctcctgatg tccctagcagcgccgggcacgcagcagtgctcgctgggtttttgttgcacctttgagagg aaaaggagctcggtgtcccctgcagcccctggaactgcggccccggaacttgcagctgca ttgatgttgatgcctaggaagaaccagattgccgtttatgaactcctttttaagcaggga gtcatggtggctaaggagggtgtccacctgcctaggcacccggagctggcagacaagaat gtgcccaaccttcacatcatgaaggccatgcagtccctcaagtcccaaggccacagggag gaacagtttgcctggagacatttctactggcaccttacccgtgagggtatccagagtatc caatga