GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:28:14 Sequence gi568815583r:75368772_75627264 : 258493 bp : 45.28% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.21 PlyA - 538 533 6 1.05 1.20 Term - 3438 3208 231 0 0 125 36 218 0.996 16.77 1.19 Intr - 7101 6894 208 2 1 50 94 251 0.999 20.98 1.18 Intr - 11952 11858 95 0 2 77 109 90 0.999 8.76 1.17 Intr - 12934 12842 93 1 0 62 64 102 0.534 5.36 1.16 Intr - 15666 15493 174 0 0 108 100 129 0.837 16.24 1.15 Intr - 21050 20881 170 0 2 93 94 231 0.987 23.87 1.14 Intr - 24044 23471 574 2 1 50 75 451 0.978 32.31 1.13 Intr - 26092 25909 184 0 1 96 83 184 0.988 18.49 1.12 Intr - 27725 27487 239 2 2 116 83 177 0.999 16.41 1.11 Intr - 31385 31269 117 2 0 84 72 55 0.776 4.16 1.10 Intr - 32169 31959 211 2 1 96 94 115 0.996 11.82 1.09 Intr - 33199 33081 119 1 2 67 98 3 0.940 -1.54 1.08 Intr - 38373 38284 90 0 0 101 113 24 0.982 6.39 1.07 Intr - 41220 41065 156 0 0 47 100 173 0.999 14.61 1.06 Intr - 41515 41363 153 1 0 68 68 107 0.990 6.97 1.05 Intr - 42837 42721 117 0 0 46 110 124 0.804 11.06 1.04 Intr - 44274 43992 283 2 1 78 105 75 0.788 5.62 1.03 Intr - 45540 45434 107 0 2 98 83 58 0.870 5.31 1.02 Intr - 54052 53876 177 1 0 92 31 105 0.473 5.32 1.01 Init - 61604 61416 189 2 0 60 115 159 0.993 14.81 1.00 Prom - 74649 74610 40 -2.96 2.03 PlyA - 75171 75166 6 1.05 2.02 Term - 82810 82675 136 0 1 98 49 81 0.365 2.69 2.01 Init - 86324 86125 200 2 2 46 38 231 0.724 10.28 2.00 Prom - 98040 98001 40 -6.06 3.14 PlyA - 98750 98745 6 1.05 3.13 Term - 100212 99998 215 1 2 121 35 285 0.786 23.89 3.12 Intr - 101228 101021 208 2 1 80 97 99 0.994 8.65 3.11 Intr - 102059 101909 151 1 1 63 84 149 0.965 12.16 3.10 Intr - 104996 104918 79 0 1 124 117 52 0.998 10.31 3.09 Intr - 111143 111077 67 2 1 53 121 115 0.016 9.78 3.08 Intr - 121530 121437 94 2 1 114 74 69 0.437 8.07 3.07 Intr - 137232 136904 329 0 2 95 103 255 0.808 22.20 3.06 Intr - 140256 140146 111 0 0 121 86 119 0.999 15.58 3.05 Intr - 148593 148488 106 2 1 41 116 98 0.607 8.12 3.04 Intr - 154474 154350 125 1 2 69 87 55 0.975 2.88 3.03 Intr - 155527 155438 90 1 0 99 64 90 0.995 7.89 3.02 Intr - 158490 158347 144 0 0 119 36 148 0.889 13.18 3.01 Init - 210005 209943 63 2 0 82 115 109 0.890 14.16 3.00 Prom - 218009 217970 40 -1.96 4.08 PlyA - 221756 221751 6 1.05 4.07 Term - 229910 229587 324 2 0 110 43 300 0.837 22.46 4.06 Intr - 236456 236379 78 2 0 81 67 55 0.823 2.45 4.05 Intr - 238542 238445 98 1 2 109 91 22 0.778 4.33 4.04 Intr - 240880 240787 94 1 1 78 77 79 0.995 5.34 4.03 Intr - 241223 241119 105 2 0 99 96 76 0.992 9.91 4.02 Intr - 248781 248637 145 2 1 100 108 187 0.998 22.18 4.01 Intr - 252285 252123 163 1 1 114 69 77 0.793 7.43 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:75368772_75627264|GENSCAN_predicted_peptide_1|1228_aa MKRRLDDQESPVYAAQQRRIPGSTEAFPHQHRVLAPAPPVYEAVSETMQSATGIQYSVTP SYQVSAMPQSSGSHGPAIAAVHSSHHHPTAVQPHGGQVVQSHAHPAPPVAPVQGQQQFQR LKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKSQSIDTPGVISRVSQLFKGHPDLIM GFNTFLPPGYKIEVQTNDMVNVTTPGQVHQIPTHGIQPQPQPPPQHPSQPSAQSAPAPAQ PAPQPPPAKVSKNNQPVEFNHAINYVNKIKNRFQGQPDIYKAFLEILHTYQKEQRNAKEA GGNYTPALTEQEVYAQVARLFKNQEDLLSEFGQFLPDANSSVLLSKTTAEKVDSVRNDHG GTVKKPQLNNKPQRPSQNGCQIRRHPTGTTPPVKKKPKLLNLKDSSMADASKHGGGTESL FFDKVRKALRSAEAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFLGYK ESVHLETYPKERATEGIAMEIDYASCKRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTW VSFPSWSEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLATIRVLEAIQKKLSRLSA EEQAKFRLDNTLGGTSEVIHRKALQRIYADKAADIIDGLRKNPSIAVPIVLKRLKMKEEE WREAQRGFNKVWREQNEKYYLKSLDHQGINFKQNDTKVLRSKSLLNEIESIYDERQEQAT EENAGVPVGPHLSLAYEDKQILEDAAALIIHHVKRQTGIQKEDKYKIKQIMHHFIPDLLF AQRGDLSDVEEEEEEEMDVDEATGAVKKHNGVGGSPPKSKLLFSNTAAQKLRGMDEVYNL FYVNNNWYIFMRLHQILCLRLLRICSQAERQIEEENREREWEREVLGIKRDKSDSPAIQL RLKEPMDVDVEDYYPAFLDMVRSLLDGNIDSSQYEDSLREMFTIHAYIAFTMDKLIQSIV RQLQHIVSDEICVQVTDLYLAENNNGATGGQLNTQNSRSLLESTYQRKAEQLMSDENCFK LMFIQSQGQVQLTIELLDTEEENSDDPVEAERWSDYVERYMNSDTTSPELREHLAQKPVF LPRNLRRIRKCQRGREQQEKEGKEGNSKKTMENVDSLDKLECRFKLNSYKMVYVIKSEDY MYRRTALLRAHQSHERVSKRLHQRFQAWVDKWTKEHVPREMAAETSKWLMGEGLEGLVPC TTTCDTETLHFVSINKYRVKYGTVFKAP >gi568815583r:75368772_75627264|GENSCAN_predicted_CDS_1|3687_bp atgaagcggcgtttggatgaccaggagtcaccggtgtatgcagcccagcagcgtcggatc cctggcagcacagaggcttttcctcaccagcaccgggtgcttgcccctgcccctcctgtg tatgaagcagtgtctgagaccatgcagtcagctacgggaattcagtactctgtaacaccc agctaccaggtttcagccatgccacagagctccggcagtcatgggcccgctatagcagca gttcatagcagccatcatcacccaacagcggtgcagccccacggaggccaggtggtccag agtcatgctcatccagccccaccagttgcaccagtgcagggacagcagcaatttcagagg ctgaaggtggaggatgcgctatcttatcttgaccaggtgaagctgcagtttggtagtcag cctcaggtctacaatgatttccttgacatcatgaaggaatttaaatctcagagcatcgac accccaggagtgattagtcgtgtgtcccagctattcaaaggccaccccgatctgataatg ggattcaacaccttcttgccccctggctacaaaattgaggtgcaaaccaatgacatggtg aatgtgacaactcctggccaggttcatcagattcccacccatggcatccagccacagcct caaccaccaccccaacatccttcccagccttcagcccagtcagccccagctcctgcccag ccagctcctcagcccccacctgccaaagtcagcaagaacaatcaacctgtggagtttaat catgccatcaactatgttaataagatcaagaacagatttcagggccaaccagacatctac aaagcattcctggagattttgcacacatatcagaaagagcagagaaatgccaaggaagct ggaggaaactacactccagcattgacagagcaggaggtgtatgcccaggttgctcgtctc tttaaaaaccaggaagatttgttgtcagagtttggacaattcctaccagatgccaacagc tccgtgcttttaagcaaaacaactgctgagaaggttgattctgtgagaaatgatcatgga ggcactgtcaagaagccccaactgaacaacaagccgcagaggcccagccagaatggctgc cagatccgcagacatcctacaggaaccacccctccagttaagaagaaacccaaactgctc aatctgaaggattcttctatggcagatgccagcaaacatggtggtggaacagaatcgtta ttttttgataaggtccgaaaggctcttcggagtgcagaagcctacgaaaatttcctacgc tgtcttgttatttttaaccaggaggtgatctctcgtgctgagcttgtgcaactagtctct cctttcctggggaaattccctgagttgtttaattggtttaaaaactttctgggctataag gagtctgtacatctggaaacttatccaaaggagcgagccacagagggcattgctatggag atagattatgcttcttgtaaacgattgggctccagctatcgagccttaccaaagagttac cagcagcccaagtgtacaggacggactcctctctgtaaagaggttttaaatgatacctgg gtttccttcccttcgtggtctgaggactctacctttgtgagttccaagaagactcaatat gaagaacatatttatcgttgtgaagatgaacgctttgagcttgatgtagttttagagacc aatctggcaacaatccgggttctggaagcaatacagaagaagctttcccgcttgtctgct gaagaacaagccaaatttcgcttggacaacacccttgggggcacatcagaagtcatccat agaaaagcactccagaggatatatgctgataaagcagctgacatcattgatggtctgaga aagaatccctccattgctgttccaattgtccttaaaaggttgaagatgaaagaggaagaa tggcgagaagctcagagaggctttaacaaagtatggcgagaacaaaatgagaaatactac ttgaagtctctggaccaccaggggatcaactttaaacagaatgacaccaaggtcctgagg tctaagagcttactcaatgagattgagagtatctatgatgagaggcaagagcaggctacg gaggagaatgctggtgtacctgttggcccacacctctcacttgcgtatgaagacaaacaa atactggaagatgctgctgctctgattatccaccatgtgaagaggcagacaggcattcag aaggaggacaaatataagataaaacaaatcatgcatcattttattccagatttgctcttt gcccaaagaggtgatctctcagatgtggaggaagaggaagaagaagagatggatgtagat gaagccacaggggcagttaagaagcacaatggtgttgggggcagtccccctaagtccaag ttactgtttagtaacacagcagctcaaaaattaagaggaatggatgaagtatacaacctc ttctatgtcaacaacaactggtatatttttatgcgactgcaccagattctctgcctgagg ctgctacggatttgttcccaagccgaacggcaaattgaagaagaaaaccgagagagagaa tgggaacgggaagtgctgggcataaagcgagacaagagtgacagccctgccattcagcta cgtctcaaagaacctatggatgttgatgtagaagattattacccagctttcctggacatg gtgcggagcctgctggatggcaacatagactcatcacagtatgaagattcactgagagag atgttcaccattcatgcctacattgcctttaccatggacaaactgatccagagcattgtc agacagctgcagcatatcgtgagtgatgagatctgtgtgcaggtgactgacctttacctg gcagaaaataataatggggccaccggaggccagctgaacacacagaactcaaggagcctc ctggagtcaacgtatcagcggaaagctgagcagctaatgtcagatgagaattgctttaag cttatgtttattcagagccaaggccaggtccagctgactattgagcttctggacacagaa gaggagaattcggatgaccctgtggaagcagagcgctggtcagactacgtggagcgatac atgaattcagatactacctcgcctgagcttcgtgaacatctagcacagaaaccagtattt ctccccaggaatctacggcggatccggaagtgtcaacgtggtcgagagcagcaggaaaag gaagggaaggaaggaaacagcaagaagaccatggagaatgtggatagtctggataagctg gagtgtagattcaagctgaattcctacaagatggtgtatgtgatcaaatcagaggactat atgtatcggaggaccgccctgctccgggctcatcagtcccatgagcgtgtaagcaagcgt ctacatcagagattccaggcctgggtagataaatggaccaaggagcatgtgccccgtgaa atggcagcagagaccagcaagtggctcatgggtgaggggctggagggcctggtgccctgt accaccacctgtgatacagagaccctgcattttgtgagcattaacaagtatcgtgtcaaa tacggcacagtattcaaagccccttaa >gi568815583r:75368772_75627264|GENSCAN_predicted_peptide_2|111_aa MDSGGGGNTGLAGGAAGGGRGHRKEPLRLAASPASCRVSKLPGRGGRVSIRCRNPSGRDP ARSSHRCLGTGACDRFVSGEVAGSPLRRACAPRRSRGLPAELAAHAERDTQ >gi568815583r:75368772_75627264|GENSCAN_predicted_CDS_2|336_bp atggactctgggggcgggggcaacaccggacttgcgggaggggcggccgggggcgggaga ggccaccggaaagagcccctgcgcctcgccgctagccctgcgagctgccgggtctccaaa ctccctggccgtggcggtcgagtcagcattcggtgccggaacccgtcaggccgcgaccca gcaaggagctcgcatcggtgtctgggcaccggagcctgtgaccgcttcgttagtggagag gtagctggcagtccgcttcggcgggcctgtgccccgcgccgttctcggggcctccctgct gagctcgcggctcacgctgagagggacacgcagtga >gi568815583r:75368772_75627264|GENSCAN_predicted_peptide_3|593_aa MEPATAPRPDMAPELTPEEEQATKQFLEEINKWTVQYNVSPLSWNVAVKFLMARKFDVLR AIELFHSYRETRRKEGIVKLKPHEEPLRSEILSGKFTILNVRDPTGASIALFTARLHHPH KSVQHVVLQALFYLLDRAVDSFETQRNGLVFIYDMCGSNYANFELDLGKKVLNLLKGAFP ARLKKVLIVGAPIWFRVPYSIISLLLKDKVRERIQILKTSEVTQHLPRECLPENLGGYVK IDLATWNFQFLPQVNGHPDPFDEIILFSLPPALDWDSVHVPGPHAMTIQELVDYVNARQK QGIYEEYEDIRRENPVGTFHCSMSPGNLEKNRYGDVPCLDQTRVKLTKRSGHTQTDYINA SFMDGYKQKNAYIGTQGPLENTYRDFWLMVWEQKVLVIVMTTRFEEGGRRKCGQYWPLEK DSRIRFGFLTVTNLGVENMNHYKKTTLEIHNTEERQKRQVTHFQFLSWPDYGVPSSAASL IDFLRVVRNQQSLAVSNMGARSKGQCPEPPIVVHCSAGIGRTGTFCSLDICLAQLEELGT LNVFQTVSRMRTQRAFSIQTPEQYYFCYKAILEFAEKEGMVSSGQNLLAVESQ >gi568815583r:75368772_75627264|GENSCAN_predicted_CDS_3|1782_bp atggagcccgcgaccgcgccccggcccgacatggcgccggagctgaccccggaggaggag caggctaccaagcagtttctcgaagagattaacaagtggacagttcagtacaatgtttcc ccgctgtcttggaatgtggctgtcaagttcctcatggcaaggaagtttgatgtgctccgt gccatagaattgttccactcctacagagaaactcgaaggaaggaaggcattgtaaagctg aaacctcatgaggaacctcttcgttctgagatcctcagtggaaaattcaccatcttaaat gttcgggacccaacaggagcctccattgccctctttactgccaggttgcatcatccccac aagtcagtccaacatgtggtacttcaggctctgttttacttgctagacagagctgtggat agctttgaaactcagaggaatggactggtgtttatctatgacatgtgtggttctaattat gccaactttgagctggatcttggcaagaaagtcctaaacctgctgaagggagcatttcca gctcgtttgaagaaggtgctgattgtgggggcacccatatggttccgagtgccctattcc atcatcagtctcctcctgaaggacaaagtccgggagaggattcaaatattaaagacatct gaggtcacgcagcatctgcccagggagtgtcttccagaaaacctgggtgggtacgtcaaa attgatctcgccacttggaatttccagttcctaccccaggtgaacggccacccagatccc ttcgatgagatcatcctgttctccctccctcctgccttagactgggactcagtacatgtt ccaggtccccatgctatgaccatccaagagttggtggactatgttaatgccaggcaaaag caaggaatctatgaggaatatgaagacattcgtcgtgagaaccctgttggcactttccac tgttccatgtctccaggaaacctagagaaaaaccgttatggggatgtaccctgcctggac caaactagagtgaagctaacaaagcgaagtggccatactcagacagattacatcaatgcc agtttcatggatggctacaagcagaagaatgcttacattggcacacaaggtcctttggag aatacctatcgtgatttctggctcatggtatgggagcaaaaagtcttggtgattgtcatg accacccgctttgaggaaggcggcaggagaaagtgtggccagtactggcctttagaaaaa gactctcggatccgatttggcttcctcacagtgaccaatctaggcgtggagaacatgaat cattataagaaaacaacgctagaaattcacaacacagaggaacggcagaaacgccaggtg acccacttccagttcttgagctggccagactatggtgtcccttcctcagcagcttccctc attgacttcttgagagtggtcagaaaccagcagagtctggctgtgagcaacatgggagca cgctccaaagggcagtgccctgagccacccattgtggtccattgcagtgcaggcattggc aggacaggtaccttctgctcactggacatctgcctggcacagctggaggagcttggcacc cttaatgtgttccagacggtgtcacgcatgaggacccagagggccttcagcatccagacc cctgagcagtactatttttgctacaaggccatcctggagttcgcagagaaggagggcatg gtatcctctggccaaaacctgctggccgtggagagtcagtaa >gi568815583r:75368772_75627264|GENSCAN_predicted_peptide_4|335_aa XKMEELSQALASSFSVSQDLNSTAAPHPRLSQYKSKYSSLEQSERRRRLLELQKSKRLDY VNHARRLAEDDWTGMESEEENKKDDEEMDIDTVKKLPKHYANQLMLSEWLIDVPSDLGQE WIVVVCPVGKRALIVASRGSTSAYTKSGYCVNRFSSLLPGGNRRNSTAKDYTILDCIYNE VNQTYYVLDVMCWRGHPFYDCQTDFRFYWMHSKLPEEEGLGEKTKLNPVDGLLFYHKQTH YSPGSTPLVGWLRPYMVSDVLGVAVPAGPLTTKPDYAGHQLQQIMEHKKSQKEGMKEKLT HKASENGHYELEHLSTPKLKGSSHSPDHPGCLMEN >gi568815583r:75368772_75627264|GENSCAN_predicted_CDS_4|1008_bp nggaagatggaagagttgagtcaggccctggctagtagcttttctgtgtctcaagatctg aacagcacagctgccccacacccccgcctatcccagtacaagtccaagtacagttccttg gagcagagtgagcgccgccggaggttactggaactgcagaaatccaagcggctggattat gtgaaccatgccagaagactggctgaagatgactggacagggatggagagtgaggaagaa aataagaaagatgatgaagaaatggacattgacactgtcaagaagttaccaaaacactat gctaatcaattgatgctttctgagtggttaattgacgttccttcagatttggggcaggaa tggattgtggtcgtgtgccctgttggaaaaagagcccttatcgtggcctccaggggttct accagtgcctacaccaagagtggctactgtgtcaacaggttttcttcacttctgccagga ggcaacaggcgaaactcaacagcaaaagactacaccattctagattgcatttacaatgag gtaaaccagacctactacgttctggatgtgatgtgctggcggggacaccctttttatgat tgccagactgatttccgattctactggatgcattcaaagttaccagaagaagaaggactg ggagagaaaaccaagcttaatcctgtagatggacttctcttctaccacaaacagacccac tacagccccggaagcactcccttggtgggctggctgcgcccctacatggtgtcagatgtc cttggtgtagctgtgccggctggcccgctgaccaccaagccagactatgctgggcaccag ctccagcagattatggagcacaagaagagccagaaggaaggcatgaaggagaaactcaca cacaaggcctctgagaatgggcactatgaattggagcacctgtctactcccaagttgaag ggttcttcccatagcccagaccaccctggatgcctcatggagaattaa