GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:44:21 Sequence gi568815581f:69405778_69641750 : 235973 bp : 40.51% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6130 6192 63 0 0 105 82 19 0.713 4.20 1.02 Intr + 6457 6559 103 0 1 36 69 58 0.607 -2.47 1.03 Term + 7679 7776 98 0 2 123 42 79 0.412 3.95 1.04 PlyA + 7850 7855 6 1.05 2.10 PlyA - 7951 7946 6 -1.75 2.09 Term - 9144 8800 345 0 0 65 53 192 0.167 6.91 2.08 Intr - 16996 16871 126 1 0 47 110 23 0.064 0.36 2.07 Intr - 18461 18356 106 1 1 100 8 94 0.075 1.90 2.06 Intr - 36151 35990 162 0 0 53 92 76 0.080 2.67 2.05 Intr - 36669 36450 220 1 1 26 93 156 0.235 6.34 2.04 Intr - 46648 46591 58 1 1 87 102 8 0.106 -0.36 2.03 Intr - 51741 51575 167 1 2 41 99 145 0.960 9.66 2.02 Intr - 55675 55547 129 2 0 54 109 150 0.728 13.45 2.01 Init - 65004 64947 58 0 1 55 99 25 0.126 1.82 2.00 Prom - 65545 65506 40 -6.95 3.03 PlyA - 66611 66606 6 1.05 3.02 Term - 71773 71566 208 0 1 85 47 197 0.309 11.13 3.01 Init - 79526 79471 56 2 2 59 84 34 0.433 0.91 3.00 Prom - 86504 86465 40 -2.05 4.00 Prom + 87612 87651 40 -5.85 4.01 Init + 91400 91511 112 1 1 88 63 74 0.493 5.32 4.02 Intr + 100003 100069 67 2 1 133 113 47 0.970 8.74 4.03 Intr + 111078 111126 49 0 1 97 110 24 0.995 3.26 4.04 Intr + 111723 111836 114 2 0 121 75 123 0.930 14.02 4.05 Intr + 113536 113655 120 0 0 45 92 149 0.875 10.77 4.06 Intr + 114493 114609 117 0 0 51 65 92 0.884 2.94 4.07 Intr + 115272 115323 52 2 1 124 99 6 0.916 2.86 4.08 Intr + 117737 117864 128 0 2 92 97 113 0.987 12.08 4.09 Intr + 120793 120932 140 0 2 111 83 67 0.524 6.84 4.10 Intr + 130338 130383 46 0 1 74 101 41 0.202 1.49 4.11 Intr + 133883 134084 202 1 1 61 111 72 0.364 4.74 4.12 Intr + 154117 154258 142 2 1 37 60 113 0.422 1.89 4.13 Term + 155251 155401 151 1 1 54 44 139 0.936 2.50 4.14 PlyA + 155853 155858 6 1.05 5.00 Prom + 159690 159729 40 -4.25 5.01 Init + 160102 160189 88 0 1 48 100 16 0.655 -0.35 5.02 Term + 161525 161625 101 0 2 67 48 124 0.760 3.61 5.03 PlyA + 161743 161748 6 1.05 6.05 PlyA - 162581 162576 6 1.05 6.04 Term - 164371 164199 173 2 2 19 37 166 0.894 1.61 6.03 Intr - 165021 164937 85 0 1 75 94 57 0.711 3.47 6.02 Intr - 169567 169399 169 0 1 10 101 117 0.162 4.23 6.01 Init - 181932 181778 155 0 2 81 89 72 0.226 6.10 6.00 Prom - 182582 182543 40 -4.75 7.03 PlyA - 183000 182995 6 1.05 7.02 Term - 190577 190383 195 2 0 24 48 202 0.628 6.23 7.01 Init - 194652 194590 63 0 0 86 87 7 0.366 1.60 7.00 Prom - 195528 195489 40 -1.45 8.03 PlyA - 195825 195820 6 1.05 8.02 Term - 203457 203355 103 2 1 74 54 95 0.818 1.47 8.01 Init - 207881 206824 1058 2 2 82 53 272 0.565 16.84 8.00 Prom - 208005 207966 40 -10.15 9.03 PlyA - 208153 208148 6 1.05 9.02 Term - 210062 209039 1024 1 1 23 37 292 0.101 8.19 9.01 Init - 210928 210501 428 1 2 59 42 286 0.091 16.92 9.00 Prom - 212584 212545 40 -6.05 10.00 Prom + 216309 216348 40 -4.85 10.01 Init + 218909 218975 67 1 1 91 101 63 0.827 9.19 10.02 Term + 229526 229620 95 0 2 86 49 89 0.580 1.81 10.03 PlyA + 232706 232711 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:69405778_69641750|GENSCAN_predicted_peptide_1|87_aa MGRILLLVPMIPRRLAGRRAQLLNILLVLWENELQIDFQATIHLPAVDSARLCREMNCAK HLWLPCEKGDAVLHQGAVYPLCVSGTE >gi568815581f:69405778_69641750|GENSCAN_predicted_CDS_1|264_bp atggggaggattctcctcctagtaccaatgataccacgaagactggcaggaagaagagca cagctgctgaacatattgttggtgctatgggaaaatgaattgcagattgactttcaagct acaatccatttgccagctgtggactccgcacgcctatgcagagaaatgaattgtgctaag catctctggcttccttgtgaaaaaggtgatgccgttctccatcagggtgctgtgtaccca ctatgtgtgagcggtacagaatga >gi568815581f:69405778_69641750|GENSCAN_predicted_peptide_2|456_aa MCEEVVSSSRQTAPGNKNRELAAEIGRNKSPLKEKRMKTGPVTALILSIASGPRAAGYVH RAGCWSASPGTILGVASPQGRTDFNKKRIPYENSNLSSYVHSDIHTNTNCGPNTEDFQLS GNELWIQYSDIQGRNREVIKYEVVADEVRSVFQGGRPSGLRIQGEARSREQDLEVAIMRG KIQRSGNSEVLLEQQHTSREKPLWEASTFGSQRTVSGGLGGSLKHPHLSVTGSPFSPGAN RGPVENLDVYTHLAVRRQHPLSATRPSIKTFSPKYTQYPRCKRLCEHKEYSSSKENCPPG PWANWLTTIDFLTRKLRIITPTLSIVFMGHPKAYYMKSNWLRLVLQQRCKLQACNLHASG VSLDRCSRWTFRESRWKCTPCKTDWKVIFPPVATLGKAVLQLHDGPGLAAKSSKLGTEYL SAQTSFLRRARRPLGNVVLRRSPRGSDLRKRGRPGR >gi568815581f:69405778_69641750|GENSCAN_predicted_CDS_2|1371_bp atgtgtgaggaggtagtgtcaagttctaggcagacagcaccaggaaacaaaaacagagag ttggctgcagaaattggaagaaataaatctccgctgaaggagaaaaggatgaagacaggg ccagtcactgccctcattctttccattgcatcgggaccacgtgctgctggctatgttcac agggcaggctgctggtctgcaagtccagggaccatacttggagtagcaagcccccaggga aggacagactttaataagaagaggatcccctatgaaaattccaacttgagctcctatgtt cattcagacattcatacaaataccaactgtgggccaaacactgaagatttccagctatct ggtaatgaattgtggattcaatactcagacatccaaggcaggaacagagaagttatcaag tatgaagtggtggcagatgaagtcagaagtgtctttcaaggagggagaccatcaggatta aggatccaaggagaggcaagatctagggagcaagatctggaagttgccatcatgagaggt aaaatccagaggtctggaaacagtgaagtgctcttagagcaacagcatacctcaagagaa aagccactctgggaagcgtccacatttggaagccagagaactgtcagtggaggcctagga ggaagcctgaagcacccccacctatcagtaacagggagtcccttctctccaggtgccaac agaggccccgtggagaatctagatgtctacacccacctggcagtaaggaggcagcacccc ctttcagccaccagaccctctatcaaaaccttctctcctaaatatacccaatatccacga tgtaagcgactctgtgagcacaaagaatattcgtcctcaaaagaaaattgcccacccggg ccatgggcaaattggttaacaacaattgatttcctcactcgtaaattaaggataataaca ccaactttgtccatagtgtttatgggacatcctaaagcatactatatgaaaagtaactgg cttaggctagttttgcaacaaagatgcaagctgcaggcgtgcaatcttcatgcaagtgga gtttctcttgaccgatgcagcagatggactttcagggaatcaagatggaaatgcacacct tgcaaaacagactggaaagtgatttttcccccagttgcaacgttagggaaggctgtgctg cagctacatgatgggccagggctggcagctaaaagctccaaacttggaactgagtatctt agtgcgcagacgtccttcctgcgaagggcgcgaaggccgctgggtaatgtagttctccgg cggagtccacggggctctgatttaaggaaacgtggtcgccccggtcgctga >gi568815581f:69405778_69641750|GENSCAN_predicted_peptide_3|87_aa MTQKWFPSQELPTSLSGHIPRCSLPIQQRFQPTSKQVQLDGTSFLAVPDAFYLWTPALRL LRHLKKASGTADMHNLGAKTCGTSCDQ >gi568815581f:69405778_69641750|GENSCAN_predicted_CDS_3|264_bp atgactcagaagtggttcccatctcaagaactacccacgagtctcagtggacacattccc cgatgcagccttccaatccagcagaggttccaaccaacttccaaacaggtgcaacttgat ggcacctcatttttggctgtaccagatgccttctatctctggactcctgccctcagactt ctccgacacctgaagaaagcttctgggacagctgacatgcacaacctgggagctaaaaca tgtgggaccagctgtgaccaatga >gi568815581f:69405778_69641750|GENSCAN_predicted_peptide_4|479_aa MGPDFKGLAFLVEDMDEQMGSLNQHRKVQGTVATQRKGKKRNPGLKIPKEAFEQPQTSST PPRDLDSKACISIGNQNFEVKADDLEPIMELGRGAYGVVEKMRHVPSGQIMAVKRIRATV NSQEQKRLLMDLDISMRTVDCPFTVTFYGALFREGDVWICMELMDTSLDKFYKQVIDKGQ TIPEDILGKIAVSIVKALEHLHSKLSVIHRDVKPSNVLINALGQVKMCDFGISGYLVDSV AKTIDAGCKPYMAIELAILRFPYDSWGTPFQQLKQVVEEPSPQLPADKFSAEFVDFTSQC LKKNSKERPTYPELMTQATSTELIATSRTKGKVGLAGVGSSHQDNLRFGMSCFDSKKLLS TLKDFQMSRGCGKANKHDGRVLVLMLFEKLYTVLKLAFSILVHQRCSIIAITWELINNTE KNLEFNTIPRMTLQTGRRRVVLRQELAKPLDLSTAAELFIRSMPSLLALAIWQPEKSSQ >gi568815581f:69405778_69641750|GENSCAN_predicted_CDS_4|1440_bp atgggccctgacttcaaagggcttgcatttttagtggaagacatggatgaacagatgggt tcacttaatcagcataggaaggttcagggtactgtggcaacacagaggaaaggcaagaag cgaaaccctggccttaaaattccaaaagaagcatttgaacaacctcagaccagttccaca ccacctcgagatttagactccaaggcttgcatttctattggaaatcagaactttgaggtg aaggcagatgacctggagcctataatggaactgggacgaggtgcgtacggggtggtggag aagatgcggcacgtgcccagcgggcagatcatggcagtgaagcggatccgagccacagta aatagccaggaacagaaacggctactgatggatttggatatttccatgaggacggtggac tgtccattcactgtcaccttttatggcgcactgtttcgggagggtgatgtgtggatctgc atggagctcatggatacatcactagataaattctacaaacaagttattgataaaggccag acaattccagaggacatcttagggaaaatagcagtttctattgtaaaagcattagaacat ttacatagtaagctgtctgtcattcacagagacgtcaagccttctaatgtactcatcaat gctctcggtcaagtgaagatgtgcgattttggaatcagtggctacttggtggactctgtt gctaaaacaattgatgcaggttgcaaaccatacatggccattgagttggccatccttcga tttccctatgattcatggggaactccatttcagcagctcaaacaggtggtagaggagcca tcgccacaactcccagcagacaagttctctgcagagtttgttgactttacctcacagtgc ttaaagaagaattccaaagaacggcctacatacccagagctaatgactcaggcaacatct actgagctaatagccactagcagaacaaaaggcaaggtggggctggcaggtgttggaagc agtcaccaagataatctgaggtttggcatgtcctgttttgatagtaaaaagcttttaagt acattaaaagatttccaaatgtccagaggatgtgggaaagctaataagcatgatggtaga gtccttgtcctcatgttgtttgagaagctctacacggtcctgaaacttgccttctccatc cttgtacatcaacgctgcagcatcattgcaatcacctgggagctcattaacaatacagag aagaatctggagttcaacaccatccccagaatgaccctgcagactggaaggcgaagagtt gttcttagacaagaacttgctaaaccacttgacctttcaacagctgcagaactgtttatc aggagcatgccatctctcttggctcttgcaatttggcagccagaaaagtcaagccagtga >gi568815581f:69405778_69641750|GENSCAN_predicted_peptide_5|62_aa MQSVGQEDWEKEISGHVFLPFGNSLRNVKAWWTCVLSINASGLTGAEWTALDSIQIKVAV GV >gi568815581f:69405778_69641750|GENSCAN_predicted_CDS_5|189_bp atgcaaagtgtaggacaagaggattgggaaaaagagatctctggccatgtgttcctccct tttgggaactccttaagaaatgtgaaagcctggtggacctgtgttctttccatcaatgca agcggcctcactggtgctgaatggacagctctggatagcattcagatcaaggtggccgtg ggtgtctag >gi568815581f:69405778_69641750|GENSCAN_predicted_peptide_6|193_aa MGMVAWILDDTWKDEITKYVYMTGVDQALSRFSYLKDITNMIVTEKFLTTHSKDLNSRHE TEGIVRRPEIEHYSNTGKKHEIPPNLTTHRTFTYQYENTSSTVKGFENTRMTLDDVGDTQ ISAHEFKVCCLGEIFPATYRISLILQVKKLRHLKAKKSARACTVSHQESLTIKSRFDQYP GLFPVEQCLIIDK >gi568815581f:69405778_69641750|GENSCAN_predicted_CDS_6|582_bp atgggaatggtagcatggattctggacgatacatggaaggatgaaataacaaaatatgtg tatatgactggagttgatcaagcactttctcgcttttcttacctcaaggatatcactaat atgattgtgactgaaaaatttctaaccactcacagtaaggatttaaacagtagacatgaa acagaaggaatagtcagaaggcctgaaattgagcactattcaaacacaggtaaaaagcat gaaataccaccaaatctgaccacccatcgtacattcacttatcaatatgaaaacacttca tccacggtaaagggctttgaaaatacaaggatgacactcgatgatgtaggagatacacaa ataagtgctcatgagtttaaggtatgttgccttggagagatttttccagcaacctataga attagcctcatcttacaagtgaagaaactaagacatctaaaggcgaagaaatctgcccga gcttgtacagtttcacaccaagaaagcctgacaataaaatccaggttcgaccagtaccct ggactcttccctgtggaacagtgcctgataattgataagtga >gi568815581f:69405778_69641750|GENSCAN_predicted_peptide_7|85_aa MRSNNKTMTTRAKVSRILAKKEHAPCRPRGNARVGVLVTREAPEGMLQSSLSSAVHGRQR VISSVDSLTRCVGQLPSTSENKEPV >gi568815581f:69405778_69641750|GENSCAN_predicted_CDS_7|258_bp atgaggtcaaacaacaaaacaatgacaaccagagcaaaggtatctaggatactggcaaaa aaagagcatgctccatgcaggccccgtggcaatgccagggtgggggtgcttgtgacccgt gaagccccagagggcatgttacaatcctctcttagctctgccgtccatggacggcagcgt gttatcagctcagtagattccttgactcgttgtgtagggcagctgccctccaccagtgaa aacaaagagccagtgtga >gi568815581f:69405778_69641750|GENSCAN_predicted_peptide_8|386_aa MKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPMTF FTELEKTTLKFIWNQKRARIAKSILSQKNKAGGIMLPDFKLYYKATVTKTAWYWYQNRDI DQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPY TKINSRWIKDLNVRPKTIKTLEENLGITIQDIGIGKDFMSKTPKAMATKDKIDKWDLIKL KSFCTAKETTIRVNRQPTEWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDM NSHFSKEDIYAAKRHMKKCPSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRKLMGVAR SYPRTQIGFVVSCGIWLAESEETTRS >gi568815581f:69405778_69641750|GENSCAN_predicted_CDS_8|1161_bp atgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggataca aacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcatgaaaatggcc atactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagctcgcatc gccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacctgacttcaaa ctatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatata gatcaatggaacagaacagagccctcagaaataacgccgcatatctacaactatctgatc tttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgc tgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacaccttat acaaaaattaattcaagatggattaaagacttaaacgttagacctaaaaccataaaaacc ctagaagaaaacctaggcattaccattcaggacataggcattggcaaagacttcatgtct aaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatctaattaaacta aagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacagaatgg gagaaaattttcacaacctactcatctgacaaagggctaatatccagaatctacaatgaa ctcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaaggacatg aacagtcacttctcaaaagaagacatttatgcagccaaaagacacatgaaaaaatgccca tcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcacacca gttagaatggcaatcattaaaaagtcaggaaacaacagaaagctaatgggtgttgcacgc agttatccaagaacacaaattggctttgtggtttcttgtggcatttggctagctgaatca gaggaaacaactagaagctga >gi568815581f:69405778_69641750|GENSCAN_predicted_peptide_9|483_aa MEVEMNKMKREGKFREKRIKRNEQSLQEIWDYVKRPNLHLIGVRESDRENGTKLENTLQD IIQENFPNLARQANNQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV THKGKPIRLTADLSAETLQARREDPHRLKIKGWRKIYQANGKQKKAGVAILVSDKTDFKP TKIKRDKEGHYIMVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIVGDF NTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLQPKSTEYTFFSAPHHTYSKIDH ILGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWVHNEM KAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKTDTLTSQLKELEK QEQTHSKASRRQEITKIRAELKETDTQKTLQNINECRSWFFERINKIDRLLARLIKKKER RIK >gi568815581f:69405778_69641750|GENSCAN_predicted_CDS_9|1452_bp atggaagttgaaatgaataaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaacgaacaaagccttcaagaaatatgggactatgtgaaaagaccaaatctacatctg attggtgtacgtgaaagtgacagggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacaatcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt acccacaaagggaagcccatcagactaacagcggatctctcggcagaaactctacaagcc agaagagaagacccacataggctcaaaataaaaggatggaggaagatctaccaagcaaat ggaaaacaaaaaaaggcaggggttgcaatcctagtctctgataaaacagactttaaacca acaaagatcaaaagagacaaagaaggccattacataatggtaaagggatcaattcaacaa gaagagctaactatcctaaatatatatgcacccaatacaggagcacccagattcataaag caagtcctgagtgacctacaaagagacttagactcccacacattaatagtgggagacttt aacaccccactgtcaacattagacagatcaacgagacagaaagttaacaaggatacccag gaattgaactcagctctgcaccaagcggacctaatagacatctacagaactctccagccc aaatcaacagaatatacatttttttcagcaccacaccacacctattccaaaattgaccac atacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaattataacaaactgt ctctcagaccacagtgcaatcaaactagaactcaggattaagaaactcactcaaaaccgc tcaactacatggaaactgaacaacctgctcctgaatgactactgggtacataacgaaatg aaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacgtaccagaat ctctgggacgcattcaaagcagtgtgtagagggaaattcatagcactaaatgcccacaag agaaagcaggaaagatccaaaactgacaccctaacatcacaattaaaagaactagaaaag caagagcaaacacattcaaaagctagcagaaggcaagaaataactaaaatcagagcagaa ctgaaggaaacagacacacaaaaaacccttcaaaacattaatgaatgcaggagctggttt tttgaaaggatcaacaaaattgatagactgctagcaagattaataaagaaaaaagagaga agaatcaaatag >gi568815581f:69405778_69641750|GENSCAN_predicted_peptide_10|53_aa MDQPITSIKVLYPLADDEYLQDASSGQNYRTTRSKNLATSSAKVTKTFPNKST >gi568815581f:69405778_69641750|GENSCAN_predicted_CDS_10|162_bp atggaccaaccaataacttcaataaaagtgctgtatcccctggcagatgatgagtatctt caagatgccagttctggtcagaactacaggacaaccaggtcgaaaaatctggcaacttct tcagcaaaagtgacaaaaacatttccaaataagagcacatga