GENSCAN 1.0 Date run: 12-Jun-117 Time: 09:52:25

Sequence gi568815588f:67784722_68016590 : 231869 bp : 40.58% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   9200   9529  330  1  0   -8   38   280 0.060   7.07
 1.02 PlyA +   9544   9549    6                               1.05

 2.07 PlyA -  10786  10781    6                               1.05
 2.06 Term -  12489  12395   95  1  2  105   44    82 0.730   2.51
 2.05 Intr -  15876  15781   96  1  0   56   92    83 0.582   4.56
 2.04 Intr -  21066  20862  205  0  1   31  111   233 0.974  17.75
 2.03 Intr -  26942  26803  140  0  2   61   76   153 0.976  10.66
 2.02 Intr -  38671  38593   79  1  1  108   99    71 0.890   8.41
 2.01 Init -  40215  40213    3  0  0  113   22     0 0.379  -4.05
 2.00 Prom -  43571  43532   40                              -5.35

 3.00 Prom +  46531  46570   40                              -5.05
 3.01 Sngl +  49042  49194  153  0  0   93   51   164 0.511   7.25
 3.02 PlyA +  49429  49434    6                               1.05

 4.00 Prom +  51027  51066   40                              -4.55
 4.01 Init +  56613  57105  493  2  1   73   71   355 0.858  27.66
 4.02 Intr +  60270  60374  105  1  0   30  116    39 0.418   0.17
 4.03 Intr +  64744  64844  101  2  2    7   94    89 0.629   0.31
 4.04 Term +  65058  65249  192  2  0  132   44   179 0.535  14.34
 4.05 PlyA +  65403  65408    6                               1.05

 5.00 Prom +  82464  82503   40                              -4.15
 5.01 Init +  89699  90167  469  1  1   87  -48   524 0.611  34.49
 5.02 Term +  93943  93950    8  2  2   88   44     0 0.163  -7.35
 5.03 PlyA +  94269  94274    6                               1.05

 6.00 Prom +  99899  99938   40                              -7.95
 6.01 Init + 100001 100430  430  1  1   85   69   402 0.562  32.46
 6.02 Intr + 102696 102812  117  1  0  123   52   114 0.979  10.82
 6.03 Intr + 104161 104402  242  2  2  113   63   211 0.993  17.45
 6.04 Intr + 106681 106833  153  0  0   41   74   130 0.905   6.25
 6.05 Intr + 122069 122216  148  1  1   70   49   114 0.611   4.59
 6.06 Intr + 123325 123404   80  2  2   99   91    65 0.991   6.25
 6.07 Intr + 124535 124721  187  1  1   22  108   166 0.993  10.34
 6.08 Intr + 127753 128310  558  2  0  101   95   505 0.954  44.37
 6.09 Term + 131544 131872  329  1  2   87   36   448 0.998  33.39
 6.10 PlyA + 132621 132626    6                               1.05

 7.18 PlyA - 133069 133064    6                               1.05
 7.17 Term - 138418 138210  209  2  2   76   32   178 0.924   7.52
 7.16 Intr - 140466 140364  103  0  1   76   75    52 0.990   1.53
 7.15 Intr - 148059 147876  184  2  1  104   97    95 0.997  10.87
 7.14 Intr - 151514 151432   83  2  2   46  123    92 0.992   6.02
 7.13 Intr - 154933 154867   67  0  1   56  107    42 0.932   0.89
 7.12 Intr - 156384 156218  167  0  2   81   88   126 0.922   9.74
 7.11 Intr - 170017 169874  144  1  0   67   93   135 0.928  11.46
 7.10 Intr - 170409 170242  168  0  0  109   86    39 0.917   5.02
 7.09 Intr - 172255 172157   99  1  0   66   98    35 0.722   1.69
 7.08 Intr - 182081 181962  120  2  0  111   78   110 0.899  12.07
 7.07 Intr - 184729 184689   41  2  2   95   68    52 0.376   0.92
 7.06 Intr - 205679 205490  190  2  1   22   91   115 0.757   3.44
 7.05 Intr - 206294 206183  112  1  1   96   62    57 0.973   3.36
 7.04 Intr - 207602 207478  125  2  2   40   71   142 0.968   6.16
 7.03 Intr - 224136 223997  140  1  2   70   70    17 0.024  -2.64
 7.02 Intr - 226150 225968  183  2  0   28   49   151 0.840   4.04
 7.01 Intr - 229465 229305  161  0  2   66  111   116 0.657  10.41

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 224918 224788  131  0  2   44   39   124 0.870   0.46

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:67784722_68016590|GENSCAN_predicted_peptide_1|109_aa
ECPASVTRAKPEESTVLLHAVGIVINTRVKGKILARRINVCIEHIKHSKSRDSFLKRMKE
NDQKKKEANEKGSWVQLKRQPAPAGEVYFVGTNGKAPELLEPIPYEFMA

>gi568815588f:67784722_68016590|GENSCAN_predicted_CDS_1|330_bp
gaatgccccgcaagtgttaccagggcaaaaccggaagagtctacagtgttactgcatgct
gttggcattgttataaacacacgagttaagggcaagattcttgccaggagaattaatgta
tgtattgagcacattaaacactctaaaagccgagatagcttcctgaaacgcatgaaggaa
aatgatcagaaaaagaaggaagccaatgagaaaggttcctgggttcaactgaagcgccag
cctgctccagccggagaagtgtactttgtgggaaccaatgggaaggcgcctgagctgctg
gaacctattccctatgaattcatggcataa

>gi568815588f:67784722_68016590|GENSCAN_predicted_peptide_2|205_aa
MVEQILAEFKVRALECHPDKHPENPKAVETFQKLQKAKEILTNEESRARYDHWRRSQMSM
PFQQWEALNDSVKTSMHWVVRGKKDLMLEESDKTHTTKMENEECNEQRERKKEELASTAE
KTEQKEPKPLEKSVSPQNSDSSGIANLVAPATNIVTLPEGLLWQWSLLGSKSAGSFADVN
GWHLRFRWSKDAPSELLRKFRNYEI

>gi568815588f:67784722_68016590|GENSCAN_predicted_CDS_2|618_bp
atggttgaacaaatcctggcagaatttaaagtcagagctctggaatgtcacccagacaag
catcctgaaaaccccaaagctgtggagacttttcagaaactgcagaaggcaaaggagatt
ctgaccaatgaagagagtcgagcccgctatgaccactggcgaaggagccagatgtcgatg
ccattccagcagtgggaagctttgaatgactcagtgaagacgtcaatgcactgggttgtc
agaggtaaaaaagacctgatgctggaagaatctgacaagactcataccaccaagatggaa
aatgaggaatgtaatgagcaaagagaaagaaagaaagaggagctggcttcaaccgcagag
aaaacggagcagaaagaacccaagcccctagagaagtcagtctccccgcaaaattcagat
tcttcaggcattgccaaccttgtggctccagccacaaacattgtgactctgccagagggc
ttgctctggcaatggagcctacttggcagcaaatcagcaggcagttttgcagatgtgaat
ggttggcaccttcgtttccgctggtccaaggatgctccctcagaactcctgaggaagttc
agaaactatgaaatatga

>gi568815588f:67784722_68016590|GENSCAN_predicted_peptide_3|50_aa
MASTMVAVGLTIAAAGFAGHYVLQAMEHMEPQVKQVFQSLSKSAFSGGYY

>gi568815588f:67784722_68016590|GENSCAN_predicted_CDS_3|153_bp
atggccagtacaatggtagcagttggactgaccattgctgctgcaggatttgcaggccat
tatgttttgcaagccatggagcacatggagcctcaagtaaagcaagtttttcaaagtcta
tcaaaatctgccttcagtggtggctattactga

>gi568815588f:67784722_68016590|GENSCAN_predicted_peptide_4|296_aa
MRESLELLQDWLNGCDQNADGDMDSEVQANEVSDGNEELIENWNKGHSCYALPRNKAIWY
SCPRNLWKFELKSDDLGYLAEEISRQQSVQEVTWLLLTAYPQMQKQRNDLKLELIFKGKA
ECKSLENLQPSHVVENKNPFSGQESKQTAEQPLAKEICINEREPGIHRSTHKINEKQEKG
AKHISYHTEMEMSIACSVKGIASIARTAPQRNRKQASYFHTQGHRPPLRPPPSPRFKVLY
CRMKSAPFPSGFSKYEQRRHKTQVPETVPATIRPYFPTHGPLLPAKAGVLDTSFAL

>gi568815588f:67784722_68016590|GENSCAN_predicted_CDS_4|891_bp
atgagggaaagtttggaactcctgcaagactggttaaatggttgtgaccaaaatgctgac
ggtgatatggacagtgaagtccaggctaatgaggtctcagatggaaatgaggaacttatt
gagaactggaataaaggtcactcttgttatgccttaccaaggaacaaagctatatggtat
tcatgccctaggaatctgtggaagtttgaacttaagagtgatgacttagggtatctggca
gaagaaatttctaggcagcaaagtgttcaagaagtgacctggctgcttctaacagcctat
cctcagatgcagaagcaaagaaatgacttaaagttggaacttatatttaaagggaaagca
gaatgtaaaagtttggaaaatttgcaacctagccatgtggtagaaaacaaaaacccattt
tcagggcaagaatctaagcagactgcagagcaaccacttgctaaagagatttgcataaat
gaaagggagccaggaattcatagaagcactcacaaaataaatgaaaagcaggagaaagga
gcaaaacacataagctatcatacagagatggaaatgtccattgcgtgttcagttaaaggc
atcgcgagcatcgcgagaacggcgccacagaggaaccggaaacaagccagttactttcat
actcagggacaccgtccccccctccgccccccgccctcccctcgcttcaaagttctgtat
tgcagaatgaaatcggcgcctttccccagcggtttttccaaatacgagcaaaggaggcac
aaaacgcaggttccagaaactgtgcctgcaaccattcgtccttatttcccgacacacggg
cccctgctccccgctaaggctggggtcctggacacgagttttgcgctctga

>gi568815588f:67784722_68016590|GENSCAN_predicted_peptide_5|158_aa
MPPKFDPNKIKVVYLRCTGGEVGATSALAPKIGPLGLSPKKVGDDIAKATGDWKGLRITV
KLTIQNRQAQIEVVPSASALTIKALKEPPRDGKKQRNIKHSGNITFDEIIKVARQMRHRS
LARELSGTIKEILGTAQSVGCNVDGRHSHDIIDDINSI

>gi568815588f:67784722_68016590|GENSCAN_predicted_CDS_5|477_bp
atgccaccgaagttcgaccccaacaagatcaaagtcgtatacctgaggtgcaccggaggt
gaagtcggtgccacttctgccctggcccccaagatcggccccctgggtctgtctccaaaa
aaggttggtgatgacattgccaaggcaacgggtgactggaagggcctgaggattacagtg
aaactgacaattcagaacagacaagcccagattgaggtggtgccttctgcctccgccctg
accatcaaagccctcaaggagccaccaagagacggaaagaaacagagaaacattaaacac
agtgggaatatcacttttgatgagatcatcaaggttgctcgacagatgcggcaccgatcc
ttagccagagaactctctggaaccattaaagagatcctggggactgcccagtctgtgggc
tgtaatgttgatggccgccactctcatgacatcatagatgacatcaacagcatatag

>gi568815588f:67784722_68016590|GENSCAN_predicted_peptide_6|747_aa
MADEAALALQPGGSPSAAGADREAASSPAGEPLRKRPRRDGPGLERSPGEPGGAAPEREV
PAAARGCPGAAAAALWREAEAEAAAAGGEQEAQATAAAGEGDNGPGLQGPSREPPLADNL
YDEDDDDEGEEEEEAAAAAIGYRDNLLFGDEIITNGFHSCESDEEDRASHASSSDWTPRP
RIGPYTFVQQHLMIGTDPRTILKDLLPETIPPPELDDMTLWQIVINILSEPPKRKKRKDI
NTIEDAVKLLQECKKIIVLTGAGVSVSCGIPDFRSRDGIYARLAVDFPDLPDPQAMFDIE
YFRKDPRPFFKFAKEIYPGQFQPSLCHKFIALSDKEGKLLRNYTQNIDTLEQVAGIQRII
QCHGSFATASCLICKYKVDCEAVRGDIFNQVVPRCPRCPADEPLAIMKPEIVFFGENLPE
QFHRAMKYDKDEVDLLIVIGSSLKVRPVALIPSSIPHEVPQILINREPLPHLHFDVELLG
DCDVIINELCHRLGGEYAKLCCNPVKLSEITEKPPRTQKELAYLSELPPTPLHVSEDSSS
PERTSPPDSSVIVTLLDQAAKSNDDLDVSESKGCMEEKPQEVQTSRNVESIAEQMENPDL
KNVGSSTGEKNERTSVAGTVRKCWPNRVAKEQISRRLDGNQYLFLPPNRYIFHGAEVYSD
SEDDVLSSSSCGSNSDSGTCQSPSLEEPMEDESEIEEFYNGLEDEPDVPERAGGAGFGTD
GDDQEAINEAISVKQEVTDMNYPSNKS

>gi568815588f:67784722_68016590|GENSCAN_predicted_CDS_6|2244_bp
atggcggacgaggcggccctcgcccttcagcccggcggctccccctcggcggcgggggcc
gacagggaggccgcgtcgtcccccgccggggagccgctccgcaagaggccgcggagagat
ggtcccggcctcgagcggagcccgggcgagcccggtggggcggccccagagcgtgaggtg
ccggcggcggccaggggctgcccgggtgcggcggcggcggcgctgtggcgggaggcggag
gcagaggcggcggcggcaggcggggagcaagaggcccaggcgactgcggcggctggggaa
ggagacaatgggccgggcctgcagggcccatctcgggagccaccgctggccgacaacttg
tacgacgaagacgacgacgacgagggcgaggaggaggaagaggcggcggcggcggcgatt
gggtaccgagataaccttctgttcggtgatgaaattatcactaatggttttcattcctgt
gaaagtgatgaggaggatagagcctcacatgcaagctctagtgactggactccaaggcca
cggataggtccatatacttttgttcagcaacatcttatgattggcacagatcctcgaaca
attcttaaagatttattgccggaaacaatacctccacctgagttggatgatatgacactg
tggcagattgttattaatatcctttcagaaccaccaaaaaggaaaaaaagaaaagatatt
aatacaattgaagatgctgtgaaattactgcaagagtgcaaaaaaattatagttctaact
ggagctggggtgtctgtttcatgtggaatacctgacttcaggtcaagggatggtatttat
gctcgccttgctgtagacttcccagatcttccagatcctcaagcgatgtttgatattgaa
tatttcagaaaagatccaagaccattcttcaagtttgcaaaggaaatatatcctggacaa
ttccagccatctctctgtcacaaattcatagccttgtcagataaggaaggaaaactactt
cgcaactatacccagaacatagacacgctggaacaggttgcgggaatccaaaggataatt
cagtgtcatggttcctttgcaacagcatcttgcctgatttgtaaatacaaagttgactgt
gaagctgtacgaggagatatttttaatcaggtagttcctcgatgtcctaggtgcccagct
gatgaaccgcttgctatcatgaaaccagagattgtgttttttggtgaaaatttaccagaa
cagtttcatagagccatgaagtatgacaaagatgaagttgacctcctcattgttattggg
tcttccctcaaagtaagaccagtagcactaattccaagttccataccccatgaagtgcct
cagatattaattaatagagaacctttgcctcatctgcattttgatgtagagcttcttgga
gactgtgatgtcataattaatgaattgtgtcataggttaggtggtgaatatgccaaactt
tgctgtaaccctgtaaagctttcagaaattactgaaaaacctccacgaacacaaaaagaa
ttggcttatttgtcagagttgccacccacacctcttcatgtttcagaagactcaagttca
ccagaaagaacttcaccaccagattcttcagtgattgtcacacttttagaccaagcagct
aagagtaatgatgatttagatgtgtctgaatcaaaaggttgtatggaagaaaaaccacag
gaagtacaaacttctaggaatgttgaaagtattgctgaacagatggaaaatccggatttg
aagaatgttggttctagtactggggagaaaaatgaaagaacttcagtggctggaacagtg
agaaaatgctggcctaatagagtggcaaaggagcagattagtaggcggcttgatggtaat
cagtatctgtttttgccaccaaatcgttacattttccatggcgctgaggtatattcagac
tctgaagatgacgtcttatcctctagttcttgtggcagtaacagtgatagtgggacatgc
cagagtccaagtttagaagaacccatggaggatgaaagtgaaattgaagaattctacaat
ggcttagaagatgagcctgatgttccagagagagctggaggagctggatttgggactgat
ggagatgatcaagaggcaattaatgaagctatatctgtgaaacaggaagtaacagacatg
aactatccatcaaacaaatcatag

>gi568815588f:67784722_68016590|GENSCAN_predicted_peptide_7|765_aa
XQHTSAFVPSSGRIYSFGLGGNGQLGTGSTSNRKSPFTVKGNWYPYNGQCLPDIGHLKAL
QKELEQFAKLLKQKRITLGYAQADVGLTLGVLFGNMFSQTTICLFEAQQLSFKNMFVWNQ
TLNIFEVCLYHILFIYSYISGRLGCFYLLAIVNNAAMDVGVQNCGPPDDFRCPNPTKQIW
TVNEALIQKWLSYPSGRFPVEIANNDDHYRTGTRFSGVDMNAARLLFHKLIQPDHPQISQ
QVAASLEKNLIPKLTSSLPDVEALRFYLTLPECPLMSDSNNFTTIAIPFGTALVNLEKAP
LKVLVVSWAPSGIYGQEAVNEKMGQIIQYDKFYIHEVQELIDIRNDYINWVQQQAYGMLA
DIPVTICTYPFVFDAQAKTTLLQTDAVLQMQMAIDQAHRQNVSSLFLPVIESVNPCLILV
VRRENIVGDAMEVLRKTKNIDYKKPLKVIFVGEDAVDAGGVRKEFFLLIMRELLDPKYGM
FRYYEDSRLIWFSDKTFEDSDLFHLIGVICGLAIYNCTIVDLHFPLALYKKLLKKKPSLD
DLKELMPDVGRSMQQLLDYPEDDIEETFCLNFTITVENFGATEVKELVLNGADTAVNKQN
RQEFVDAYVDYIFNKSVASLFDAFHAGFHKVCGGKVLLLFQPNELQAMVIGNTNYDWKEL
EKNTEYKGEYWAEHPTIKIFWEVFHELPLEKKKQFLLFLTGSDRIPILGMKSLKLVIQST
GGGEEYLPVSHTCFNLLDLPKYTEKETLRSKLIQAIDHNEGFSLI

>gi568815588f:67784722_68016590|GENSCAN_predicted_CDS_7|2298_bp
nngcagcacacttctgcttttgttccttcatcaggacgaatttactcttttgggcttggt
ggtaatgggcagctgggaaccggttcaacaagcaacaggaaaagcccctttactgtaaaa
ggaaattggtacccctataatgggcagtgtctaccagatattggacatctcaaagctctg
cagaaagaacttgagcaatttgccaagctcctgaagcagaagaggatcaccctgggatat
gcacaggctgatgtggggcttaccctgggggttctatttgggaatatgttcagccaaacg
accatctgcctctttgaggctcaacagcttagcttcaagaacatgtttgtctggaaccaa
accctaaatatctttgaggtatgcttataccacattttgtttatctattcatatatcagt
ggacgattgggttgtttctaccttttggctattgtgaataatgctgctatggatgtaggt
gtacagaactgtgggccaccagatgacttcagatgtcccaatccgacaaagcagatctgg
acagtgaatgaagctctaattcagaaatggctgagctatccttctggaaggtttcctgtg
gagatagccaacaatgatgatcactatagaacaggtaccagattttcaggggttgatatg
aatgctgctaggcttttattccacaaacttatacaacctgatcatccgcagatatctcag
caggtggcagctagtttggaaaagaatcttattcctaaactgactagctccttacctgat
gttgaagcattgaggttttatcttactctaccagaatgtcccctgatgagtgattccaac
aatttcacaacaatagcaattccctttggtacagctcttgtgaacctagaaaaggcacca
ctgaaagtacttgttgtttcatgggcacccagtggcatctatggacaagaggctgtaaat
gagaaaatgggacagattatacagtatgataaattttatatacatgaagtacaagaattg
atagacataagaaatgattatatcaactgggtccaacagcaggcctatggaatgttggca
gatatccctgttacaatctgtacatatccatttgtatttgatgcccaagcaaaaactact
ctgttacagaccgatgcagtcttacagatgcagatggctattgatcaggcccacaggcag
aatgtctcctctctttttctcccagtgattgaatctgtgaatccctgcttaattctagtg
gtgcgtagagaaaatattgtaggagatgcaatggaagtccttaggaaaacaaagaacata
gattacaagaagccactcaaggttatatttgttggagaagatgctgtggatgcaggaggg
gtgcgcaaagaatttttcttgctcatcatgagggaattattggatcctaaatacggcatg
tttaggtattatgaagattccaggctcatttggttttctgataagacatttgaagacagt
gatttgttccatttgattggtgttatctgtggcttagcaatttataattgtaccattgtg
gacctccattttcctttggctttatataagaaactactgaaaaagaagccatccttggat
gatttgaaagaactaatgcctgatgttgggagaagcatgcaacagttactggattatcca
gaagatgacatagaggaaacattttgtcttaattttacgatcacagttgaaaactttggt
gcaacagaagtgaaagagctggttctaaatggtgcagacacagctgttaacaaacaaaat
cggcaagagtttgtcgatgcttatgtggattacatattcaataaatcagtggcttcctta
tttgatgcttttcatgcgggctttcataaggtctgtggaggaaaagtccttctgctcttt
cagcctaatgaactacaagcaatggtcattggaaatacaaattatgattggaaggaactg
gaaaagaatacagaatacaaaggggaatattgggcagaacatcctacgataaaaattttt
tgggaagtatttcacgaattaccattggaaaagaagaaacagtttctgttatttttgaca
ggtagtgatcgcattcctattcttggtatgaagagtctgaaactagtcatccagtccaca
ggaggtggtgaggagtatctcccagtttcccatacttgttttaatcttctggatcttcca
aaatatacagaaaaagaaactctacgctctaaactgatccaagctattgatcacaatgaa
ggcttcagtttaatataa