GENSCAN 1.0    Date run: 2-Nov-116    Time: 18:56:54

Sequence gi568815580f:58222440_58496266 : 273827 bp : 43.80% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4860   5010  151  2  1   61  105    57 0.535   4.92
 1.02 Term +   5716   5831  116  2  2   53   44   105 0.796   1.33
 1.03 PlyA +   8123   8128    6                               1.05

 2.00 Prom +  11002  11041   40                              -2.46
 2.01 Init +  33202  34335 1134  0  0  115  127  1054 0.996 106.09
 2.02 Intr +  34447  34505   59  0  2   65   58    28 0.876  -4.82
 2.03 Term +  35230  35305   76  1  1   97   40   124 0.966   5.81
 2.04 PlyA +  36082  36087    6                               1.05

 3.00 Prom +  42489  42528   40                              -2.46
 3.01 Init +  50675  50704   30  1  0   65   89    36 0.012   1.16
 3.02 Intr +  93543  93593   51  2  0  100  107    59 0.664   8.10
 3.03 Intr +  99986 100047   62  1  2  121  100    32 0.696   5.23
 3.04 Intr + 100793 100895  103  2  1   68   94    86 0.974   7.38
 3.05 Intr + 102557 102723  167  1  2   96   76    48 0.955   3.16
 3.06 Intr + 106556 106688  133  2  1   74   80   252 0.862  23.65
 3.07 Intr + 108299 108475  177  1  0   39   87    94 0.809   4.52
 3.08 Intr + 111379 111453   75  0  0  122   48    13 0.516   0.41
 3.09 Intr + 117886 117970   85  0  1   48   29    71 0.262  -3.41
 3.10 Intr + 118642 118730   89  2  2   64   96    57 0.485   3.79
 3.11 Intr + 119239 119358  120  0  0   54   78    57 0.779   1.99
 3.12 Intr + 120467 120664  198  1  0   80   82    92 0.910   7.35
 3.13 Intr + 141829 141894   66  0  0   66   76    46 0.137   0.30
 3.14 Intr + 143560 143789  230  0  2   84  113   201 0.970  18.87
 3.15 Intr + 145307 145428  122  2  2   71  115    87 0.999   9.84
 3.16 Intr + 147958 148028   71  2  2   88  103    43 0.633   4.70
 3.17 Intr + 150735 150830   96  2  0   87   88   113 0.983  11.41
 3.18 Term + 152319 152552  234  2  0   77   54   110 0.864   2.82
 3.19 PlyA + 153791 153796    6                               1.05

 4.03 PlyA - 154024 154019    6                               1.05
 4.02 Term - 156874 156445  430  0  1  101   42   207 0.527  12.17
 4.01 Init - 158454 158447    8  0  2   55   95     0 0.728  -2.20
 4.00 Prom - 158528 158489   40                              -7.06

 5.00 Prom + 158621 158660   40                              -5.16
 5.01 Init + 162938 163032   95  1  2   81   47    55 0.871  -0.83
 5.02 Intr + 163087 163147   61  1  1  107   99    84 0.957  10.14
 5.03 Intr + 165000 165059   60  2  0  108   68    50 0.938   3.93
 5.04 Intr + 166646 166753  108  1  0   63   88    98 0.994   7.78
 5.05 Intr + 168207 168303   97  2  1   97   89   114 0.999  11.98
 5.06 Intr + 169048 169120   73  2  1  104  101    19 0.996   3.26
 5.07 Intr + 173728 173821   94  1  1  113   -3   181 0.106  11.47
 5.08 Intr + 187512 187648  137  2  2  107   77   128 0.985  12.97
 5.09 Intr + 189775 189799   25  1  1   97  102   -13 0.288  -0.97
 5.10 Intr + 198348 198417   70  2  1   60  103    52 0.160   2.65
 5.11 Term + 226870 226979  110  2  2  114   43    68 0.507   3.57
 5.12 PlyA + 228810 228815    6                               1.05

 6.04 PlyA - 232406 232401    6                               1.05
 6.03 Term - 232849 232747  103  0  1  121   52    62 0.311   3.65
 6.02 Intr - 238658 238519  140  2  2   -9  103   104 0.173   1.46
 6.01 Init - 244042 243911  132  1  0   60   97   136 0.382   9.85
 6.00 Prom - 255474 255435   40                              -3.76

 7.03 PlyA - 259429 259424    6                               1.05
 7.02 Term - 263511 263403  109  2  1   80   47    95 0.796   2.58
 7.01 Init - 264576 264518   59  0  2   95   98    12 0.373   3.71

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 173728 173830  103  1  1  113   47   183 0.863  14.45

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580f:58222440_58496266|GENSCAN_predicted_peptide_1|88_aa
MGEKSRPRRLRRVCVRESPGSLPGGVLKDLLIVILVSVEAVVSSELSVFGDPLCDWNVMH
TSDQGMALFARSFQTNSRNGHISKQLRY

>gi568815580f:58222440_58496266|GENSCAN_predicted_CDS_1|267_bp
atgggagaaaaaagcagaccccggagacttaggagggtctgtgtgagggagtctccaggc
agtcttcctgggggtgtcctgaaggacctcctgatcgtgattttggtcagtgtggaggct
gtggtttcttctgagctgtcagtatttggagacccactgtgtgactggaatgtgatgcac
acaagtgaccaaggcatggccctgttcgcaaggagcttccagaccaatagcagaaatgga
cacataagtaaacaactgcgctactga

>gi568815580f:58222440_58496266|GENSCAN_predicted_peptide_2|422_aa
MAHRLRFHFGSGRSNTAPESDILDQEREDDFFMAFHTLPRRSSPHPFAQNGGEDGGGGLQ
GGVGALKRSSSMFIPQLLTSIDARPTCSSSVQISLQRKATDGATDGCGPPEGADDGPPCA
TPDPRDQASATATTRASPQSGSREPSPRDTPGSSPPRAARDPGLQVNGTCGRRVRCSGPV
DCAEEAAPGLRIQHRASSADVRQVRLLPLGPDGQGGPAAAEPRRWSLQHVPDASGSSGKR
CFVFQLQQPQQGASGPGSDLNFGFTGTKGDRLVRYPRIRLERSTSYPTQPRSERGSPTED
RGALEASPRAGRMAPEIRRTNSAERTPQGQGCTFKIRQDQNAGQQHFRILVTRGPEEAPQ
NPEEKSAKSPVSTGADTTASGSECLAEFWHDDLWSSVCEMEYRETAARSIDQAYDDSAAF
VV

>gi568815580f:58222440_58496266|GENSCAN_predicted_CDS_2|1269_bp
atggcccatcggcttcggtttcattttggctctggtcgcagcaacacagcccccgaatca
gacatcctagaccaggagagagaagacgacttcttcatggcattccacaccctaccgcgg
agaagcagcccgcaccccttcgcccagaacggaggggaggacggcggcggaggcctgcag
ggaggcgtgggtgcgcttaagcggagctcgtccatgttcatcccgcagctcttgaccagc
atcgacgcccgccccacgtgcagctcctccgtgcagatctccctgcagcgcaaggccacg
gacggggccacggacgggtgcgggccgcccgagggcgccgacgatgggcctccatgcgca
acgcccgaccccagggaccaggcctccgccactgccaccacgagggcctcgccccagagt
ggctcccgggagccctcgccgagggacacccccgggagctcccctccgagggcagcccgg
gacccagggctccaggtcaacggcacgtgcggccgccgcgtgcggtgctccggccccgtg
gactgcgcggaggaggctgccccgggcctgcgcatccagcaccgcgcctccagcgccgac
gtgcgccaggtgaggctgctgcccctgggccccgatggccagggcggcccggccgcggca
gagcccaggcgctggtccctgcagcacgttccagatgcttctggaagctctgggaagcgg
tgttttgtcttccagttgcagcagccccaacaaggcgcttcggggccaggcagcgacctc
aactttggcttcacgggcacaaagggggacaggttggtgaggtatcctcgcattcggctg
gagaggagcacctcgtaccccacgcagccccgaagcgagcgagggagccccacggaagat
cggggagccctggaggcatcgcctcgagctggcaggatggctcctgaaatccgcaggacg
aactccgcggagaggactccgcagggccaggggtgcacatttaagatcaggcaggatcag
aacgcggggcagcagcattttagaattcttgtaacccgggggccggaagaagctccccag
aatcccgaggagaaaagcgccaaaagccctgtttccacgggagctgacaccacggccagt
ggatctgaatgtttggccgagttctggcacgatgatttgtggtccagtgtttgtgaaatg
gagtacagagagacagctgcacgctccatcgaccaggcatatgatgactctgcagctttc
gtggtttag

>gi568815580f:58222440_58496266|GENSCAN_predicted_peptide_3|702_aa
MLEDQKGIIQTRDDFLGQVDVPLSHLPTEDPTMERPYTFKDFLLRPRSHKSRVKGFLRLK
MAYMPKNGGQDEENSDQRDDMEHGWEVVDSNDSASQHQEELPPPPLPPGWEEKVDNLGRT
YYVNHNNRTTQWHRPSLMDVSSESDNNIRQINQEAAHRRFRSRRHISEDLEPEPSEGGDV
PEPWETISEEVNIAGDSLGLALPPPPASPGSRTSPQELSEELSRRLQITPDSNGEQFSSL
IQREPSSRLRSCSVTDAVAEQGHLPPVSKPKNCQAVFTFKETHSYQQTWVHIAWGWEERK
DAKGRTYYVNHNNRTTTWTRPIMQLAEDGASGSATNSNNHLIEPQIRRPRSLSSPTVTLS
APLEGAKDSPVRRAVKDTLSNPQSPQPSPYNSPKPQHKVTQSFLPPGWEMRIAPNGRPFF
IDHNTKTTTWAVPYSREFKQKYDYFRKKLKKPADIPNRFEMKLHRNNIFEESYRRIMSVK
RPDVLKARLWIEFESEKGLDYGGVAREWFFLLSKEMFNPYYGLFEYSATDNYTLQINPNS
GLCNEDHLSYFTFIGRVAGLAVFHGKLLDGFFIRPFYKMMLGKQITLNDMESVDSEYYNS
LKWILENDPTELDLMFCIDEENFGQSPGSACSSAGPVDAGVTLPAVLHPLLFCLTLAIWI
SSSYLWLRLHPSPLTTSLLSQAGVCLPASRIPRSNGLLNCFS

>gi568815580f:58222440_58496266|GENSCAN_predicted_CDS_3|2109_bp
atgctggaggatcagaaagggatcatccaaacacgagacgacttcctgggccaggtggac
gtgccccttagtcaccttccgacagaagatccaaccatggagcgaccctatacatttaag
gactttctcctcagaccaagaagtcataagtctcgagttaagggatttttgcgattgaaa
atggcctatatgccaaaaaatggaggtcaagatgaagaaaacagtgaccagagggatgac
atggagcatggatgggaagttgttgactcaaatgactcggcttctcagcaccaagaggaa
cttcctcctcctcctctgcctcccgggtgggaagaaaaagtggacaatttaggccgaact
tactatgtcaaccacaacaaccggaccactcagtggcacagaccaagcctgatggacgtg
tcctcggagtcggacaataacatcagacagatcaaccaggaggcagcacaccggcgcttc
cgctcccgcaggcacatcagcgaagacttggagcccgagccctcggagggcggggatgtc
cccgagccttgggagaccatttcagaggaagtgaatatcgctggagactctctcggtctg
gctctgcccccaccaccggcctccccaggatctcggaccagccctcaggagctgtcagag
gaactaagcagaaggcttcagatcactccagactccaatggggaacagttcagctctttg
attcaaagagaaccctcctcaaggttgaggtcatgcagtgtcaccgacgcagttgcagaa
cagggccatctaccaccggtctctaaacccaaaaactgccaagctgtgttcaccttcaaa
gaaacacactcctatcagcagacttgggtgcacatagcctggggctgggaagaaagaaaa
gatgctaaggggcgcacatactatgtcaatcataacaatcgaaccacaacttggactcga
cctatcatgcagcttgcagaagatggtgcgtccggatcagccacaaacagtaacaaccat
ctaatcgagcctcagatccgccggcctcgtagcctcagctcgccaacagtaactttatct
gccccgctggagggtgccaaggactcacccgtacgtcgggctgtgaaagacaccctttcc
aacccacagtccccacagccatcaccttacaactcccccaaaccacaacacaaagtcaca
cagagcttcttgccacccggctgggaaatgaggatagcgccaaacggccggcccttcttc
attgatcataacacaaagactacaacctgggctgtcccttactccagagaatttaagcag
aaatatgactacttcaggaagaaattaaagaaacctgctgatatccccaataggtttgaa
atgaaacttcacagaaataacatatttgaagagtcctatcggagaattatgtccgtgaaa
agaccagatgtcctaaaagctagactgtggattgagtttgaatcagagaaaggtcttgac
tatgggggtgtggccagagaatggttcttcttactgtccaaagagatgttcaacccctac
tacggcctctttgagtactctgccacggacaactacacccttcagatcaaccctaattca
ggcctctgtaatgaggatcatttgtcctacttcacttttattggaagagttgctggtctg
gccgtatttcatgggaagctcttagatggtttcttcattagaccattttacaagatgatg
ttgggaaagcagataaccctgaatgacatggaatctgtggatagtgaatattacaactct
ttgaaatggatcctggagaatgaccctactgagctggacctcatgttctgcatagacgaa
gaaaactttggacagtcccctggaagtgcatgctcctctgctggccctgtggatgcaggt
gtcaccctgccagctgtcctgcacccgctgcttttttgtctcacccttgctatctggatt
tcctcatcctacctgtggctccggctgcacccatctccgctcaccaccagcctcttatct
caggccggcgtctgccttccagcctccagaattcccaggtctaacggcctcctcaactgc
ttctcctga

>gi568815580f:58222440_58496266|GENSCAN_predicted_peptide_4|145_aa
MKRCCGSALAPVMRVDSIPEAEIQQSDQGETANCRWSWCWHPGALLPGEQSAEPALWGGF
LPPSATKRTPRPPRAFWAIALLCLGSWRTPHKEHNAQPQQALSKSSWENTAVEKFLFKST
GPSINLALFLTEIRGTSGGLPALAS

>gi568815580f:58222440_58496266|GENSCAN_predicted_CDS_4|438_bp
atgaaaaggtgctgcggcagcgccctggcccccgtcatgagggtggacagcatcccggag
gcagaaatccagcaaagtgaccaaggagaaacagcgaactgcagatggagctggtgctgg
cacccaggggccctgctccctggtgagcagtctgcagagcctgctctgtggggtggattc
ttgccaccttcagccacgaaacgcacaccccgccctcccagggctttctgggccatcgcc
ctcctgtgcctgggctcctggaggaccccacacaaggaacacaatgcccagccacagcag
gcgctcagcaagagctcgtgggagaacactgcagtagaaaaattcctctttaagtctact
ggccccagcataaacttggctctcttcctcacagagatcagaggaacttctgggggactc
cctgccttggccagttaa

>gi568815580f:58222440_58496266|GENSCAN_predicted_peptide_5|309_aa
MNKLNSGGTEETLPSALLSWMEEKHSLLRRKHLVIQWRFVNRVQKQMNAFLEGFTELLPI
DLIKIFDENELELLMCGLGDVDVNDWRQHSIYKNGYCPNHPVIQWFWKAVLLMDAEKRIR
LLQFVTGTSRVPMNGFAELYGSNGPQLFTIEQWGSPEKLPRAHTCFNRLDLPPYETFEDL
REKLLMAVENAQGFEGFPVQFALVQPNSNIYILLQKPFKGLPVLTNENETKIYTPYEQGH
SSSVGLGASKRTSEVISKVESIRCGSTLSPELHDYSKPKANRAPMEKWRMLRYCYGALRS
EMGPSPCPG

>gi568815580f:58222440_58496266|GENSCAN_predicted_CDS_5|930_bp
atgaataaactaaacagtgggggcacagaggagaccctcccctctgctctgctgtcgtgg
atggaggagaagcactccctgttgcggagaaagcacttagtcatccagtggagatttgtg
aacagggtccagaagcagatgaacgccttcttggagggattcacagaactacttcctatt
gatttgattaaaatttttgatgaaaatgagctggagttgctcatgtgcggcctcggtgat
gtggatgtgaatgactggagacagcattctatttacaagaacggctactgcccaaaccac
cccgtcattcagtggttctggaaggctgtgctactcatggacgccgaaaagcgtatccgg
ttactgcagtttgtcacagggacatcgcgagtacctatgaatggatttgccgaactttat
ggttccaatggtcctcagctgtttacaatagagcaatggggcagtcctgagaaactgccc
agagctcacacatgctttaatcgccttgacttacctccatatgaaacctttgaagattta
cgagagaaacttctcatggccgtggaaaatgctcaaggatttgaagggttcccagtccag
tttgcactggtgcagcccaactccaatatctacattctcttgcagaaacccttcaaaggc
ttgcctgtattgaccaatgaaaatgaaactaaaatctacactccttatgagcaagggcat
tcaagctcagttgggctgggtgcttccaagagaacatcagaggtcatcagcaaggtggaa
tctattcgctgtggttcaactttgtctcccgaactgcatgattattcaaagccaaaagcc
aacagagcccccatggagaagtggaggatgctcagatactgctatggagccctgagaagt
gaaatgggtccaagcccttgcccaggatag

>gi568815580f:58222440_58496266|GENSCAN_predicted_peptide_6|124_aa
MWHWKRPRLSMVVGGAGGVWTSSPGHSMGLSSEAAMEVSPLSAQISKLEAYTGDTSSYAN
RKGIPGRHAYSTRKFPLPFSTMCAVSMVLVRNLDIKEKKADAPILLPLHKAWGLPPQIYT
ERLS

>gi568815580f:58222440_58496266|GENSCAN_predicted_CDS_6|375_bp
atgtggcactggaagaggcccaggctcagcatggtggtggggggggcgggtggcgtctgg
acatcgtctcctggacactccatgggcctgagcagtgaagctgccatggaagtatcaccc
ctcagtgcccagataagcaagctggaagcctacacaggtgacaccagcagctatgccaac
agaaaagggatacctggacgccacgcatattcaacacggaagtttcctcttccctttagc
accatgtgtgcagtcagtatggtgctggtcagaaacttggatatcaaagagaagaaagca
gatgcccccattctgctacctcttcacaaggcctgggggcttccgccacagatttacact
gagcggctttcctga

>gi568815580f:58222440_58496266|GENSCAN_predicted_peptide_7|55_aa
MAYSSRFNPQGSGWHMSHVRDQMLSEFEKPRGSPDANDRDLFPMCSDREAAPAHS

>gi568815580f:58222440_58496266|GENSCAN_predicted_CDS_7|168_bp
atggcctacagctccagattcaaccctcaaggcagcgggtggcacatgtcgcatgtaaga
gaccagatgttgtctgaatttgaaaagcctcgaggcagtcctgatgcaaatgaccgggac
ttgtttccaatgtgctcagacagggaagctgcccctgcccactcgtga