GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:12:58

Sequence gi568815593f:79136651_79416304 : 279654 bp : 40.74% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  18374  18517  144  1  0   62   54   121 0.809   6.27
 1.02 Term +  20438  20497   60  1  0  113   32    38 0.657  -2.47
 1.03 PlyA +  21218  21223    6                               1.05

 2.00 Prom +  42658  42697   40                              -4.35
 2.01 Init +  51678  51857  180  2  0   68   76   130 0.393   9.03
 2.02 Term +  71538  71735  198  2  0   32   48   240 0.508  10.82
 2.03 PlyA +  72371  72376    6                               1.05

 3.00 Prom +  72691  72730   40                              -6.05
 3.01 Init + 100001 101032 1032  1  0  106  105  1088 0.543 106.23
 3.02 Term + 119530 119619   90  0  0   74   41   140 0.283   4.64
 3.03 PlyA + 119927 119932    6                              -0.45

 4.04 PlyA - 120400 120395    6                               1.05
 4.03 Term - 122280 121836  445  2  1   56   53   147 0.458   1.72
 4.02 Intr - 122999 122847  153  1  0   64   92    41 0.409   0.37
 4.01 Init - 123450 123071  380  0  2   79   48   148 0.402   4.37
 4.00 Prom - 136443 136404   40                              -4.65

 5.00 Prom + 139416 139455   40                              -5.95
 5.01 Init + 140793 140795    3  2  0  108   81     0 0.852   1.35
 5.02 Intr + 141260 141433  174  1  0   79   63   255 0.741  21.31
 5.03 Term + 141753 141794   42  2  0  105   38     9 0.563  -6.22
 5.04 PlyA + 142687 142692    6                               1.05

 6.04 PlyA - 143919 143914    6                               1.05
 6.03 Term - 147638 147482  157  1  1   43   39   266 0.941  13.62
 6.02 Intr - 147918 147644  275  0  2  -10  -25   279 0.431   2.21
 6.01 Init - 148268 148065  204  2  0  113   81   214 0.453  22.10
 6.00 Prom - 148357 148318   40                              -7.55

 7.00 Prom + 151042 151081   40                              -7.95
 7.01 Init + 153477 153621  145  2  1   46  100   159 0.746  13.23
 7.02 Intr + 154480 154649  170  2  2   21   77   274 0.996  18.34
 7.03 Intr + 163503 163668  166  2  1   52   80   140 0.922   8.21
 7.04 Intr + 164026 164213  188  2  2   81   63   254 0.998  20.69
 7.05 Intr + 169725 169811   87  2  0   80   94   114 0.994  10.45
 7.06 Intr + 177607 178201  595  0  1   60   94   347 0.895  23.71
 7.07 Term + 179350 179657  308  2  2   73   42   248 0.997  12.99
 7.08 PlyA + 183654 183659    6                               1.05

 8.00 Prom + 194067 194106   40                              -5.05
 8.01 Init + 197567 197618   52  1  1   74   87    67 0.188   6.57
 8.02 Intr + 198396 198522  127  1  1   31   85   124 0.086   5.22
 8.03 Term + 204001 204292  292  1  1   57   41   188 0.053   4.83
 8.04 PlyA + 204526 204531    6                               1.05

 9.04 PlyA - 205860 205855    6                               1.05
 9.03 Term - 206601 206531   71  1  2   87   48    91 0.097   2.12
 9.02 Intr - 227933 227807  127  2  1   26   71    83 0.171  -0.27
 9.01 Init - 231882 231787   96  0  0   54   78   135 0.943   9.46
 9.00 Prom - 237010 236971   40                              -6.05

10.05 PlyA - 237335 237330    6                               1.05
10.04 Term - 239547 239359  189  0  0   40   49   260 0.784  13.77
10.03 Intr - 260253 260173   81  0  0   97   99    82 0.977   9.12
10.02 Intr - 260987 260877  111  2  0   95   53    97 0.971   6.56
10.01 Intr - 265405 265249  157  0  1   49   58   158 0.397   7.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 240517 240515    3  1  0  113   22     0 0.989  -4.05

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:79136651_79416304|GENSCAN_predicted_peptide_1|67_aa
MAFEKLKKIKPKERGTKDEAGEVGKDSVTKHLAGHKDFGLHPKSSGKMLLTQFQPLSSLT
QTMETDL

>gi568815593f:79136651_79416304|GENSCAN_predicted_CDS_1|204_bp
atggcctttgagaaactaaagaagattaagccaaaagagagagggactaaagatgaggca
ggagaggtaggcaaggattctgtcacaaagcaccttgcaggccataaggattttggactt
catcctaagagcagtgggaaaatgctactaactcagttccagcccttatcctctctcacc
cagacaatggaaacagatttataa

>gi568815593f:79136651_79416304|GENSCAN_predicted_peptide_2|125_aa
MKVVAMASVEGKWRVSEGQEVGVRQVVAEEWVGPTDRVTPPAGQAISVDFVYRKQASYVL
PGQQSKTLSQKKKKKKEEKEEEEGEGGEEGQEEGEEKVEEEEGREGEEGEGGEEEEDDDD
STSQI

>gi568815593f:79136651_79416304|GENSCAN_predicted_CDS_2|378_bp
atgaaggtggtggcaatggccagtgttgaggggaaatggagagtatctgaaggacaggag
gtaggagtaagacaggtggtagcagaggaatgggtgggacccactgacagagtgacacct
ccagctggccaagctatcagtgtggactttgtgtataggaaacaagcttcatacgtattg
cctgggcaacagagcaagactctgtctcaaaaaaaaaaaaaaaaaaaagaagaaaaggaa
gaggaagaaggagaaggaggagaagaaggacaggaagaaggagaagagaaggtggaggag
gaggaaggaagagaaggagaagaaggagaaggaggagaagaagaagaagacgacgacgac
agtacatctcaaatttaa

>gi568815593f:79136651_79416304|GENSCAN_predicted_peptide_3|373_aa
MSFALEETLESDWVAVRPHVFDEREKHKFVFIVAWNEIEGKFAITCHNRTAQRQRSGSRE
QAGARGGAEAGGAASDGSRGPGSPAGRGRPEATASATLVRSPGPRRSSAWAEGGSPRSTR
SLLGDPRLRSPGSKGAESRLRSPVRAKPIPGQKTSEADDAAGAAAAAARPAPREAQVSSV
RIVSASGTVSEEIEVLEMVKEDEAPLALSDAEQPPPATELESPAEECSWAGLFSFQDLRA
VHQQLCSVNSQLEPCLPVFPEEPSGMWTVLFGGAPEMTEQEIDTLCYQLQVYLGHGLDTC
GWKILSQVLFTETDDPEEYYESLSELRQKGYEEVLQRARKRIQEWQRNLIPEEQAQGEYW
QTAADVPFRPKGS

>gi568815593f:79136651_79416304|GENSCAN_predicted_CDS_3|1122_bp
atgtcgttcgcgctggaggagacgctcgagtcggactgggtggctgtgcggccccatgtg
ttcgacgagcgcgagaaacacaaattcgtctttattgtggcctggaacgagattgagggc
aagtttgccataacctgccacaaccggacggcccagaggcagaggagcggctcccgggag
caagcgggggcgcgagggggcgccgaggccggcggagctgcgtccgacgggagccgcggg
cccggcagcccggcgggcaggggtcggcccgaggccactgcctctgcaactctggttagg
agccccgggccccggcggagctcggcctgggcggagggcggctctcctcggagcactcgc
agccttctgggggacccgcggctgcggagtcctggcagcaaaggggcggagagtcgtctt
aggagcccagtgcgggccaaacccatcccgggtcagaaaacatctgaagccgacgatgcg
gcgggggcagccgctgcagcagcccggccggcgcccagagaggcccaggtgtcctctgta
cggatagtgagcgcctctgggacggtctccgaggagatagaggtgctggaaatggtgaag
gaggacgaggcacctctggcgctctcggacgcggagcagccgccgcccgccaccgagctg
gagtctccggccgaagagtgcagctgggccggactgttttctttccaggacctgcgcgcc
gtgcaccagcagctgtgctcggtgaactcgcagttggagccgtgcctgccggtgttcccc
gaggaaccttcgggcatgtggactgtgctgtttgggggcgcccccgagatgaccgagcag
gaaatcgacactctgtgttaccagctccaggtctacctgggccacggcctggacacctgc
ggctggaagatcctctcccaggtgctcttcaccgagaccgatgatcccgaggagtattac
gaaagcctcagcgagctgcggcagaagggctacgaagaagtgcttcagcgggccaggaag
cgcatccaggagtggcagaggaacctcatcccagaggaacaagcccagggggagtactgg
cagactgctgctgatgttcccttcaggcccaagggctcttaa

>gi568815593f:79136651_79416304|GENSCAN_predicted_peptide_4|325_aa
MPPVPRPLSWETLQWGQAELPASRVAVQSGTEGQAERGPEAEVGPGCAYAMLHGASGSQG
HAGFPPSQGWLQLPKLQLQIQASLHSPGPGKAPPCPCRLGGVCSHCLDFPQYQRLFHSQG
KVEANPSLLRTLGTNEHRREAKGVLRAAQRWPTDTPWHKQPGHHKWQQEADKLLGRSGTR
TQDPGNGRAKRAVTQTGLKHPLPTMLQPTRGQKSCGPLGSPDLGAPQARAVTPSLRRCSS
WSLQASRHHTAFPGVSHGNCLWYPWSSHSLIGSWCQYWHLELPAPPPPECLVVCSGQTPR
SLTHPSPLHDWLSLGRHGTQTGSRS

>gi568815593f:79136651_79416304|GENSCAN_predicted_CDS_4|978_bp
atgccacctgtacctcgaccactgtcctgggagacactgcaatggggccaggctgagttg
cctgccagcagggtagcagtacagtcgggcactgaggggcaggcagagaggggccctgag
gcagaggtgggcccaggatgtgcatatgccatgctccatggagccagtgggagccagggg
catgcaggattcccaccctcccaggggtggctacagctgcccaagctacagctgcagatc
caggcatctctgcactctccggggcctgggaaggccccaccttgcccctgcaggctcgga
ggtgtctgctcccactgcctggactttcctcaatatcagcgcctgttccactcacaaggc
aaagttgaggccaaccctagcctgctgcggactttgggcaccaatgagcacagaagagag
gccaagggggtgctgagggcagctcagcgctggcccacagacactccttggcacaaacag
cctgggcaccataaatggcagcaggaggcagacaagctcctgggcaggagtgggacaaga
actcaagacccagggaatggcagggctaaaagagctgtaacacaaacagggctgaaacac
cccttgcccacaatgttacagccaacaagaggacagaagagctgtggccctctggggagc
ccagacctaggagctccccaagccagagcggtgacaccctctttaaggcgctgcagttcc
tggagtctccaagcttccaggcaccacactgcattccctggtgtcagccatggaaactgc
ttatggtatccctggtccagccacagcctcatagggagctggtgccagtactggcacctg
gagctgcccgccccaccaccgccagagtgcctggttgtatgcagtggccagaccccacgc
tcactaacacacccctcaccgctccatgactggctctcccttggcaggcatgggacccag
actggtagcaggagctga

>gi568815593f:79136651_79416304|GENSCAN_predicted_peptide_5|72_aa
MLLDKHKNTESMVELLDLYQMEDEAYSSLAEATTELYQYLLQPFRDMRELAMLRRQQIKS
CLLFNVLETVYS

>gi568815593f:79136651_79416304|GENSCAN_predicted_CDS_5|219_bp
atgctcttggataagcacaagaatacagagagcatggtggagcttctggacttgtatcag
atggaggatgaagcctacagcagccttgcagaagctacaaccgaactctatcagtattta
ctacagccattccgagacatgagagaacttgccatgctacgaagacagcagatcaagtca
tgtctcctctttaatgttttggagactgtgtactcttag

>gi568815593f:79136651_79416304|GENSCAN_predicted_peptide_6|211_aa
MAVGKNKHLTKGSKKGAKKKVVDPFSKKDWYDVKAPAMFNIRNIGKMLITRTQGTKIASD
GLKGRVFETMIEAHVDVRTIDGYLLRLFCVGFTKKRNNQIQKTSYAQHQRVRQIRKKMME
IMTREVQTNALKEVVNKLIPDSIGKDIEKACQSILSMMSSKVKMLKKPKFELGKLMELHG
EGSSSGKATGDERGAKVERADGFEPPVQESV

>gi568815593f:79136651_79416304|GENSCAN_predicted_CDS_6|636_bp
atggcggttggcaagaacaagcaccttacgaaaggcagcaaaaagggggccaagaagaaa
gtggttgatccattttctaagaaagattggtatgatgtgaaagcacctgctatgttcaat
ataagaaatattggaaagatgctcatcaccaggacccaaggaaccaaaattgcatctgat
ggtctcaagggtcgtgtgtttgaaacaatgattgaagctcacgttgacgtcaggactatt
gatggttacttgcttcgtctgttctgtgttggttttactaaaaaacgcaacaatcagata
cagaagacctcttatgcccagcaccaacgggtccgccaaatccggaagaagatgatggaa
atcatgacccgagaggtgcagacaaatgccttgaaagaagtggtcaataaattgattcca
gacagcattggaaaagacatagaaaaggcttgccaatctatcctctccatgatgtcttca
aaagtaaaaatgctgaagaagcccaagtttgaattgggaaagctcatggagcttcatggt
gaaggcagtagttctggaaaagccactggggacgagagaggtgctaaagttgaacgagct
gatggttttgaaccaccagtccaagaatctgtttaa

>gi568815593f:79136651_79416304|GENSCAN_predicted_peptide_7|552_aa
MENDYLGPRRIESLQKEDADWQRKAHMAVLSIQDLTVKYFEITAKAQKAVYDRMRADQKK
FGKASWAAAAERMEKLQYAVSKETLQMMRAKEICLEQRKHALKEEMQSLRGGTEAIARLD
QLEADYYDLQLQLYEVQFEILKCEELLLTAQLESIKRLISEKRDEVVYYDTYESMEAMLE
KEEMAASAYLQREELQKLQQKARQLEARRGRVSAKKSYLRNKKEICIAKHNEKIQQRTRI
EDEYRTHHTVQLRYPGQVILKSTRLRLAHARRKGAASPVLQEDHCDSLPSVLQVEEKTEE
VGEGRVKRGPSQTTEPQSLVQLEDTSLTQLEATSLPLSGVTSELPPTISLPLLNNNLEPC
SVTINPLPSPLPPTPPPPPPPPPPPPPPPLPVAKDSGPETLEKDLPRKEGNEKRIPKSAS
APSAHLFDSSQLVSARKKLRKTAEGLQRRRVSSPMDEVLASLKRGSFHLKKVEQRTLPPF
PDEDDSNNILAQIRKGVKLKKVQKDVLRESFTLLPDTDPLTRSIHEALRRIKEASPESED
EEEALPCTDWEN

>gi568815593f:79136651_79416304|GENSCAN_predicted_CDS_7|1659_bp
atggagaatgattatctgggacctcgaagaattgagagtctacaaaaagaagatgctgat
tggcagcggaaagctcacatggctgtactgtctattcaagatcttactgtcaagtacttt
gaaataacagctaaagctcaaaaagctgtgtatgatcgaatgcgagctgatcagaagaaa
tttggtaaagcatcatgggcagcggctgctgaacggatggaaaaactccagtatgcagtt
tctaaggaaactttgcagatgatgagagctaaagagatatgcttggaacagcggaaacat
gcactaaaggaagagatgcagagtttgcggggtggtacagaagcgatagcacgattggat
cagttagaagctgattattatgatctgcaacttcagttgtatgaagtacagtttgaaatc
ttgaagtgtgaagagttactattgacagcgcaactagaaagcatcaaaagacttatatca
gaaaaaagagatgaagtggtatactatgacacttacgaaagcatggaggccatgctggag
aaggaagagatggcagcatctgcgtacttacagagagaagagctgcagaaacttcagcag
aaagcacgccagctggaagcaagacgtggacgggtttctgccaagaaatcctacctcaga
aataaaaaggaaatatgtattgcaaaacacaatgaaaaaatccaacagcgcactcggatt
gaagatgaatatagaacccatcacacagtacaactaaggtatcctgggcaagtcatactt
aaatcaaccagattacgactagctcatgcaagaagaaaaggtgcagcaagtcctgttctc
caagaggatcattgtgactctttaccaagtgtgttacaggtagaagagaaaactgaagag
gtgggagaaggaagagtcaagcgtgggccatcacagacaacagaaccccagagccttgtg
caacttgaagatacttcattaacacaacttgaagccacctcattacctctcagtggtgtt
acctctgaactgcctcccactatatctcttccacttttgaataacaacctcgaaccatgt
tctgttaccataaatccactcccatcccctcttcctccaacaccaccacctcccccacct
cctccccctcccccaccaccaccacctctgcctgttgctaaggacagtggcccagagaca
ctggagaaagatctgcctagaaaggaggggaatgagaagaggatcccaaagtcagccagt
gccccctcagcacacctctttgacagcagccagctggtcagtgcacggaagaagctcaga
aagactgctgaaggtttgcagaggaggagagtgagctcacccatggatgaggtgctagcc
tccttgaagcgtggtagttttcatctgaaaaaggttgaacagcgaactctgcctcctttt
cctgatgaagatgatagtaataatatcttggcacaaataaggaaaggggtaaaattgaag
aaggtacagaaggatgttttgagagaatccttcacacttctacccgatacagaccctcta
acacggagcatccatgaagctcttagaagaattaaagaagcatccccagagtcagaggac
gaagaggaggctttaccttgcacagactgggagaactaa

>gi568815593f:79136651_79416304|GENSCAN_predicted_peptide_8|156_aa
MSQRRSGLNQQEQEEVEAIRVGLPFMGSRRFPDGREERSSKMRTQECSHSGQLGGHWVPW
FLAPLQWVRTSSFSSEKFVITDHLKPTSVNSSESFSVQLCSIAGEELRSFGGEEALWFLE
FSAFLLGFSPSLWFYLPLVFDAGDLQMRFWCGCPFS

>gi568815593f:79136651_79416304|GENSCAN_predicted_CDS_8|471_bp
atgagccagaggagatctggactcaaccaacaggaacaagaggaggttgaagcaattcga
gttgggcttcctttcatgggcagcagaagattccctgatggtcgtgaggagcgttccagt
aagatgaggactcaggagtgctcccacagtgggcaacttggaggtcactgggtaccctgg
tttttagctcccttgcaatgggttcgaacatcctcctttagctcagagaagtttgttatt
accgaccatctgaagcctacttctgtcaactcgtcagagtcattctctgtccagctttgc
tccattgctggcgaggagctccgatcctttggaggagaagaggcactctggtttttggaa
ttttcagcttttctgctcggtttctccccatctttgtggttttatctacctttggtcttt
gatgctggtgacctacagatgcggttttggtgtggatgtccttttagttga

>gi568815593f:79136651_79416304|GENSCAN_predicted_peptide_9|97_aa
MDRYSPKHRMLGKDVANDGDGEVAKGQSRKDFALCQALECKDGKTQPYPPEAYNLMREAC
ILTKKQSYINAVMKEISRKSVAEFGRRLQVVRQSTPI

>gi568815593f:79136651_79416304|GENSCAN_predicted_CDS_9|294_bp
atggatcgatacagcccgaagcataggatgttagggaaggacgtggcaaatgatggggat
ggagaagtagccaaaggccagagcaggaaggactttgcactctgccaggcactggaatgc
aaagatgggaagacacagccctatcctccagaagcttataatctaatgagggaagcatgc
atattaaccaagaagcaatcatacatcaatgctgttatgaaagagatttcccggaaatca
gttgcagagtttggcagaagacttcaggttgtaagacagtcaacccccatataa

>gi568815593f:79136651_79416304|GENSCAN_predicted_peptide_10|179_aa
XSAISKHWEAELATLKGNNAKLTAALLESTANVKQWKQQLAAYQEEAERLHKRVTELECV
SSQANAVHTHKTELNQTIQELEETLKLKEEEIERLKQEIDNARELQEQRDSLTQKLQEVE
IRNKDLEGQLSDLEQRLEKSQNEQEAFRNNLKTLLEILDGKIFELTELRDNLAKLLECS

>gi568815593f:79136651_79416304|GENSCAN_predicted_CDS_10|540_bp
nnttcagcaatcagcaaacattgggaggctgaactggctaccctcaaaggaaataatgcc
aaactcactgcagccctgctggagtccactgccaatgtgaaacaatggaaacagcaactt
gctgcctatcaagaggaagcagaacgtctgcacaagcgggtgactgaacttgaatgtgtt
agtagccaagcaaatgcagtacatactcataagacagaattaaatcagacaatacaagaa
ctggaagagacactgaaactgaaggaagaggaaatagaaaggttaaaacaagaaattgat
aatgccagagaactacaagaacagagggattctttgactcagaaactacaggaagtagaa
attcggaacaaagacctggagggacaactgtctgacttagagcaacgtctggagaaaagt
cagaatgaacaagaagcttttcgcaataacctgaagacactcttagaaattctggatgga
aagatatttgaactaacagaattacgagataacttggccaagctactagaatgcagctaa