GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:15:05 Sequence gi568815591r:6064947_6290532 : 225586 bp : 45.70% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3870 3990 121 2 1 86 82 50 0.419 4.55 1.02 Intr + 16215 16324 110 1 2 86 83 34 0.444 2.70 1.03 Term + 16727 16870 144 1 0 -9 47 217 0.779 5.81 1.04 PlyA + 16963 16968 6 1.05 2.03 PlyA - 18011 18006 6 1.05 2.02 Term - 27277 27011 267 1 0 24 42 395 0.910 23.99 2.01 Init - 32356 32174 183 1 0 61 29 115 0.408 2.03 2.00 Prom - 34722 34683 40 -5.36 3.00 Prom + 41550 41589 40 -3.16 3.01 Init + 46188 46428 241 2 1 71 79 104 0.683 5.84 3.02 Intr + 50377 50577 201 2 0 105 103 9 0.405 3.36 3.03 Intr + 70895 71005 111 0 0 67 88 36 0.182 1.85 3.04 Intr + 74146 74248 103 2 1 69 97 73 0.654 5.53 3.05 Intr + 75182 75249 68 2 2 79 99 -26 0.674 -3.85 3.06 Intr + 75968 76038 71 0 2 64 94 19 0.893 -1.00 3.07 Intr + 77986 78068 83 0 2 108 70 127 0.790 11.34 3.08 Intr + 79139 79250 112 2 1 75 95 29 0.982 2.68 3.09 Intr + 80570 80710 141 1 0 118 80 128 0.999 15.55 3.10 Intr + 81202 81302 101 0 2 95 72 14 0.784 -0.59 3.11 Intr + 82793 82946 154 2 1 141 87 32 0.934 8.47 3.12 Intr + 84637 85356 720 0 0 84 113 498 0.945 43.53 3.13 Intr + 85466 85560 95 1 2 88 82 40 0.997 2.16 3.14 Intr + 88810 90249 1440 1 0 16 99 1543 0.998 136.93 3.15 Intr + 91808 92109 302 2 2 106 90 122 0.934 10.58 3.16 Term + 92276 92385 110 0 2 70 48 75 0.457 0.37 3.17 PlyA + 99497 99502 6 1.05 4.18 PlyA - 100009 100004 6 1.05 4.17 Term - 100481 100323 159 2 0 129 48 183 0.414 16.34 4.16 Intr - 100670 100599 72 2 0 92 79 65 0.978 5.60 4.15 Intr - 100864 100788 77 2 2 72 98 85 0.948 7.13 4.14 Intr - 103018 102891 128 0 2 56 89 30 0.701 0.22 4.13 Intr - 103919 103835 85 0 1 63 105 47 0.821 2.78 4.12 Intr - 105700 105589 112 1 1 124 80 223 0.993 25.05 4.11 Intr - 106032 105884 149 1 2 58 84 393 0.670 35.85 4.10 Intr - 106368 106256 113 2 2 98 82 110 0.998 11.42 4.09 Intr - 108787 108707 81 0 0 89 115 64 0.994 7.95 4.08 Intr - 112995 112877 119 0 2 105 113 134 0.985 16.86 4.07 Intr - 122170 122104 67 0 1 17 116 52 0.422 -0.19 4.06 Intr - 122775 122711 65 0 2 89 77 58 0.720 2.32 4.05 Intr - 125585 125503 83 0 2 96 83 117 0.901 11.36 4.04 Intr - 167323 167162 162 2 0 19 116 52 0.017 1.05 4.03 Intr - 192846 192790 57 1 0 16 96 76 0.087 0.06 4.02 Intr - 208087 207528 560 0 2 49 101 190 0.215 8.87 4.01 Intr - 214213 214078 136 2 1 57 98 116 0.469 9.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:6064947_6290532|GENSCAN_predicted_peptide_1|124_aa MEAKSKICRVDWQTRDPEKEESVLQFKSETCLPETFLLRADQSNSESGGSVFPRHPVYLR FKMRESSVGASVQRLTESSDGGLRRSGERVIALLKADCAYVDCGFAPGRSSLLFTRKPKV CVDP >gi568815591r:6064947_6290532|GENSCAN_predicted_CDS_1|375_bp atggaggctaaatccaaaatctgcagggtggactggcagactagagacccagagaaggaa gagtcagtgttacagtttaagtcagaaacttgtctgccggaaactttcttgctcagggca gatcagagtaactctgagtctggtgggagtgtcttcccccgacaccccgtatacctccgc tttaagatgagagaatcttctgtcggtgctagcgttcagcggctaacggagtcttcggac ggcggcctgcggaggtcgggtgagagggtgatcgcgctgctgaaagccgactgcgcctac gtggactgcgggttcgctccgggccgctcctccctgctgttcacgaggaaacctaaagtc tgtgttgacccatga >gi568815591r:6064947_6290532|GENSCAN_predicted_peptide_2|149_aa MLKCMLPAGDIPSQGLENTVLPGPRRAETASAVLEKWYPEVAVMYKPGLITGDAVTELGC LPGRQSENLSQKKKKKEKKKKEEEEEEEEEEEEEGRGRRRRRGGRRRRGRGRRRRRRKKK KKKKKKKKKKKKKKKKKKKKKKKKYVLED >gi568815591r:6064947_6290532|GENSCAN_predicted_CDS_2|450_bp atgctaaaatgcatgctaccagctggtgacattccatcacaaggcctggaaaacaccgtt ttaccaggaccacggagggcagagacagcatcagctgtgttggaaaagtggtaccccgag gtggctgtcatgtataagccagggctaataacaggagatgctgttacagaactgggctgt ctgcctgggcgacaaagcgagaatctgtctcaaaaaaaaaaaaaaaaagagaagaagaag aaagaagaagaagaagaggaagaggaagaagaagaagaagaaggaagaggaagaagaaga agaagaggaggaagaagaagaagaggaagaggaagaagaagaagaagaagaaagaagaag aagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag aagaagaaaaaatacgtccttgaggactaa >gi568815591r:6064947_6290532|GENSCAN_predicted_peptide_3|1350_aa MTIVDKASESSDPSAYQNQPGSSEAVSPGDMDAGSASWGAVSSLNDVSNHTLSLGPVPGA VVYSSSSVPDKSKPSPQKDQALGDGIAPPQKVLFPSEKICLKWQQTHRVGAGLQNLGNTC FANAALQCLTYTPPLANYMLSHEHSKTCHAEGFCMMCTMQAHITQALSNPGDVIKPMFVI NEMRRIARHFRFGNQEDAHEFLQYTVDAMQKACLNGSNKLDRHTQATTLVCQIFGGYLRS RVKCLNCKGVSDTFDPYLDITLEIKAAQSVNKALEQFVKPEQLDGENSYKCSKCKKMVPA SKRFTIHRSSNVLTLSLKRFANFTGGKIAKDVKYPEYLDIRPYMSQPNGEPIVYVLYAVL VHTGFNCHAGHYFCYIKASNGLWYQMNDSIVSTSDIRSVLSQQAYVLFYIRSHDVKNGGE LTHPTHSPGQSSPRPVISQRVVTNKQAAPGFIGPQLPSHMIKNPPHLNGTGPLKDTPSSS MSSPNGNSSVNRASPVNASASVQNWSVNRSSVIPEHPKKQKITISIHNKLPVRQCQSQPN LHSNSLENPTKPVPSSTITNSAVQSTSNASTMSVSSKVTKPIPRSESCSQPVMNGKSKLN SSVLVPYGAESSEDSDEESKGLGKENGIGTIVSSHSPGQDAEDEEATPHELQEPMTLNGA NSADSDSDPKENGLAPDGASCQGQPALHSENPFAKANGLPGKLMPAPLLSLPEDKILETF RLSNKLKGSTDEMSAPGAERGPPEDRDAEPQPGSPAAESLEEPDAAAGLSSTKKAPPPRD PGTPATKEGAWEAMAVAPEEPPPSAGEDIVGDTAPPDLCDPGSLTGDASPLSQDAKGMIA EGPRDSALAEAPEGLSPAPPARSEEPCEQPLLVHPSGDHARDAQDPSQSLGAPEAAERPP APVLDMAPAGHPEGDAEPSPGERVEDAAAPKAPGPSPAKEKIGSLRKVDRGHYRSRRERS SSGEPARESRSKTEGHRHRRRRTCPRERDRQDRHAPEHHPGHGDRLSPGERRSLGRCSHH HSRHRSGVELDWVRHHYTEGERGWGREKFYPDRPRWDRCRYYHDRYALYAARDWKPFHGG REHERAGLHERPHKDHNRGRRGCEPARERERHRPSSPRAGAPHALAPHPDRFSHDRTALV AGDNCNLSDRFHEHENGKSRKRRHDSVENSDSHVEKKARRSEQKDPLEEPKAKKHKKSKK KKKSKDKHRDRDSRHQQDSDLSAACSDADLHRHKKKKKKKKRHSRKSEDFVKDSELHLPR VTSLETVAQFRRAQGGFPLSGGPPLEGVGPFREKTKHLRMESRDDRCRLFEYGQVWFRCT VKPLAFIEGLSDTGLRAALPVSPKPWNVQL >gi568815591r:6064947_6290532|GENSCAN_predicted_CDS_3|4053_bp atgaccatagttgacaaagcttctgaatcttcagacccatcagcctatcagaatcagcct ggcagctccgaggcagtctcacctggagacatggatgcaggttctgccagctggggtgct gtgtcttcattgaatgatgtgtcaaatcacacactttctttaggaccagtacctggtgct gtagtttattcgagttcatctgtacctgataaatcaaaaccatcaccacaaaaggatcaa gccctaggtgatggcatcgctcctccacagaaagttcttttcccatctgagaagatttgt cttaagtggcaacaaactcatagagttggagctgggctccagaatttgggcaatacctgt tttgccaatgcagcactgcagtgtttaacctacacaccacctcttgccaattacatgcta tcacatgaacactccaaaacatgtcatgcagaaggcttttgtatgatgtgtacaatgcaa gcacatattacccaggcactcagtaatcctggggacgttattaaaccaatgtttgtcatc aatgagatgcggcgtatagctaggcacttccgttttggaaaccaagaagatgcccatgaa ttccttcaatacactgttgatgctatgcagaaagcatgcttgaatggcagcaataaatta gacagacacacccaggccaccactcttgtttgtcagatatttggaggatacctaagatct agagtcaaatgtttaaattgcaagggcgtttcagatacttttgatccatatcttgatata acattggagataaaggctgctcagagtgtcaacaaggcattggagcagtttgtgaagccg gaacagcttgatggagaaaactcgtacaagtgcagcaagtgtaaaaagatggttccagct tcaaagaggttcactatccatagatcctctaatgttcttacactttctctgaaacgtttt gcaaattttaccggtggaaaaattgctaaggatgtgaaataccctgagtatcttgatatt cggccatatatgtctcaacccaacggagagccaattgtctacgtcttgtatgcagtgctg gtccacactggttttaattgccatgctggccattacttctgctacataaaagctagcaat ggcctctggtatcaaatgaatgactccattgtatctaccagtgatattagatcggtactc agccaacaagcctatgtgctcttttatatcaggtcccatgatgtgaaaaatggaggtgaa cttactcatcccacccatagccccggccagtcctctccccgccccgtcatcagtcagcgg gttgtcaccaacaaacaggctgcgccaggctttatcggaccacagcttccctctcacatg ataaagaatccacctcacttaaatgggactggaccattgaaagacacgccaagcagttcc atgtcgagtcctaacgggaattccagtgtcaacagggctagtcctgttaatgcttcagct tctgtccaaaactggtcagttaataggtcctcagtgatcccagaacatcctaagaaacaa aaaattacaatcagtattcacaacaagttgcctgttcgccagtgtcagtctcaacctaac cttcatagtaattctttggagaaccctaccaagcccgttccctcttctaccattaccaat tctgcagtacagtctacctcgaacgcatctacgatgtcagtttctagtaaagtaacaaaa ccgatcccccgcagtgaatcctgctcccagcccgtgatgaatggcaaatccaagctgaac tccagcgtgctggtgccctatggcgccgagtcctctgaggactctgacgaggagtcaaag gggctgggcaaggagaatgggattggtacgattgtgagctcccactctcccggccaagat gccgaagatgaggaggccactccgcacgagcttcaagaacccatgaccctaaacggtgct aatagtgcagacagcgacagtgacccgaaagaaaacggcctagcgcctgatggtgccagc tgccaaggccagcctgccctgcactcagaaaatccctttgctaaggcaaacggtcttcct ggaaagttgatgcctgctcctttgctgtctctcccagaagacaaaatcttagagaccttc aggcttagcaacaaactgaaaggctcgacggatgaaatgagtgcacctggagcagagagg ggccctcccgaggaccgcgacgccgagcctcagcctggcagccccgccgccgaatccctg gaggagccagatgcggccgccggcctcagcagcaccaagaaggctccgccgccccgcgat cccggcacccccgctaccaaagaaggcgcctgggaggccatggccgtcgcccccgaggag cctccgcccagcgccggcgaggacatcgtgggggacacagcaccccctgacctgtgtgat cccgggagcttaacaggcgatgcgagcccgttgtcccaggacgcaaaggggatgatcgcg gagggcccgcgggactcggcgttggcggaagccccggaagggttgagtccggctccgcct gcgcggtcggaggagccctgcgagcagccactccttgttcaccccagcggggaccacgcc cgggacgctcaggacccatcccagagcttgggcgcacccgaggccgcagagcggccgcca gctcctgtgctggacatggccccggccggtcacccggaaggggacgctgagcctagcccc ggcgagagggtcgaggacgccgcggcgccgaaagccccaggcccttccccagcgaaggag aaaatcggcagcctcagaaaggtggaccgaggccactaccgcagccggagagagcgctcg tccagcggggagcccgccagagagagcaggagcaagactgagggccaccgtcaccggcgg cgccgcacctgcccccgggagcgcgaccgccaggaccgccacgccccggagcaccacccc ggccacggcgacaggctcagccctggcgagcgccgctctctgggcaggtgcagtcaccac cactcccgacaccggagcggggtggagctggactgggtcagacaccactacaccgagggc gagcgtggctggggccgggagaagttctaccccgacaggccgcgctgggacaggtgccgg tactaccatgacaggtacgccctgtacgctgcccgggactggaagcccttccacggcggc cgcgagcacgagcgggccgggctgcacgagcggccgcacaaggaccacaaccggggccgt aggggctgcgagccggcccgggagagggagcggcaccgccccagcagcccccgcgcaggc gcgccccacgccctcgccccgcaccccgaccgcttctcccacgacagaactgcacttgta gccggagacaactgtaacctctctgatcggtttcacgaacacgaaaatggaaagtcccgg aaacggagacacgacagtgtggagaacagtgacagtcatgttgaaaagaaagcccggagg agcgaacagaaggatcctctagaagagcctaaagcaaagaagcacaaaaaatcaaagaag aaaaagaaatccaaagacaaacaccgagaccgcgactccaggcatcagcaggactcagac ctctcagcagcgtgctctgacgctgacctccacagacacaaaaaaaagaagaagaaaaag aagagacattcaagaaaatcagaggactttgttaaagattcagaactgcacttacccagg gtcaccagcttggagactgtcgcccagttccggagagcccagggtggctttcctctctct ggtggcccgcctctggaaggcgtcggacctttccgtgagaaaacgaaacacttacggatg gaaagcagggatgacaggtgtcgtctctttgagtatggccaggtgtggttccgttgcaca gttaagcccttagcgtttattgaaggcctaagtgacacaggactgagggcagcactacct gtgtcaccaaagccctggaacgtacagctctag >gi568815591r:6064947_6290532|GENSCAN_predicted_peptide_4|741_aa XWQTDSFWLVNQKAGKIPTSVDNNDEYLLSAYCVLHTVYDSIRFSHLSKYWNTGVSHRAW PGLTSECIRFLIFYPECTCKLHPWPLGFAECGVPVSGAPRPSAGYWNAEGEEASLHAHRI GLHFRGQVYSLPQRRRRRPGRFPRAGLRGLTPRPPRAEAGHGSRDGTPTSEAPPGGRRPG RDPGRALRSRPVACALPPAASPARGPSAAGPRLAGCGSGLKMDEDGGGEGGGAVYCNLEL KASGVILAVAAGLLESGCGTPKIGEPISLLVCFNNQNCAYVWNEFKGGYAYQGLGIGNGS SGRVKVPEDLSLEEREELLDIRRRKKELIDDIERLKYEIAEVMTEIDNLTSVEESKTTQR NKQIAMGRKKFNMDPKKGIQFLIENDLLQSSPEDVAQFLYKGEGLNKTVIGDYLGERDEF NIKVLQAFVELHEFADLNLVQALRQFLWSFRLPGEAQKIDRMMEAFASRYCLCNPGVFQS TDTCYVLSFAIIMLNTSLHNHNVRDKPTAERFIAMNRGINEGGDLPEELLRNLYESIKNE PFKIPEDDGNDLTHTFFNPDREGWLLKLVSPRGTVTPSACLAANVTPATRERSALETPGL QRPPGPEDWTGMGLCMGGVSGGPKVELRGHLADPSQAGPGGRVKTWKRRWFILTDNCLYY FEYTTDKEPRGIIPLENLSIREVEDPRKPNCFELYNPSHKGQVIKACKTEADGRVVEGNH VVYRISAPSPEEKEEWMKSIK >gi568815591r:6064947_6290532|GENSCAN_predicted_CDS_4|2226_bp ncttggcagacagattccttctggcttgtgaatcagaaggcgggaaaaattcctacttct gtggataacaatgacgagtatttattgagtgcttattgtgtgctacacactgtatatgac agcatccggttctcacacctctcaaagtattggaatacaggtgtgagccaccgagcctgg ccaggcctgacttctgaatgcattcgtttcttgatcttttaccccgagtgcacgtgcaag ttgcacccctggccactgggttttgcagagtgtggcgttccagtgtccggggccccacgc ccgtccgcaggctactggaatgctgagggcgaggaggcttcgctccacgcacaccggatc ggcctgcacttcaggggccaggtctacagccttccccagcgcagacgccgccggcccggc cggttcccgagggcggggctgcgcggcctcacgccccgcccaccgcgggcagaggcgggg catgggtcacgtgacggcacgccgaccagcgaggccccgcccggcgggaggaggcccggc cgcgacccgggccgcgcgctgaggagccgcccggtcgcctgcgcgctccctccggcggcg tccccagcccgcggcccctctgctgccggcccccggctcgccggctgcgggagtggcctc aagatggatgaagacggcggcggcgagggtggtggcgcagtgtactgtaacttggagctc aaggcatctggtgtgatactggctgtagcagcaggcctccttgaatctggctgtggtact cccaagattggagaaccaattagcctgctggtttgtttcaataaccagaattgtgcatat gtctggaatgagtttaagggtggttatgcttatcaaggattggggattggcaatgggtct agtggaagggtaaaagtgcctgaagacctctcattagaagagagagaagaacttctagac attcgtcgaagaaaaaaggaacttattgatgacattgagaggctgaaatatgaaattgca gaggtgatgacagagatcgacaatctaacttccgtagaggagagcaaaacgactcagagg aacaaacagatagccatgggaagaaagaaattcaacatggatcccaaaaagggaattcag tttctaatagaaaatgacctgctacagagttccccagaagacgtcgcccagttcctttat aaaggagaaggcctaaataagaccgtcattggggactacctgggtgaaagggatgaattt aatattaaagttcttcaagcctttgttgaactccatgagtttgctgatctcaaccttgta caagccttaaggcagttcttatggagcttcaggctgcccggggaggcgcagaagattgat cgcatgatggaggctttcgcttctcgctactgcctgtgcaaccccggggtcttccagtcc acagacacgtgctacgtgctgtcattcgccatcatcatgctcaacaccagcctccacaac cacaacgtgcgtgacaagcccacggcagaacggttcatcgccatgaaccgcggcatcaac gagggcggggacctccctgaggagctgctgaggaatttgtatgagagcattaagaacgag ccatttaagatcccggaggacgacgggaacgacctgacccacaccttcttcaaccccgac cgcgagggctggctcctgaagctggtgtcccctcgagggacagtgacgccatcagcctgc ctggcagccaatgtgacccctgccaccagggaaaggtcggctctagagactcctggtctg caacgaccccctggccctgaggactggacagggatggggctctgcatgggaggtgtgagc gggggacccaaggtggagctgagaggccacctggcagacccctcccaggcggggccagga gggcgtgtgaagacctggaagcgccggtggttcatcctgaccgataactgcctctattac tttgaatacacaacagataaggagcccaggggaatcatcccgttggaaaacctcagcatc agggaggtggaggacccccggaaacccaactgttttgagctctacaatcccagccacaaa gggcaggtcatcaaggcctgtaagactgaggccgacggccgcgtggtagaggggaaccat gtggtgtaccggatctcagccccgagcccggaggagaaggaggagtggatgaaatccatc aagtga