GENSCAN 1.0	Date run: 7-Nov-116	Time: 21:21:48

Sequence gi568815575r:24329576_24529920 : 200345 bp : 42.05% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   6963   7704  742  1  1   35    7   379 0.326  14.07
 1.02 Term +   7842   9048 1207  0  1   67   47   290 0.550  12.94
 1.03 PlyA +   9084   9089    6                              -0.45

 2.00 Prom +   9380   9419   40                              -3.65
 2.01 Init +  16144  16369  226  0  1   90   14   144 0.936   5.88
 2.02 Intr +  17148  17371  224  1  2   53   51   194 0.787   9.12
 2.03 Intr +  29428  29513   86  0  2   61   97    53 0.178   1.20
 2.04 Intr +  32503  32648  146  1  2   60   64   132 0.156   7.01
 2.05 Intr +  32776  32886  111  2  0   65   49   121 0.091   5.33
 2.06 Intr +  32942  32983   42  0  0  111   80    31 0.349   1.89
 2.07 Intr +  33166  35781 2616  2  0  103   69  2416 0.777 227.58
 2.08 Intr +  39111  39363  253  1  1   43   86   119 0.542   2.77
 2.09 Intr +  43316  43460  145  2  1   41   86   114 0.479   5.86
 2.10 Intr +  48406  48571  166  0  1   64   30    91 0.020  -0.49
 2.11 Intr +  58772  58917  146  0  2  112   85    45 0.001   5.68
 2.12 Intr +  78672  78803  132  2  0   49   92    73 0.115   3.72
 2.13 Term +  82522  82563   42  0  0  144   48    47 0.592   2.48
 2.14 PlyA +  84485  84490    6                               1.05

 3.04 PlyA -  84531  84526    6                               1.05
 3.03 Term -  95678  95559  120  2  0   93   53    61 0.793   0.59
 3.02 Intr -  97235  97094  142  1  1   55   44    91 0.499   0.83
 3.01 Init - 100345 100002  344  1  2   75   75   359 0.854  30.05
 3.00 Prom - 106678 106639   40                              -5.35

 4.00 Prom + 113919 113958   40                              -1.95
 4.01 Init + 135881 136312  432  1  0   77   26   327 0.001  21.66
 4.02 Intr + 151333 151364   32  0  2   76  110    21 0.008  -0.89
 4.03 Intr + 156056 156090   35  2  2   59   98    60 0.126   1.05
 4.04 Intr + 165167 165308  142  0  1   49  111   109 0.770   7.79
 4.05 Intr + 169254 169325   72  0  0  140   97    46 0.997   8.30
 4.06 Intr + 173752 173936  185  1  2  106   98   151 0.996  16.41
 4.07 Intr + 175634 175723   90  0  0   95  116    44 0.906   6.95
 4.08 Intr + 189358 189435   78  2  0  104   53   100 0.802   6.70
 4.09 Intr + 196623 196699   77  1  2  104  115   -30 0.355  -0.38
 4.10 Intr + 197999 198100  102  1  0   33   88    97 0.341   3.65
 4.11 Intr + 198501 198611  111  2  0   33   99    77 0.353   2.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 182866 182924   59  0  2   86   82    33 0.815  -0.74
S.002 Intr + 183904 184124  221  1  2   66   44   178 0.935   8.22

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:24329576_24529920|GENSCAN_predicted_peptide_1|649_aa
XRLIKKKTEKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYT
LPRLNQEEVESLNRPITGSEIVAIINSLPTKNSPGPDGFTAEFYQRYKEELVPFLLRLFQ
STEKEGILLNSFYEASIILIPKPDRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKK
LIHHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKAFDKIQQPFMLKTL
NKLGIDGTVGSWAISQEKEIKGIQLAKEEVKLSLFADDMIGYLENPIVSAQNLLKLISNF
SKVSGYKINVQKSQAFLYTNSRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENY
KPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTL
KFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTESS
EIMSHIYNHLNFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIK
DLHVRPKTIKILEENLGNTIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKET
TIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKNTTPSKSGQRI

>gi568815575r:24329576_24529920|GENSCAN_predicted_CDS_1|1950_bp
ncaagactaataaagaagaaaacagagaagaatcaaatagacgcaataaaaaatgataaa
ggggatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacac
ctctacgcaaataaactagaaaatctagaagaaatggataaattcctcgacacatacacc
ctcccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaa
attgtggcaataatcaatagcttaccaaccaaaaacagtccaggaccagacggattcaca
gccgaattctaccagaggtacaaggaggagctggtaccattccttctgagactattccaa
tcaacagaaaaagagggaatcctccttaactcattttatgaggccagcatcatcctgata
ccaaagcctgacagagacacaaccaaaaaagagaattttagaccaatatccttgatgaac
attgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaag
cttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatatgc
aaatcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatgattatc
tcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaaactctc
aataaattaggtattgatgggactgttggaagttgggcaattagtcaggagaaggaaata
aagggtattcaattagcaaaagaggaagtcaaattgtccctgtttgcagatgacatgatt
ggatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttc
agcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaat
agcagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaagaga
ataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactac
aaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattccatgctca
tgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattt
aatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactacttta
aagttcatatggaaccaaaaaagagcccgcatcgccaaatcaatcctaagccaaaagaac
aaagctggaggcatcacgctacctgacttcaaactatactacaaggctacagtaaccaaa
acagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagagtcctca
gaaataatgtcacatatctacaaccatctgaactttgacaaacctgacaaaaataagaaa
tggggaaaggattccctatttaataaatggtgctgggaaaactggctagccatatgtaga
aagctgaaactggatcccttccttacaccttatacaaaaattaattcaagatggattaaa
gacttacatgttagacctaaaaccataaaaatcctagaagaaaacctaggcaataccatt
caggacataggcatgggcaaggacttcatgtccaaaacaccaaaagcaatggcaacaaaa
gccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaact
accatcagagtgaacaggcaacctacaaaatgggagaaaatttttgcaacctactcatct
gacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaacaca
accccatcaaaaagtgggcaaaggatatga

>gi568815575r:24329576_24529920|GENSCAN_predicted_peptide_2|1444_aa
MVVLSNRKDRKGKGEKEKSIACGELGKARSSEWPEKGPSITATLNQKFRRLLISCEGIFS
SSPISLQVSLSEEKKVRCNRPLTGSKPVNYQIQSDPGPSPVSVMTSKPSLDQKFAQTQLQ
TQICGALEPERELTMISSCSKRETAQWVWQKILFVDISINFVHNCLTGKNPNVLQLVNSG
IPVETDVYARCERYIYTVIGGLCFCMYAFKVKWQVVECKCGQQTRKCACRDVKPGPRRRP
LRYRCFQKPSNRLGSASLGAGFWAGRRDAKSGHDDDPSGALFSVTMDRDLEQALDRAENI
IEIAQQRPPRRRYSPRAGKTLQEKLYDIYVEECGKEPEDPQELRSNVNLLEKLVRRESLP
CLLVNLYPGNQGYSVMLQREDGSFAETIRLPYEERALLDYLDAEELPPALGDVLDKASVN
IFHSGCVIVEVRDYRQSSNMQPPGYQSRHILLRPTMQTLAHDVKMMTRDGQKWSQEDKLQ
LESQLILATAEPLCLDPSVAVACTANRLLYNKQKMNTDPMKRCLQRYSWPSVKPQQEQSD
CPPPPELRVSTSGQKEERKVGQPCELNIAKAGSCVDTWKGRPCDLAVPSEVDVEKLAKGY
QSVTAADPQLPVWPAQEVEDPFGFALEAGCQAWDTKPSIMQSFNDPLLCGKIRPRKKARQ
KSQKSPWQPFPDDHSACLRPGSETDAGRAVSQAQESVQSKVKGPGKMSHSSSGPASVSQL
SSWKTPEQPDPVWVQSSVSGKGEKHPPPRTQLPSSSGKISSGNSFPPQQAGSPLKRPFSA
AAAIAAAAAAAAAAAAAAAAAAPAPALAAAAAPALAAAAAPALAAAAAPAPAPAAAPAVA
AAPAAAASAAPSHSQKPSVPLIQASRPCPAAQPPTKFIKIAPAIQLRTGSTGLKAINVEG
PVQGAQALGSSFKPVQAPGSGAPAPAGISGSDLQSSGGPLPDARPGAVQASSPAPLQFFL
NTPEGLRPLTLLQVPQGSAVLTGPQQQSHQLVSLQQLQQPTAAHPPQPGPQGSALGLSTQ
GQAFPAQQLLKVNPTRARSGLQPQPQPAVLSLLGSAQVPQQGVQLPSVLRQQQPQPQPPK
LQLQPQWQPKPRQEQPQSQQQQPQHIQLQTQQLRVLQQPQHIQLQTQQLRVLQQPVFLAT
GAVQIVQPHPVLNSHVWLVAIVWVSAVLEGQEQSRLWGQSIRHDLGLGHTGGVRFMMLIA
CSGLELLEILMAYGHLGQDLQDQVAPAMERGGWKRIFAGEEVKELKAQATGRIIYVDTKI
IKNSYCWHSVEEDDNESGPRIFKNTQVTVTEVNQVGLGGWALRISISKFPGVAAAAAGSG
TIHILRTCLDANHMVALQDPVWRVAASGLVQTMEDTSRSEIGRERQAYLFLQPTPCQAML
VSSFFPLCVRGNRLWVKSSLVIHPWCLRRFHKCIHIPYQGLAGQSNSRPLWGSDWHLTSD
GITI

>gi568815575r:24329576_24529920|GENSCAN_predicted_CDS_2|4335_bp
atggttgtcctgagtaacagaaaagatagaaaagggaaaggagaaaaagagaaaagcatt
gcttgcggcgagttggggaaggcaaggagctcagagtggccagagaaaggcccatccatt
acagcaacactgaatcaaaagttcaggcggctccttatcagttgtgaagggatcttttct
agcagtcctattagcctgcaagtttccctttccgaggaaaaaaaagtcagatgtaatagg
cctctaactggatccaaaccagttaattatcagatccaatccgatcccggacccagtcca
gtttctgtcatgacttccaaacccagtttggatcagaaatttgctcaaactcagctccaa
acacaaatctgtggagctttggaaccggagagagaacttaccatgatatccagctgctcc
aagagagaaacggcacaatgggtctggcagaaaatcttatttgtggatatttccatcaac
tttgttcataattgcctaactggaaagaatccaaatgtccttcaactggtaaacagcggc
attcctgtggaaactgatgtctatgcacggtgcgaaagatacatctataccgtcattggt
ggcttatgcttttgcatgtatgctttcaaggtgaaatggcaagtagtcgagtgcaagtgc
gggcagcagacccggaagtgcgcatgcagagacgtgaagcccgggccacgacgacgaccc
ctcaggtaccgatgttttcagaaaccctcaaacagattgggtagcgcttccctgggggca
ggtttctgggcaggccgcagagacgcgaagtccggccacgacgacgacccctcaggggcc
ctgttttctgttacaatggatcgagatttagaacaggctctggatcgcgcagagaatatc
attgaaattgcccaacagagacctcctagaaggagatactcacctagggcgggaaaaact
ctgcaggaaaaactttatgacatttatgttgaagaatgtggaaaagagcctgaggatcct
caggaattgagaagcaatgtaaacttgttagaaaagcttgttaggagagagtccttgcca
tgtttactggtcaatctatacccaggcaatcaggggtattctgtgatgctccagagagaa
gatgggtcctttgcagagaccattcggctgccttatgaagaaagggcattgctggactac
ttggatgcagaagaattaccccctgctttgggtgatgtcctggataaagcttcggttaac
atttttcatagtgggtgtgtcatagtagaagttcgtgactacaggcagtccagtaatatg
caacctcctggttaccaaagcaggcatattcttctacgtccaacgatgcagactttagcc
catgatgtgaagatgatgacaagagatggccagaaatggagccaggaagacaagcttcag
cttgagagccagctgatcttagcgacagctgaaccactgtgtcttgatccttctgtagca
gttgcctgcactgcaaacaggctgctgtacaacaagcaaaagatgaataccgacccgatg
aaacggtgcctccagaggtattcgtggccctctgtaaagccacagcaggagcagtctgac
tgtccacctcctcctgagctgagagtgtcgacttctggccaaaaagaagaaagaaaagta
ggtcagccttgtgagctgaacattgctaaagcaggaagttgtgtagacacgtggaaaggc
agaccctgtgatttggccgtgccttcagaagtggatgtggagaaacttgctaaagggtat
cagtccgtcacagctgctgacccacagctcccagtctggccagcccaggaggtagaagac
ccttttggatttgcgttggaagctggctgtcaggcctgggacaccaagccaagcatcatg
cagtcgtttaatgatccgcttctctgtggtaaaatacggccacgtaaaaaagccaggcag
aagagccagaagtctccctggcagcccttcccagatgaccattcagcttgtctcaggcct
gggtcagagactgatgctgggagggcagtgagtcaggcccaggaatcggtgcagagcaaa
gtcaaaggtccaggcaagatgtcacacagctccagtggcccagccagtgtcagtcagctc
tcttcatggaaaacaccagaacagcctgatcctgtgtgggtccagtcttcagtatcgggg
aagggagagaaacatccacctccccgcacccaacttccctcaagctcaggaaagatttcc
tcaggtaacagttttcccccacaacaggcaggcagccctcttaagcgtccattttctgct
gctgctgctattgctgctgctgctgctgctgctgctgctgctgctgctgctgctgctgct
gctgctcctgctcctgctctagctgctgctgctgctcctgctctagctgctgctgctgct
cctgctctagctgctgctgctgctcctgctcctgctcctgccgctgctcctgctgtagct
gctgctcctgctgctgctgcttctgcggcaccaagtcattctcagaagccctctgtgcct
ctcattcaagctagcaggccctgtccagctgcccagccccccaccaaattcataaaaata
gcgccagccattcagttgaggacaggctccactggcctaaaggccatcaatgtggagggc
ccagtccagggagcccaggctttggggagcagtttcaagcctgtgcaggcccctggctcg
ggtgcccccgctcctgcaggaatcagtggcagtgaccttcagtcctcaggaggtccacta
ccagatgcaaggcccggtgcagtgcaggcatcttctccagcaccccttcagtttttccta
aatactccggaaggtctcaggcctctgacactcctccaggttccgcagggctcggcggtt
ctgaccggcccgcagcagcagtcccatcagctggtttccctgcagcagctccagcagccc
acagctgctcatcctcctcagccagggccacagggttccgcactaggtttgagcacgcaa
gggcaggccttccctgctcagcaacttcttaaggtgaaccccactagagccagaagtggt
ctgcagccccagccccagcctgctgtgttgagtctgcttggctctgcccaggttcctcag
cagggtgtccagctcccctctgtcttgaggcagcagcagccacagccacagccgccgaag
ctgcaactgcaaccgcagtggcagccaaagccacggcaggagcagccacagtcgcagcag
cagcagccgcagcatatccagctccagactcagcagttgagagtcttgcagcagccgcag
catatccagctccagactcagcagttgagagtcctgcagcagccagtgtttttggcaaca
ggcgctgttcagatagtgcagccacatccagtgcttaatagccatgtatggctagtggct
attgtgtgggtcagcgcagttctagagggacaggagcaaagcagattatggggacagagc
atcaggcatgacctgggtttggggcacacaggtggagtgagatttatgatgctgattgca
tgtagtgggctggagcttcttgagatcctcatggcctatggtcatctgggccaggacctg
caggatcaagtcgccccagccatggagcgaggagggtggaaaaggatctttgcaggagag
gaggtcaaggagttgaaggcccaggctactggaaggattatctatgtggataccaaaatc
atcaagaattcatactgctggcatagtgttgaagaggatgataatgagtcaggccctaga
atcttcaagaatactcaggtaactgtaactgaagtcaaccaagtaggtctgggtgggtgg
gccctgagaattagcatttccaagtttccaggtgttgctgctgctgctgctggatcaggg
accatccacattttgagaacctgtctggatgcaaatcacatggtggccctccaggatcca
gtctggagagtagctgcttctggtttggtccagacaatggaggacaccagtagatctgag
ataggaagagagaggcaggcgtacttattcctgcagcccactccctgccaggctatgctg
gtcagcagcttttttcctctgtgtgtccgtggaaatcgactttgggttaagagtagtctg
gttatccatccgtggtgtttgcggaggtttcacaaatgcatacacatcccttatcaagga
cttgctggccagtcaaatagtagacctttgtgggggtcagactggcatctgacatcagat
ggcatcaccatctga

>gi568815575r:24329576_24529920|GENSCAN_predicted_peptide_3|201_aa
MTKKRRNNGRAKKGRGHVQPIRCTKCARCVPKDKAIKKFVIRNIVEAAAIRDISEASVFD
AYVLPKLYVKLHYCVSCAIHSKVVRNRSREARKDRTPPPRFRPAGAAPRPPPKPMIWVQN
TADIRHVLTGFAAGKPPSLGRSCVGKSGSCPLSRCLQDGCPQVIPVGCINIRQKPPLLCN
HVGKCIIRSKLCGADLLAGIH

>gi568815575r:24329576_24529920|GENSCAN_predicted_CDS_3|606_bp
atgacaaagaaaagaaggaacaatggtcgtgccaaaaagggccgcggccacgtgcagcct
attcgctgcactaaatgtgcccgatgcgtgcccaaggacaaggccattaagaaattcgtc
attcgaaacatagtggaggccgcagcaatcagggacatttctgaagcgagcgtcttcgat
gcctatgtgcttcccaagctgtatgtgaagctacattactgtgtgagttgtgcaattcac
agcaaagtagtcaggaatcgatctcgtgaagcccgcaaggaccgaacacccccaccccga
tttagacctgcgggtgctgccccacgtcccccaccaaagcccatgatctgggtccagaat
actgcagacatcagacatgtgttaactgggtttgctgccggcaagccaccatccttggga
aggagttgtgtgggcaaatcaggaagttgccctctcagcagatgtctccaggatggctgc
cctcaggttattcctgttggttgtataaatataagacagaagccaccactgctttgcaat
cacgtggggaagtgtatcatccgctccaagctctgtggagcagacctcctggccgggatc
cactga

>gi568815575r:24329576_24529920|GENSCAN_predicted_peptide_4|452_aa
MRLFRWLLKQPVPKQIERYSRFSPSPLSIKQFLDFGEYGAKGPHGPGAAAIRAATPDLRS
FGVKSVGTAGGAREYPGGSPGARISGRGGAGPGPLAPSLGLTRRHRNGVSLQDPHRRRDS
PNLRSLASNLLDTNLTALKTSERDVFEIRVVSGNRTLTAGSYPYRQGRDNACEKTSYMFL
RKELPVRLANTMREVNLLPDNLLNRPSVGLVQSWYMQSFLELLEYENKSPEDPQVLDNFL
QVLIKVRNRHNDVVPTMAQGVIEYKEKFGFDPFISTNIQYFLDRFYTNRISFRMLINQHT
LLFGGDTNPVHPKHIGSIDPTCNVADVVKDAYETAKMLCEQYYLVAPELEVEEFNAKAPD
KPIQVVYVPSHLFHMLFELFKNSMRATVELYEDRKEGYPAVKTLVTLGKEDLSIKISDLG
GGVPLRKIDRLFNYMYSTAPRPSLEPTRAAPL

>gi568815575r:24329576_24529920|GENSCAN_predicted_CDS_4|1356_bp
atgcggctgttccggtggctgctgaagcagccggtgcccaagcagatcgagcgctactcg
cgcttttcgccgtcgccgctctccatcaaacaattcctggacttcggtgagtacggggcc
aagggtccccatgggcccggggccgccgccattcgggctgccaccccggatctccgcagc
ttcggggtaaagtctgtgggaaccgcaggcggggctagggaatatccggggggctctcct
ggggctcggatttcggggcgcggaggggcagggcctggccccctcgcacccagcctggga
cttacccggagacacagaaacggcgtctcattgcaggaccctcatcgcaggcgagactcc
ccaaacctgcggtcgttagcttcaaatctgcttgacacaaacctcacagcccttaaaact
tcagaacgagatgtttttgaaataagggtggtttctgggaacagaacattaacagcaggc
agttatccctatcggcaagggagagataatgcatgtgagaaaacttcatatatgtttcta
cgaaaggaacttcctgtgcggctggctaacacaatgagagaagttaatcttctgccggat
aatttacttaaccgcccttcagtgggattggttcagagttggtatatgcagagttttctt
gaacttttagaatatgaaaataagagccctgaggatccacaggtcttggataactttcta
caagttctgattaaagtcagaaatagacacaatgatgtggttcctacaatggcacaagga
gtgattgaatacaaggagaagtttgggtttgatcctttcattagcactaacatccaatat
tttctggatcggttttataccaaccgcatctctttccgcatgcttattaatcagcacaca
cttctgtttgggggtgacactaatcctgttcatcctaaacacataggaagtatcgatccc
acctgtaacgtggcggatgtggtgaaagatgcatatgaaacagccaagatgctgtgtgaa
cagtattacctggtagctccagagctggaagttgaagaattcaatgccaaagcgccagac
aaacctattcaggtggtttatgtgccctcacatctgtttcatatgctatttgagttgttc
aagaactcaatgagagcgacagttgaactctatgaagacagaaaagagggctaccctgct
gttaaaaccctcgttactttgggtaaagaagacttatccattaagatcagtgacctaggt
ggtggtgtcccacttcgaaaaatagatcgtctttttaactacatgtattctactgctcct
agacccagcctggagcctaccagagctgcccctttg