GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:43:55 Sequence gi568815578r:37029023_37279480 : 250458 bp : 44.73% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.33 Intr - 6486 6220 267 1 0 92 52 105 0.808 4.80 1.32 Intr - 11263 11131 133 1 1 87 92 66 0.991 7.12 1.31 Intr - 15228 15064 165 0 0 121 81 67 0.986 9.46 1.30 Intr - 18168 18031 138 0 0 128 100 21 0.708 7.96 1.29 Intr - 32247 32152 96 0 0 96 39 61 0.249 2.21 1.28 Intr - 33248 33062 187 1 1 58 24 183 0.472 8.59 1.27 Intr - 36451 36402 50 1 2 73 67 23 0.949 -3.82 1.26 Intr - 37862 37702 161 0 2 65 79 142 0.992 10.71 1.25 Intr - 38099 37971 129 0 0 69 116 9 0.820 2.47 1.24 Intr - 43721 43574 148 2 1 64 19 73 0.001 -2.19 1.23 Intr - 66217 66079 139 0 1 98 27 79 0.630 3.27 1.22 Intr - 67003 66737 267 0 0 24 75 353 0.140 24.25 1.21 Intr - 73813 73678 136 2 1 67 28 112 0.167 2.73 1.20 Intr - 79642 79577 66 2 0 91 127 38 0.970 6.88 1.19 Intr - 83424 83298 127 0 1 88 47 64 0.792 2.55 1.18 Intr - 85123 84968 156 1 0 95 94 140 0.999 15.51 1.17 Intr - 86315 86151 165 2 0 38 58 132 0.469 5.36 1.16 Intr - 91596 91482 115 2 1 92 94 30 0.730 4.45 1.15 Intr - 94777 94587 191 1 2 104 29 156 0.476 9.68 1.14 Intr - 100145 100010 136 1 1 58 61 82 0.874 3.07 1.13 Intr - 108948 108788 161 0 2 111 92 12 0.539 2.69 1.12 Intr - 112292 112140 153 2 0 62 92 50 0.842 3.07 1.11 Intr - 114819 114695 125 1 2 84 90 96 0.996 9.70 1.10 Intr - 118887 118752 136 0 1 84 56 36 0.931 0.14 1.09 Intr - 126145 126041 105 1 0 98 105 83 0.910 11.41 1.08 Intr - 128933 128838 96 2 0 81 92 15 0.709 1.41 1.07 Intr - 130014 129870 145 2 1 52 78 83 0.956 3.98 1.06 Intr - 131154 131075 80 0 2 136 64 78 0.978 8.55 1.05 Intr - 132644 132607 38 0 2 83 100 29 0.698 1.58 1.04 Intr - 138518 138435 84 0 0 50 70 77 0.568 1.89 1.03 Intr - 143035 142901 135 2 0 28 89 124 0.843 7.04 1.02 Intr - 145094 144973 122 1 2 112 98 88 0.994 12.34 1.01 Init - 145264 145248 17 1 2 53 88 -26 0.076 -6.22 1.00 Prom - 148805 148766 40 -5.66 2.00 Prom + 148935 148974 40 -8.16 2.01 Init + 149216 149229 14 1 2 72 66 12 0.086 -4.19 2.02 Intr + 150230 150347 118 2 1 73 76 139 0.121 11.67 2.03 Intr + 150453 150543 91 2 1 44 -5 94 0.085 -4.73 2.04 Intr + 155158 155351 194 2 2 47 116 216 0.869 19.51 2.05 Intr + 169375 169470 96 0 0 53 105 71 0.957 5.51 2.06 Intr + 170028 170203 176 2 2 99 69 124 0.987 10.34 2.07 Intr + 174863 174938 76 2 1 90 48 76 0.996 3.32 2.08 Intr + 175745 175879 135 1 0 79 67 135 0.942 11.26 2.09 Intr + 178251 178427 177 2 0 53 94 261 0.885 23.32 2.10 Intr + 181025 181143 119 1 2 80 110 44 0.937 5.06 2.11 Intr + 184738 184843 106 1 1 77 92 131 0.989 12.62 2.12 Intr + 194856 194947 92 2 2 98 94 97 0.995 10.09 2.13 Intr + 196666 196780 115 1 1 81 71 145 0.986 12.55 2.14 Intr + 199528 199722 195 0 0 39 68 201 0.914 12.81 2.15 Intr + 200951 201037 87 1 0 98 76 75 0.954 7.47 2.16 Intr + 203274 203369 96 2 0 72 105 85 0.935 8.81 2.17 Intr + 204998 205073 76 1 1 67 119 60 0.999 6.09 2.18 Intr + 207558 207687 130 1 1 81 94 184 0.914 18.05 2.19 Term + 210591 210672 82 0 1 117 38 13 0.278 -3.63 2.20 PlyA + 212575 212580 6 1.05 3.10 PlyA - 213846 213841 6 1.05 3.09 Term - 213987 213958 30 0 0 103 39 37 0.264 -1.85 3.08 Intr - 214709 214571 139 1 1 97 19 96 0.227 4.07 3.07 Intr - 225307 225188 120 0 0 54 80 95 0.635 4.91 3.06 Intr - 227476 227372 105 0 0 106 86 218 0.990 22.73 3.05 Intr - 227647 227592 56 1 2 73 84 -27 0.419 -6.82 3.04 Intr - 227886 227785 102 0 0 129 115 8 0.511 7.97 3.03 Intr - 232650 232602 49 2 1 13 105 60 0.200 -1.12 3.02 Intr - 232833 232721 113 0 2 18 94 125 0.466 5.28 3.01 Intr - 247219 247136 84 1 0 7 111 66 0.039 0.82 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:37029023_37279480|GENSCAN_predicted_peptide_1|1457_aa MKKIRRICSQEEVVIPCAYDSDSESVDLELSNLEIIKKGSSSIELTDLDIPDIPGLHCEP LSHSPRHLTQQDPLSEAIVEKLIQSIQKVFNVPDSSRNCLGNLGYKDKEDKIPIYAAKQG KRNPLEAAETQKVLVQEERPHSLSSSMRQEVFVTIADLSYQDVHLLLGSEDRAELFSLTI KSIITLPSVRTLTQIQEIMPNGTCNTECLYRQTFQAFSEMLQSLVVKDPHLENLDTIIKH LVPWLQSVKDHERERATASMAQVLKCLSKHLNLKLPLRFQRLGHLVALMALLCGDPQEKV AEEAAEGIHSLLHITLRLKYITHDKKDQQNLKRALTKCREFLELHSSAAKCFYNCPFRIA QVFEGFLDSNELCQFIMTTFDTLKTLKHPCIQRSAGELLLTLAKNTESQFEKVPEIMGVI CAQLSIISQPRVRQQIINTVSLFISRPKYTDIVLSFLLCHPVPYNRHLAEVWRMLSVELP STTWILWRLLRKLQKCHNEPAQEKMAYVAVAATDALYEVFLGNRLRAATFRLFPQLLMTL LIQIHHSIGLTMSDVDIPSGLYTEQEVPSEVTPLCALLERNQLLAQKVMYLLVPLLNRGN DKHKLTSAGFFVEREDIKSLLPYIVDSLRETDEKIVLSAIQILLQLVRTMDFTTLAAMMR TLFSLFGDVRSDVHRFSVTLFGAAIKSVKNPDKKSIENQVLDSLVPLLLYSQDENDAVAE ESRQVLTICAQFLKWKLPQEVYSKDPWHIKPTEAGTICRFFVCLLSKYMDHNELRRMGTD WIEDDLRDLLCDPEPSLCIIASQTLLLVQMARAEPKPKQRVNWLQKLMGRFSRALAQVVV GSAPGREKEVGGRGAQPAGPEGMFEDKPHAEGAAVVAAAGEALQALCQELNLDEGSAAEA LDDFTAIRGNYSLEVSGSRVELPVAVQQFDQSQCAHEADLSPTFPPPSEHRPDALHQCLA LNKTNEIDPALRIHHFFTSLHQQLQIQNLKQMYLIKDIRLYAFILASSKADLFSGNFRMI GDDLVNSYHLLLCCLDLIFANAIMCPNRQDLLNPSFKGLPSDFHTADFTASEEPPCIIAV LCELHDGLLVEAKGIKEHYFKPYISKLFDRKILKGECLLDLSSFTDNSKAVNKEYEEYVL TVGDFDERIFLGADAEEEIGTPRKFTRDTPLGKLTAQANVEYNLQQHFEKKRSFAPSTPL TGRRYLREKEAVITPVASATQSVLLEQDIFHRSLMACCLEIVLFAYSSPRTFPWIIEVLN LQPFYFYKVIEVVIRSEEGLSRDMVKHLNSIEEQILESLAWSHDSALWEALQVSANKVPT CEEVIFPNNFETGNGGNVQGHLPLMPMSPLMHPRVKEVRTDSGSLRRDMQPLSPISVHER YSSPTAGSAKRRLFGEDPPKEMLMDKIITEGTKLKIAPSSSITAENVSILPGQTLLTMAT APVTGTTGHKVTIPLHX >gi568815578r:37029023_37279480|GENSCAN_predicted_CDS_1|4371_bp atgaaaaaaataagaaggatctgtagtcaggaagaagtagtgatcccctgtgcctatgac agtgattcagaaagtgtggatttggagctgagcaacttagagattattaaaaaaggctca agtagcattgaactgacagacttggacatccctgacatccctggactccattgtgagccc ctgtcacatagccccagacacctgacccaacaggacccgctcagtgaggccattgttgag aaactgatccagtccatccagaaggttttcaatgtgcctgacagttccaggaactgtctt gggaatttgggctacaaagacaaagaagacaaaatccctatttatgcagccaagcaaggt aagagaaatcctctagaagcagctgaaacacaaaaggtactggtacaagaggaacgcccg cattctctgtccagttccatgcgccaggaggtctttgtcaccatcgctgatctcagttac caagatgtccatttgctgttgggctctgaagatcgagctgagttgttcagtcttaccatc aagagtataatcactctgccctctgtaaggacccttacccagatacaggaaatcatgccc aatgggacctgcaacacagagtgtctttacaggcagacgtttcaggcattctctgagatg ctccagagtttggtggtaaaagacccacatttggaaaatcttgacaccattattaagcac ttggtcccctggttacagtcagtcaaagaccatgagcgggaacgggccacggccagcatg gctcaagttctgaagtgcctatccaaacatctcaacttgaagcttccactgcgattccaa agacttggacacctagtggctctgatggcactgctctgtggggacccacaggaaaaggtg gctgaggaggctgcagagggcattcactccctgctgcatatcaccctgaggctgaagtat atcactcatgacaagaaagatcagcaaaacttgaaaagagcattgacaaaatgtcgagaa ttcctggagctccacagctctgccgctaaatgcttctacaactgtcccttcagaattgcc caggtctttgaaggttttcttgattcaaatgagctctgccagtttataatgactacattt gataccctgaaaaccctgaaacatccctgcatccagcgatcagcaggagaattactgcta actttggcaaaaaatacagagtcccaatttgagaaggtgccagaaattatgggagttatc tgtgcccagttatccataatcagccagcctagagtccgccaacaaatcataaataccgtg agtttatttatatccagacccaagtacacagatatagtgctcagcttccttctgtgtcat ccagtgccgtataacaggcacctggctgaggtgtggagaatgctgtcggtggagcttccc agcacgacctggattctgtggaggctcctgaggaagctgcagaaatgccataatgagcct gcacaggagaagatggcatatgtggctgtggctgcaacagatgccctttatgaggtgttt ttgggaaacaggcttcgagcagctacgttccgactctttcctcagcttctcatgacactg cttatccagattcatcacagcatcggcctcaccatgtctgatgtcgacatcccaagtggc ctgtacacagaacaggaagtgccttcagaggtcacccctttgtgcgcattgctggaaaga aatcagctccttgcacagaaggtcatgtacttattagtccctcttcttaaccgagggaat gataaacataaactcacatctgcaggcttttttgtggagagagaagacatcaagagcctg ttgccatacattgtagacagcttgcgtgaaaccgatgagaagatcgttctgtcagccatc cagatactcctgcaacttgttagaacaatggatttcactaccctggctgccatgatgagg accctgttctccttatttggtgatgtgagatctgatgttcatcgtttctccgtgactctc tttggagccgccataaagtctgtaaaaaacccagataagaagagtatagagaaccaagtc ctggacagcttggtcccactacttctgtattctcaggatgaaaatgatgcagtagctgag gagagcaggcaagtcctaactatatgtgcccagttcctgaagtggaagctgccccaagaa gtgtactccaaagatccctggcacatcaaacctactgaagcaggaacaatctgcagattc tttgtatgccttttatcgaagtacatggatcacaatgagctcaggaggatgggtactgac tggatagaggacgatctgagagacctgctgtgtgaccctgagccctcgctgtgcatcatc gcttcccagactctgttactagtccagatggcgagggccgaaccaaaacctaagcagaga gtgaactggttgcagaagctcatgggcagattttcgcgcgctttggcgcaggtggttgtg ggtagcgcgcctgggagggagaaagaagtcgggggccgtggcgcgcagcccgcggggcct gaagggatgttcgaggacaagccccacgctgagggggcggcggtggtcgccgcagccggg gaggcgctacaggccctgtgccaggagctgaacctggacgaggggagcgcggccgaagcc ctggacgactttactgccatccgaggcaactacagcctagaggtgagcggcagcagggtg gagctgccggtcgctgtgcagcagtttgatcaaagccaatgtgcacacgaagctgatctc agccctacgtttcctcctccgtcagagcatcgtcctgatgctttgcatcagtgtctggct ctgaataaaacaaatgagattgatcctgctctaagaattcaccacttcttcacttctctg caccagcagcttcagattcaaaatctcaagcagatgtatctgataaaggatatcagatta tatgccttcatcctagcttccagtaaagctgatctgtttagtggtaattttcggatgatt ggggatgacttagtaaactcttatcatttacttctatgctgcttggatctgatttttgcc aatgcgattatgtgcccaaatagacaagacttgctaaatccatcatttaaaggtttacca tctgattttcatactgctgactttacggcttctgaagagccaccctgcatcattgctgta ctgtgtgaactgcatgatggacttctcgtagaagcaaaaggaataaaggagcactacttt aagccatatatttcaaaactctttgacaggaagatattaaaaggagaatgcctcctggac ctttcaagttttactgataatagcaaagcagtgaataaggagtatgaagagtatgttcta actgttggtgattttgatgagaggatctttttgggagcagacgcagaagaggaaattgga acacctcgaaagttcactcgtgacaccccattagggaaactgacagcacaggctaatgtg gagtataaccttcaacagcactttgaaaaaaaaaggtcatttgcaccttctaccccactg accggacggagatatttacgagaaaaagaagcagtcattactcctgttgcatcagccacc caaagtgttcttttagagcaagatatatttcatcgttccttgatggcttgttgtttggaa attgtgctctttgcctatagctcacctcgtacttttccttggattattgaagttctcaac ttgcaaccattttacttttataaggttattgaggtggtgatccgctcagaagaggggctc tcaagggacatggtgaaacacctaaacagcattgaagaacagattttggagagtttagca tggagtcacgattctgcactgtgggaggctctccaggtttctgcaaacaaagttcctacc tgtgaagaagttatattcccaaataactttgaaacaggaaatggaggaaatgtgcaggga catcttcccctgatgccaatgtctcctctaatgcacccaagagtcaaggaagttcgaact gacagtgggagtcttcgaagagatatgcaaccattgtctccaatttctgtccatgaacgc tacagttctcctaccgcagggagtgctaagagaagactctttggagaggaccccccaaag gaaatgcttatggacaagatcataacagaaggaacaaaattgaaaatcgctccttcttca agcattactgctgaaaatgtatcaattttacctggtcaaactcttctaacaatggccaca gccccagtaacaggaacaacaggacataaagttacaattccattacatgnn >gi568815578r:37029023_37279480|GENSCAN_predicted_peptide_2|724_aa MSRLQACRESLASPVAGSWSHFPERKSARGSDSGGTCSEEWRRRGHGHKLWLGRSRIEGP KEGCELVGVPATWRGSSTVFLLALTIIASTWALTPTHYLTKHDVERLKASLDRPFTNLES AFYSIVGLSSLGAQVPDAKKACTYIRSNLDPSNVDSLFYAAQASQALSGCEISISNETKD LLLAAVSEDSSVTQIYHAVAALSGFGLPLASQEALSALTARLSKEETVLATVQALQTASH LSQQADLRSIVEEIEDLVARLDELGGVYLQFEEGLETTALFVAATYKLMDHVGTEPSIKE DQVIQLMNAIFSKKNFESLSEAFSVASAAAVLSHNRYHVPVVVVPEGSASDTHEQAILRL QVTNVLSQPLTQATVKLEHAKSVASRATVLQKTSFTPVGDVFELNFMNVKFSSGYYDFLV EVEGDNRYIANTVELRVKISTEVGITNVDLSTVDKDQSIAPKTTRVTYPAKAKGTFIADS HQNFALFFQLVDVNTGAELTPHQTFVRLHNQKTGQEVVFVAEPDNKNVYKFELDTSERKI EFDSASGTYTLYLIIGDATLKNPILWNVADVVIKFPEEEAPSTVLSQNLFTPKQEIQHLF REPEKRPPTVVSNTFTALILSPLLLLFALWIRIGANVSNFTFAPSTIIFHLGHAAMLGLM YVYWTQLNMFQTLKYLAILGSVTFLAGNRMLAQQAVKRFNGVVIYLQKNVPFLVYNSVNP EKPL >gi568815578r:37029023_37279480|GENSCAN_predicted_CDS_2|2175_bp atgtcgaggctgcaagcctgccgcgagtccctggcgtcccctgtggcgggctcttggagc cactttcccgagcggaagtcagcccgcggctcggactccggcgggacctgctcggaggaa tggcgccgccgggggcatgggcacaagctctggctggggcgctctcggatcgagggtccg aaggagggctgcgagctggtgggagtgcccgcgacctggcggggttcaagcactgtcttc ctgttggccctgacaatcatagccagcacctgggctctgacgcccactcactacctcacc aagcatgacgtggagagactaaaagcctcgctggatcgccctttcacaaatttggaatct gccttctactccatcgtgggactcagcagccttggtgctcaggtgccagatgcaaagaaa gcatgtacctacatcagatctaaccttgatcccagcaatgtggattccctcttctacgct gcccaggccagccaggccctctcaggatgtgagatctctatttcaaatgagaccaaagat ctgcttctggcagctgtcagtgaggactcatctgttacccagatctaccatgcagttgca gctctaagtggctttggccttcccttggcatcccaagaagcactcagtgcccttactgct cgtctcagcaaggaggagactgtgctggcaacagtccaggctctgcagacagcatcccac ctgtcccagcaggctgacctgaggagcatcgtggaggagattgaggaccttgttgctcgc ctggatgaactcgggggcgtgtatctccagtttgaagaaggactggaaacaacagcgtta tttgtggctgccacctacaagctcatggatcatgtggggactgagccatccattaaggag gatcaggtcatccagctgatgaacgcgatcttcagcaagaagaactttgagtccctctcc gaagccttcagcgtggcctctgcagctgctgtgctctcgcataatcgctaccacgtgcca gttgtggttgtgcctgagggctctgcttccgacactcatgaacaggctatcttgcggttg caagtcaccaatgttctgtctcagcctctgactcaggccactgttaaactagaacatgct aaatctgttgcttccagagccactgtcctccagaagacatccttcacccctgtaggggat gtttttgaactaaatttcatgaacgtcaaattttccagtggttattatgacttccttgtc gaagttgaaggtgacaaccggtatattgcaaataccgtagagctcagagtcaagatctcc actgaagttggcatcacaaatgttgatctttccaccgtggataaggatcagagcattgca cccaaaactacccgggtgacatacccagccaaagccaagggcacattcatcgcagacagc caccagaacttcgccttgttcttccagctggtagatgtgaacactggtgctgaactcact cctcaccagacatttgtccgactccataaccagaagactggccaggaagtggtgtttgtt gccgagccagacaacaagaacgtgtacaagtttgaactggatacctctgaaagaaagatt gaatttgactctgcctctggcacctacactctctacttaatcattggagatgccactttg aagaacccaatcctctggaatgtggctgatgtggtcatcaagttccctgaggaagaagct ccctcgactgtcttgtcccagaaccttttcactccaaaacaggaaattcagcacctgttc cgcgagcctgagaagaggccccccaccgtggtgtccaatacattcactgccctgatcctc tcgccgttgcttctgctcttcgctctgtggatccggattggtgccaatgtctccaacttc acttttgctcctagcacgattatatttcacctgggacatgctgctatgctgggactcatg tatgtctactggactcagctcaacatgttccagaccttgaagtacctggccatcctgggc agtgtgacgtttctggctggcaatcggatgctggcccagcaggcagtcaagaggtttaat ggagttgtaatttatctgcagaaaaatgtaccctttttagtgtacaattctgtgaatcct gaaaaacctctgtag >gi568815578r:37029023_37279480|GENSCAN_predicted_peptide_3|265_aa VFTESENSTLKGNDEVLWSNTLTLQTAQENEEINDGNARRLPEQTPSPGPLDLSSASEQR DICRIRNTESKRPTAILFAHGLVPPRVKDATLGVLLCDPHPQQQLPLLPTSPFDPQRRQT QLHRGHSEGSESLLGMRRYADAIFTNSYRKVLGQLSARKLLQDIMSRQQGESNQERGARA RLGRQVDSMWAEQKQMELESILVALLQKHSHWLSFVPSDQDTQQPLYFHPQALLPLLHTT VHQPRTQSVFQARTGLDPIQNGIEF >gi568815578r:37029023_37279480|GENSCAN_predicted_CDS_3|798_bp gtattcactgaatcagagaactcgactcttaaaggaaatgatgaggtcttatggtctaat accctcactttacagacagctcaggaaaatgaagagataaatgatgggaacgccaggcgg ctgccagagcaaacacccagcccagggcccctggatttgagcagtgcctcggagcagagg gatatctgccgcatcagaaacactgagtccaagaggcccaccgccatcctctttgcccat ggactggtgccaccccgggtgaaggatgccactctgggtgttcttctttgtgatcctcac cctcagcaacagctcccactgctccccacctccccctttgaccctcagaggaggcagaca cagcttcacagaggtcactcagaggggtctgagtctctcttggggatgcggcggtatgca gatgccatcttcaccaacagctaccggaaggtgctgggccagctgtccgcccgcaagctg ctccaggacatcatgagcaggcagcagggagagagcaaccaagagcgaggagcaagggca cggcttggtcgtcaggtagacagcatgtgggcagaacaaaagcaaatggaattggagagc atcctggtggccctgctgcagaagcacagccactggctgtcctttgttcccagtgaccag gacacccagcagcctttgtacttccaccctcaggccctactacccctgctccacaccact gtccaccagccccgcacccagtctgtcttccaggcccgcacaggtctggatcccattcag aacggcatagagttctag