GENSCAN 1.0	Date run:  4-Nov-116	Time: 15:42:58

Sequence gi568815595f:169983973_170184986 : 201014 bp : 39.76% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1833   1893   61  2  1   85  111    38 0.861   3.19
 1.02 Intr +   4268   4387  120  0  0   96  111    56 0.963   8.25
 1.03 Term +   8622   9091  470  1  2   96   33   478 0.997  37.25
 1.04 PlyA +   9834   9839    6                               1.05

 2.00 Prom +  17134  17173   40                              -3.15
 2.01 Init +  17985  18262  278  2  2   72   58   226 0.371  14.50
 2.02 Intr +  28228  28252   25  1  1   66   84    -2 0.004  -5.69
 2.03 Intr +  30092  30288  197  1  2   53   57   157 0.032   6.39
 2.04 Intr +  34351  34701  351  1  0   55  110   115 0.616   3.91
 2.05 Intr +  51752  51936  185  2  2  149   72    14 0.748   4.41
 2.06 Term +  54943  55136  194  2  2   95   47   212 0.528  14.40
 2.07 PlyA +  55739  55744    6                               1.05

 3.04 PlyA -  56483  56478    6                               1.05
 3.03 Term -  60657  60466  192  0  0   87   48    80 0.021   0.34
 3.02 Intr -  73879  73681  199  0  1  118  115    78 0.920  11.93
 3.01 Init -  74401  74241  161  1  2   97   74    31 0.949   2.09
 3.00 Prom -  75212  75173   40                              -3.15

 4.00 Prom +  77554  77593   40                              -8.75
 4.01 Init +  78281  78394  114  1  0   63   48   148 0.840   6.58
 4.02 Intr +  78916  79020  105  0  0    2   72   115 0.144   0.79
 4.03 Intr +  79776  79961  186  2  0   46   68   129 0.584   5.66
 4.04 Intr +  80268  80411  144  2  0    0   69   151 0.470   3.96
 4.05 Term +  80918  81004   87  1  0   87   48    83 0.879   0.88
 4.06 PlyA +  81132  81137    6                               1.05

 5.00 Prom +  82114  82153   40                              -7.45
 5.01 Init +  88627  88651   25  0  1   84   89     9 0.269   0.06
 5.02 Intr +  93859  93951   93  2  0   59   67   129 0.336   6.92
 5.03 Term +  94350  94714  365  1  2   86   44   218 0.382  10.94
 5.04 PlyA +  95168  95173    6                               1.05

 6.14 PlyA -  96430  96425    6                               1.05
 6.13 Term - 113412 113258  155  1  2   68   42   224 0.999  12.90
 6.12 Intr - 118738 118507  232  1  1   40   84   243 0.911  15.52
 6.11 Intr - 118962 118830  133  2  1   84   62    70 0.908   3.73
 6.10 Intr - 122974 122860  115  2  1  104   63    94 0.936   7.09
 6.09 Intr - 129547 129388  160  1  1   88   76   140 0.997  11.44
 6.08 Intr - 133504 133254  251  2  2  105   95   176 0.997  16.23
 6.07 Intr - 138772 138619  154  1  1   66   66   125 0.995   6.82
 6.06 Intr - 145580 144712  869  0  2  115   91   536 0.920  46.73
 6.05 Intr - 152693 152447  247  2  1   77   75   138 0.927   7.51
 6.04 Intr - 161549 161451   99  2  0   77   99    55 0.753   4.89
 6.03 Intr - 165272 165114  159  2  0   92   95   108 0.765  11.16
 6.02 Intr - 188740 188585  156  1  0   53  110   151 0.910  13.09
 6.01 Init - 194944 194801  144  1  0   66   68   136 0.469   9.57

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:169983973_170184986|GENSCAN_predicted_peptide_1|216_aa
VYVWIYDPVHFKTFVMGLILVIAVIAATLFPLWPAEMRVGVYYLSVGAGCFVASILLLAV
ARCILFLIIWLITGGRHHFWFLPNLTADVGFIDSFRPLYTHEYKGPKADLKKDEKSETKK
QQKSDSEEKSDSEKKEDEEGKVGPGNHGTEGSGGERHSDTDSDRREDDRSQHSSGNGNDF
EMITKEELEQQTDGDCEEDEEEENDGETPKSSHEKS

>gi568815595f:169983973_170184986|GENSCAN_predicted_CDS_1|651_bp
gtgtatgtatggatctatgacccagttcactttaaaacatttgtcatgggattaattctt
gtgattgcagtaatagcggccaccctcttccccctttggccagcagaaatgagagtaggt
gtttattacctcagtgtgggtgcaggctgttttgtagccagtattcttctccttgctgtt
gctcgatgcattctatttctcatcatttggctcataactggaggaaggcaccacttttgg
ttcttgccaaatctgactgctgatgtgggcttcattgactccttcaggcctctgtacaca
catgaatacaaaggaccaaaagcagacttaaagaaagatgagaagtctgaaaccaaaaag
caacagaagtccgacagtgaggaaaagtcagacagtgagaaaaaggaagatgaggagggg
aaagtaggaccaggaaatcatggaacagaaggctcggggggagaacggcattcagacacg
gacagtgacaggagggaagatgatcgatcccagcacagtagtggaaatggaaatgatttt
gaaatgataacaaaagaggaactggaacagcaaacagatggggattgtgaagaggatgag
gaagaggaaaatgatggagaaacacctaaatcttcacatgaaaaatcataa

>gi568815595f:169983973_170184986|GENSCAN_predicted_peptide_2|409_aa
MAVLCNTQKASHLKNLKTGDLLHNPFLSSFSQHLLSLDYMLNNNINEEPTVPAQNPGKEQ
GNQRLPAHAHVAGGDPSSALKATISAIGKLPACFPELWDYRTVLLLLIYFLKLTVEQPQA
GPTGGIPEEVILIMGDDSSMDVFAPEDLPVGQDVEVEDSDTDDPGPGSGGQKSKIHLTGL
KSRCQPGRFLEALEENLFPFLFQMLQAAASSGSWPPSSIFKTSTSINPTSASILTSPQTL
LPLSPLSFTYKDSCHYSDPIWIIQDISPSQDSSLFYICTNPFARSSDPLAIPLLEHELSA
WLSTSHFAHVGILAEAPHIPLNPVLLCDSVGLCTSPGSSLCHSRPGGLEVVAGPPPAVRR
RTHGPGLRRQVRLEGTALASSYTCANVSGKGKCRLNPRDNPHREASQKT

>gi568815595f:169983973_170184986|GENSCAN_predicted_CDS_2|1230_bp
atggcagttctctgcaatactcagaaggcaagtcacctgaagaatcttaaaactggagac
ctgctgcacaatccttttctttcttcattcagccaacatttactgagcctcgactatatg
ctgaacaataatataaatgaggaacccacagtcccggctcagaatccggggaaggagcaa
ggaaaccagcggctgcctgcacatgcacacgttgccggaggagatccatcttctgcccta
aaagctacaatctccgccatcggaaagcttccagcctgcttcccagagctctgggattac
aggactgtactcctgctacttatatattttttaaaattaactgtggaacagcctcaggca
ggtcctacaggaggtattccggaagaagtcattcttatcatgggagatgacagctccatg
gatgtttttgcccctgaagaccttccagtgggacaagatgtggaggtggaagacagtgac
actgatgatcccggccctggttctggaggtcagaagtctaaaatacatctcacagggcta
aaatcaaggtgtcagccaggtcgatttctggaggctctagaggagaatctgtttcctttc
cttttccagatgctacaggctgccgcctcctctggctcatggcccccttcctccatcttc
aaaaccagtacaagcatcaatccaacttctgcttccatcctcacatctcctcagactctc
ttgcctctctcgcctctctctttcacttataaggactcttgccattacagtgaccccatc
tggataatccaagatatctccccatctcaagattcttcacttttttacatctgcacaaat
ccttttgccaggtcttctgacccacttgccattcccttattagaacatgaactctctgcc
tggctttccacttcccactttgcacacgttggaatcttggcggaagcacctcacatcccg
ttgaacccggtactcctctgtgattcagttggcctctgcacttctccaggaagttctctc
tgccactccaggccaggtggcctcgaggtggtggcagggccgccccctgcagtccggaga
cgaacgcacggaccgggcctccggaggcaggttcggctggaaggaaccgctctcgcttcg
tcctacacttgcgcaaatgtctccggtaaggggaagtgtcgcttaaatcccagagataac
ccccaccgagaagcatcacaaaagacatga

>gi568815595f:169983973_170184986|GENSCAN_predicted_peptide_3|183_aa
MRKTISGLAYSPPTHFLWESRAILKPIWVPSAGSLRVQPPSGLKYPPRPATREKQLSVPP
PGLSGLLPNQVSKCHLVFLLSCGISDTADPGPGPRKQHGSASGSEAVSLSLKLPEAHSFQ
DTLGNRQGAQDPPAYSQRPNGVGVWSHSFSLLRPAHTYSILVKLQGVANKFLISEKIGSM
GEK

>gi568815595f:169983973_170184986|GENSCAN_predicted_CDS_3|552_bp
atgaggaagaccatttctggcctagcttattccccacccacccacttcctgtgggaaagc
agggcaattctgaaacccatatgggtgccgtcagctgggtccctgagggtacagccaccc
agtggcctgaaatacccacccaggcctgctacacgagagaagcagctttcagtccctcca
cctggcctctcaggacttcttccaaaccaggtctcaaaatgccacttggtgttcctgctc
agttgtggtatcagcgatactgcggatccaggccctgggccaagaaagcagcatggctca
gcttctggttctgaagctgtgtctttgtctctgaaacttccagaagcccattccttccag
gacaccctggggaaccgccaaggtgcacaagacccacctgcctactcacaaagacccaat
ggtgtgggagtttggagccattccttttccctactgaggcctgcccacacctactccatt
cttgtcaaactgcagggggtagcgaacaagttccttataagtgagaaaataggcagcatg
ggagaaaaatga

>gi568815595f:169983973_170184986|GENSCAN_predicted_peptide_4|211_aa
MSKSRAAEAAAGVTATAPSPQIVEQRGPGRRCTDVGENRELSRGAFVVEKHRSLTEITDA
TGPLHGGSPWISTALFEVYEYIFKRLGSKLVFPVLSRKLTELVALSAGRHRFPLPPAAAF
LSTWKHRGVKSNQTWFQSCRLGLVFVRHPRLLRGRSHELAGVSTQAQIPQPELRLPASEK
GAQVNLNGSRYESLSSPLIPPPHPGPHNGAT

>gi568815595f:169983973_170184986|GENSCAN_predicted_CDS_4|636_bp
atgtccaagtcccgtgcggcggaggcagcagcgggggtgacagcgacggccccgagcccg
cagatagtggagcagaggggtccagggaggcgctgcaccgacgttggggagaatagggag
ctctcccggggcgcctttgtggtagaaaagcaccggagcctcacggagatcaccgacgcc
acgggacccctccatggtggatctccgtggatctccacagcgctctttgaagtgtatgaa
tatatttttaaaagattgggcagtaagcttgtcttccccgtgctttctcgaaagcttact
gagctcgtggccctgagcgcgggccggcatagatttcctcttccacccgctgcggctttt
ctgagcacctggaagcatcggggtgtgaagtcaaaccagacgtggttccagagctgccgt
ctaggcctcgtgtttgtcagacacccacggctcctgcgaggtcgcagccatgagctggcg
ggggtgagtacacaagcccagatcccccaacccgaactgcggctgccagcctcagagaaa
ggcgcccaggtaaatctgaatggaagcaggtatgagagcctgtcctcgcctttgattccc
ccaccccaccctgggcctcacaacggtgctacctaa

>gi568815595f:169983973_170184986|GENSCAN_predicted_peptide_5|160_aa
MVEGRGGAVAWPGSVGQTQQLVPCVRNAAMDRDGTLENEAPPLPGKGATEETEVFPEPPP
PINWKKDKGYTTVMGPCLKQAALEGELLVCLVMQDQQGNRVHEPITFNTYKEIRKSIREN
GAASPFMKGLIEAMADNFHMTPWDWSVLTKTTLEASQYLL

>gi568815595f:169983973_170184986|GENSCAN_predicted_CDS_5|483_bp
atggtggaaggcagagggggagcagttgcttggccagggtctgtgggacagacccagcag
ctggtgccctgtgtgaggaacgctgcaatggatcgcgacggaaccctcgaaaacgaagcc
ccaccgttaccgggtaaaggtgccacagaggagacagaggttttccctgagccccctccc
ccaataaactggaaaaaggacaagggatacactacagttatgggaccctgtcttaagcaa
gcggcattagaaggggagctcttggtctgcctggtaatgcaagatcaacaaggcaatcgg
gtacatgaacccattactttcaacacttacaaagagataagaaaaagcattagagaaaat
ggagccgctagcccgtttatgaaaggattaattgaggccatggcagacaacttccatatg
accccatgggactggtcagtgctaactaaaacaactttagaggcaagtcaatacctcctc
tag

>gi568815595f:169983973_170184986|GENSCAN_predicted_peptide_6|957_aa
MDTEPNPGTSSVSTTTSSTTTTTITTSSSRMQQPQISVYSGSDRHAVQVIQQALHRPPSS
AAQYLQQMYAAQQQHLMLHTAALQQQHLSSSQLQSLAAVQINLSTSPTPAQLISRSQASS
STSGSITQQTMLLGSTSPTLTASQAQMYLRAQMLIFTPATTVAAVQSDIPVVSSSSSSSC
QSAATQVQNLTLRSQKLGVLSSSQNGPPKSTSQTQSLTICHNKTTVTSSKISQRDPSPES
NKKGESPSLESRSTAVTRTSSIHQLIAPASYSPIQPHSLIKHQQIPLHSPPSKVSHHQLI
LQQQQQQIQPITLQNSTQDPPPSQHCIPLQNHGLPPAPSNAQSQHCSPIQSHPSPLTVSP
NQSQSAQQSVVVSPPPPHSPSQSPTIIIHPQALIQPHPLVSSALQPGPNLQQSTANQVQA
TAQLNLPSHLPLPASPVVHIGPVQQSALVSPGQQIVSPSHQQYSSLQSSPIPIASPPQMS
TSPPAQIPPLPLQSMQSLQVQPEILSQGQVLVQNALVSEEELPAAEALVQLPFQTLPPPQ
TVAVNLQVQPPAPVDPPVVYQVEDVCEEEMPEESDECVRMDRTPPPPTLSPAAITVGRGE
DLTSEHPLLEQVELPAVASVSASVIKSPSDPSHVSVPPPPLLLPAATTRSNSTSMHSSIP
SIENKPPQAIVKPQILTHVIEGFVIQEGLEPFPVSRSSLLIEQPVKKRPLLDNQVINSVC
VQPELQNNTKHADNSSDTEMEDMIAEETLEEMDSELLKCEFCGKMGYANEFLRSKRFCTM
SCAKRYNVSCSKKFALSRWNRKPDNQSLGHRGRRPSGPDGAAREHILRQLPITYPSAEED
LASHEDSVPSAMTTRLRRQSERERERELRDVRIRKMPENSDLLPVAQTEPSIWTVDDVWA
FIHSLPGCQDIADEFRAQEIDGQALLLLKEDHLMSAMNIKLGPALKICARINSLKES

>gi568815595f:169983973_170184986|GENSCAN_predicted_CDS_6|2874_bp
atggatactgaaccaaacccgggaacatcttctgtgtcaacaacaaccagcagtaccacc
accaccaccatcaccacttcctcctctcgaatgcagcagccacagatctctgtctacagt
ggttcagaccgacatgctgtacaggtaattcaacaggcattgcatcggccccccagctca
gctgctcagtaccttcagcaaatgtatgcagcccaacaacagcacttgatgctgcatact
gcagctcttcagcagcagcatttaagcagctcccagcttcagagccttgctgctgttcag
atcaacctctccacttctcctacacctgcacagttaataagccgttcccaggcttccagt
tctaccagcggcagtattacccaacagactatgttactagggagtacttcccctacccta
acggcaagccaagctcaaatgtatctccgagctcaaatgctgattttcacacccgctacc
actgtggctgctgtacagtctgacattcctgttgtctcgtcgtcatcgtcatcttcctgt
cagtctgcagctactcaggttcagaatttaacattacgcagccagaagttgggtgtatta
tctagctcacagaatggtccaccaaaaagcactagtcaaactcagtcattgacaatttgt
cataacaaaacaacagtgaccagttctaaaatcagccaacgagatccttctccagaaagt
aataagaaaggagagagcccaagcctggaatcacgaagcacagctgtcacccggacatca
agtattcaccagttaatagcaccagcttcatattctccaattcagcctcattctctaata
aaacatcagcagattcctcttcattcaccaccttccaaagtttcccatcatcagctgata
ttacaacagcagcaacagcaaattcagccaatcacacttcagaattcaactcaagaccca
cccccatcccagcactgtataccactccagaaccatggccttcctccagctcccagtaat
gcccagtcacagcattgttcaccgattcagagtcatccctctcctttaacagtgtctcct
aatcagtcacagtcagcacagcagtctgtagtggtgtctcctccaccacctcattcacca
agtcagtctcctactataattattcatccacaagcacttattcagccacaccctcttgtg
tcatcagctctccagccagggccaaatttgcagcagtccactgctaatcaggtgcaagct
acagcacagttgaatcttccatcccatcttccacttccagcttcccctgttgtacacatt
ggcccagttcagcagtctgccttggtatccccaggccagcagattgtctctccatcacac
cagcaatattcatccctgcagtcctctccaatcccaattgcaagtcctccacagatgtcg
acatctcctccagctcagattccaccactgcccttgcagtctatgcagtctttacaagtg
cagcctgaaattctgtcccagggccaggttttggtgcagaatgctttggtgtcagaagag
gaacttccagctgcagaagctttggtccagttgccatttcagactcttcctcctccacag
actgttgcggtaaacctacaagtgcaaccaccagcacctgttgatccaccagtggtttat
caggtagaagatgtgtgtgaagaagaaatgccagaagagtcagatgaatgtgtccggatg
gatagaaccccaccaccacccactttgtctccagcagctataacagtggggagaggagaa
gatttgacttctgaacatcctttgttagagcaagtggaattacctgctgtggcatcagtc
agtgcttcagtaattaaatctccatcagatccctcacatgtttctgttccaccacctcca
ttgttacttccagctgccaccacaaggagtaacagtacatctatgcacagtagcattccc
agtatagagaacaaacctccacaggctattgttaaaccacagatcctaacccatgttatt
gaaggctttgtgattcaggagggattggagccatttcctgtgagtcgttcctctttgcta
atagaacagcctgtgaaaaaacggcctcttttggataatcaggtgataaattcagtgtgt
gttcagccagagctacagaataatacaaaacatgcggataattcatctgacacagagatg
gaagacatgattgctgaagagacattagaagaaatggacagtgagttgctcaagtgtgaa
ttctgtgggaaaatgggatatgctaatgaatttttgcggtcaaaacgattctgcactatg
tcatgtgccaaaaggtacaatgttagctgttctaaaaaatttgcacttagtcgttggaat
cgtaagcctgataatcaaagtcttgggcatcgtggccgtcgtccaagtggccctgatggg
gcagcgagagaacatatccttaggcagcttccaattacttatccatctgcagaagaagac
ttggcttctcatgaagattctgtgccatctgctatgacaactcgtctgcgcaggcagagc
gagcgggaaagagaacgtgagcttcgggatgtgagaattcggaaaatgcctgagaacagt
gacttgctaccagttgcacaaacagagccatctatatggacagttgatgatgtctgggcc
ttcatccattctttgcctggctgccaggatatcgcagatgaattcagagcacaggagatt
gatggacaggcccttctcttgctgaaagaagaccatctcatgagtgcaatgaatatcaag
ctaggcccagccctgaagatctgtgcacgcatcaactctctgaaggaatcttaa