GENSCAN 1.0	Date run: 3-Nov-116	Time: 23:16:39

Sequence gi568815591f:139241385_139445701 : 204317 bp : 41.06% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  10572  10673  102  2  0   99   68   153 0.999  13.85
 1.02 Intr +  17104  17241  138  0  0   22  113   141 0.989   9.74
 1.03 Intr +  17883  17986  104  2  2  118   71   179 0.999  17.25
 1.04 Intr +  19868  20357  490  2  1   86   86   434 0.995  35.08
 1.05 Intr +  24949  25019   71  0  2  106   83    39 0.874   2.26
 1.06 Intr +  28010  28139  130  2  1   44  110   124 0.968   9.98
 1.07 Intr +  30938  31104  167  1  2  104   54   120 0.353   8.04
 1.08 Intr +  31913  32026  114  2  0   62  110   188 0.995  17.04
 1.09 Intr +  32547  32690  144  0  0   40  116    89 0.966   5.38
 1.10 Intr +  37934  37976   43  2  1   54  115    36 0.903   0.22
 1.11 Intr +  41640  43190 1551  2  0   16   91  1070 0.717  88.67
 1.12 Intr +  51848  52079  232  1  1   52   89   120 0.671   4.92
 1.13 Intr +  52505  52597   93  0  0   92   93    59 0.933   5.82
 1.14 Intr +  56533  56624   92  2  2   39   61    51 0.047  -3.71
 1.15 Term +  80421  80552  132  2  0  130   45   175 0.779  14.51
 1.16 PlyA +  80909  80914    6                               1.05

 2.00 Prom +  80967  81006   40                              -4.25
 2.01 Init +  86847  86941   95  2  2   77   33    60 0.613  -0.70
 2.02 Term +  87854  87971  118  2  1  114   53   117 0.903   7.83
 2.03 PlyA +  89897  89902    6                               1.05

 3.00 Prom +  94395  94434   40                              -8.15
 3.01 Init +  98806  99133  328  0  1   93   65   274 0.781  23.13
 3.02 Intr +  99927 100138  212  1  2   -7   63   332 0.778  18.91
 3.03 Intr + 104117 104269  153  1  0   82   60   115 0.764   7.45
 3.04 Intr + 118632 118938  307  2  1   48  127   268 0.681  21.80
 3.05 Intr + 134073 134195  123  1  0   80   20   126 0.882   4.64
 3.06 Intr + 134678 134772   95  0  2   80   40    56 0.015  -1.24
 3.07 Intr + 157215 157313   99  2  0  121   42   110 0.741   9.09
 3.08 Intr + 160753 160863  111  0  0   66  116   123 0.901  12.56
 3.09 Intr + 164260 164403  144  0  0  119   84   167 0.999  18.96
 3.10 Intr + 165790 165966  177  0  0   63   83   185 0.999  14.69
 3.11 Intr + 168179 168270   92  1  2   64   64   111 0.174   4.17
 3.12 Intr + 176154 176345  192  0  0    8   57   264 0.159  12.99
 3.13 Term + 180779 180956  178  2  1   62   36   191 0.768   7.48
 3.14 PlyA + 181746 181751    6                               1.05

 4.03 PlyA - 182029 182024    6                               1.05
 4.02 Term - 196590 196422  169  2  1  104   42    90 0.234   2.37
 4.01 Intr - 204151 204055   97  2  1   99   59    35 0.103  -0.15

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 135283 135395  113  2  2   67   44   139 0.920   5.14

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:139241385_139445701|GENSCAN_predicted_peptide_1|1200_aa
GGKPRKHRKDRLQDLIDIGFGYDETDPFIDNSEAYDELVPASLTTKYGGFYINTGTLQFR
QASDTEEDDITDNQKHKPPKVPKIKEDDIEMKKRKRKEEGEKEKKPRKKVPKQLGVVALN
SHKSEKKKKRYKDSLSLAAMIRKFQKEKDALKKESNPKVPVTLSTPSLNKPPCAAAALGN
DVPDLNLSSGDPDLPIFVSTNEHELFQEAENALEMLDDFDFDRLLDAASDGSPLSESGGE
NGTTTQPTYTSQVMPKVVPTLPEGLPVLLEKRIEDLRVAAKLFDEEGRKKFFTQDMNNIL
LDIELQLQELGPVIRSGVYSHLEAFVPCNKETLVKRLKKLHLNVQDDRLREPLQKLKLAV
SNVMPEQLFKYQEDCQARSQAKCAKYVSLLFGLAFAFEKCILQTDEEREKNGSEEDDDEK
PGKRVIGPRKKFHWDDTIRTLLCNLVEIKLGCYELEPNKSQSAEDYLKSFMETEVKPLWP
KGWMQARAKKKVIPAPKPKVKECSPKKDQKTPTSLVASVSGPPTSSSTAAIAAASSSSAP
AQETICLDDSLDEDLSFHSPSLDLVSEALAVINNGNKGPPVGSRISMPTTKPRPGLREEK
LASIMSKLPLATPKKLDSTQTTHSSSLIAGHTGPVPKKPQDLAHTGISSGLIAGSSIQNP
KVSLEPLPARLLQQGLQRSSQIHTSSSSQTHVSSSSQAQIAASSHALGTSEAQDASSLTQ
VTKVHQHSAVQQNYVSPLQATISKSQTNPVVKLSNNPQLSCSSSLIKTSDKPLMYRLPLS
TPSPGNGSQGSHPLVSRTVPSTTTSSNYLAKAMVSQISTQGFKSPFSMAASPKLAASPKP
ATSPKPLPSPKPSASPKPSLSAKPSVSTKLISKSNPTPKPTVSPSSSSPNALVAQGSHSS
TNSPVHKQPSGMNISRQSPTLNLLPSSRTSGLPPTKNLQAPSKLTNSSSTGTVGKNSLSG
IAMNVPASRGSNLNSSGANRTSLSGGTGSGTQGATKPLSTPHRPSTASGSSVVTASVQST
AGASLLANASPLTLMTSPLSVTNQNVTPFGMLGGLVPVTMPFQFPLEIFGFGTDTAGVTT
TSGSTSAAFHHSLTQNLLKGLQPGGAQHAATLSHSPLPAHLQQAFHGLLFLSMFTFSRPK
HCGEEEKEKKTLLEQSQDSSRPGTAASKAGGGSMRKRNDTDRDPTDSRLEEERGFQVNDI

>gi568815591f:139241385_139445701|GENSCAN_predicted_CDS_1|3603_bp
ggtgggaaaccccgtaaacaccggaaggatcggctacaagatttaattgatataggcttt
ggctatgatgagacagatccatttattgataactcagaggcttatgatgaattagttccc gcttctctaacaacaaaatatggaggcttttatatcaacactggcactctacagtttcgc caagcttcagatactgaagaagatgatattacagacaaccaaaagcacaagccacccaag gtccccaaaataaaagaagatgatattgagatgaagaagcggaagcggaaagaggaaggg gaaaaggagaagaagccaaggaaaaaagttcccaaacaactgggagttgtggctctaaat tcacacaagtctgaaaaaaagaagaaacgttataaagattctctttctctagctgccatg attagaaaattccagaaagagaaggatgcattaaagaaggagtctaaccccaaagtccca gtgaccttgtcaaccccttctctgaataaacccccatgtgctgctgcagcactggggaat gacgtcccggacttaaatctgagcagcggtgatccagaccttcccatttttgttagcaca aatgaacatgagctgtttcaggaagctgaaaatgccctagagatgctagatgattttgac ttcgacagattactggatgctgcttctgatggtagccccctatctgagtcggggggtgaa aatggaaccaccacccagccaacctacacttctcaggttatgcccaaagtggtacctaca ctcccagagggtctacctgtacttcttgaaaaacgtatcgaagaccttcgtgtagctgcc aaactttttgatgaagaaggaaggaaaaaattctttacacaggatatgaataatattctt ctggacattgagttacagctacaagaactaggccctgtcattcgcagtggtgtctactcc caccttgaagcttttgtgccatgcaataaagaaacactagtaaaacgtctgaagaagtta catctcaatgtccaggatgatcgtttaagagaacctctgcaaaaactgaaactggctgtt agcaatgtcatgcctgaacagctatttaaataccaggaggactgccaggctcgtagtcaa gctaagtgtgccaagtatgtatctcttctatttggactggcttttgcatttgagaagtgt atattgcagacagatgaagaacgagaaaaaaatggatctgaagaggatgatgatgagaaa ccaggaaaacgtgtcataggaccaagaaagaaattccactgggatgacactatcagaact ttgttatgtaaccttgttgagatcaaattgggatgctatgagttagaaccaaataaaagc cagtctgctgaagattatcttaagtcttttatggagacagaagtgaagcccctgtggcct aagggctggatgcaggcaagggcaaagaaaaaggtgattcctgcacctaaacccaaagta aaggagtgtagtccaaaaaaggaccagaaaactccaacatccctggtggcttcggttagc ggtcctccaacgagctccagcacagctgccattgctgcagctagctctagctctgcacca gcccaagaaaccatctgcctcgacgactcactagatgaagacctttctttccattcacct tcactggatcttgtttctgaagctttagcggttatcaacaatgggaacaagggccctcca gttggctcaaggataagcatgccaaccacaaagcctcgtccaggactgagagaagaaaaa ttagcaagtatcatgagtaagctgccactagctactcccaaaaaactagattctactcag actacacattcttcaagtcttattgctggtcacacagggccagtaccaaagaaaccccag gatttagctcatactggcatctcttcaggccttattgctggttcttccattcagaaccct 
aaagtttctttagaacctttgccagccaggctacttcaacaaggacttcagaggtcaagc cagattcacacttcttcctcttcacagacccatgtctcctcttcttcccaagcccaaatt gctgcctcttctcatgctctgggaacatccgaggcccaagatgcttcttcgttaacacaa gtaacaaaggtgcaccagcattcagctgtccagcagaactatgtgtctccattacaggcc accatcagtaaatcccagaccaaccccgtcgtgaagttaagtaataatccccaactctcc tgttcctcctcacttattaagacttcagataagccacttatgtaccgccttcccttatct accccctcacctggaaatggttctcaagggtcccaccccctggtttctaggacagtacct agcaccactacctccagtaactatttagccaaggctatggtgtcacagatctccacgcag ggtttcaaatctcccttctcgatggctgcctccccaaaacttgccgcatctcccaagcct gccacatctcctaaacccctgccctcgcctaagccttctgcctcacccaagccctctctg tcagctaagccttcagtatcaactaaacttatttctaaatccaacccaactcccaagcct actgtatccccaagtagttccagtccaaatgcactagttgcccagggtagccactccagc actaacagcccagtccataaacagcccagtggaatgaacatcagcagacagtctcccacc ttgaatttattgccctctagtcgcacttcaggccttccacctacaaaaaatcttcaggcc ccctcaaagctaacaaactcatcatccactggaactgttgggaagaacagcttgagtgga attgcaatgaatgtacctgccagcagaggtagcaaccttaactcaagcggagctaatagg actagtctgtctgggggaacaggaagtggaacacagggtgctaccaaaccattgtctact ccacatagaccatccactgcctcagggtcttcagtggtaacagccagtgtgcagtccaca gcaggagcatcattattggctaatgcctcacctctgactctcatgacatcacctttgtct gtaacaaatcaaaatgtgactccttttgggatgctgggtggccttgttccagtgaccatg cccttccagtttcccttggagatatttggctttggaacggacacagctggagtgacaacc acctcgggatctacctcagccgctttccaccatagcctaactcagaatttactaaagggt ttacagccaggaggagctcagcatgcagcaacgctttcccactcacctctgcctgcacac ttacagcaagcatttcacgggctgctgtttctgtcgatgtttacattctctcgtcccaag cactgtggtgaggaggaaaaagaaaagaaaacattacttgagcaaagccaggacagcagc aggccagggacagcagccagcaaggccggaggaggcagtatgagaaagagaaatgacact gaccgtgaccccacggactccaggctggaggaggagagagggtttcaagtcaatgacatt tga >gi568815591f:139241385_139445701|GENSCAN_predicted_peptide_2|70_aa MMEGEEEARHILHGGRGESGGESATLLNHQLLGLALEMSLNNLDPDKKGQGEIGYSCDSP SQPLFQLLIC >gi568815591f:139241385_139445701|GENSCAN_predicted_CDS_2|213_bp atgatggaaggtgaagaagaagcaaggcatatcttacatggtggcaggggagagagtgga ggggaaagtgccacacttttaaaccatcagctcttgggccttgccttagagatgtctctg 
aacaatttggaccctgacaagaagggtcaaggagaaatcggatactcttgtgattccccc agtcaaccactgttccagttgcttatctgctga >gi568815591f:139241385_139445701|GENSCAN_predicted_peptide_3|736_aa MASTLQPEAEIDRGRRHLTAAGAKLLLEIRNTGKGPAASGKNPPELTFRGRVSKIRPICP ARCAPRRRMPAFAQARGLRRRRGDYCAGKRVRRSVRAPVQRPEHRCSFEGAWAGGGVFRV VGRRWAPRSEKDRTMAALGSPSHTFRGLLRELRYLSAATGRPYRDTAAYRYLVKAFRAHR VTSEKLCRAQHELHFQAATYLCLLRSIRKHVALHQEFHGKGERSVEESAGLNRTESEGTR VAVGGCRLPRTRRREERSPDSSRNPSGSLSARLEAAAATETAAHLPPSLPLIPQPKRARS APHPRPSARYAAAMSAQAQMRAMLDQLMGTSRDGLYALKRVILPSLTPPSSKRETLQLEQ NSGRNGVWGSVGIRRDTTRQRIKFSDDRVCKSHLLNCCPHDVLSGTRMDLGECLKVHDLA LRADYEIASKEQDFFFELDAMDHLQSFIADCDRRTEVAKKRLAETQEEISAEVAAKAERV HELNEEIGKLLAKVEQLGAEGNVEESQKVMDEVEKARAKKREAEEVYRNSMPASSFQQQK LRVCEVCSAYLGLHDNDRRLADHFGGKLHLGFIEIREKLEELKRVVAEKQEKRNQERLKR REEREREEREKLRRSRSREHRRHRSRSMSRERKRRTRSKSREKRHRHRSRSSSRSRSRSH QRSRHSSRDRSRERSKRRSSKERFRDQDLASCDRDRSSRDRSPRDRDRKDKKRSYESANG RSEDRRSSEEREAGEI >gi568815591f:139241385_139445701|GENSCAN_predicted_CDS_3|2211_bp atggcctcaacgcttcaaccggaagcagaaattgaccgcggcaggcgccatctaacggcc gctggagcaaagctcctcctggaaattcgcaacaccggaaaaggtccggctgcttccggt aaaaacccaccagagctgacgttcagagggcgagtctcgaagatccggccaatttgccca gcgcgctgtgctccgcgacggcgcatgcccgcttttgcgcaggcgcggggactacggcgc aggcgcggagactattgcgcaggcaagcgcgtacgcagaagcgtgcgcgcgcccgttcaa cgtccggagcatcggtgcagtttcgagggcgcctgggccggaggaggggttttcagggtc gtaggacgccgttgggcaccacgctcggagaaggacaggacaatggcggccttagggtcc ccgtcgcacacttttcgaggacttctgcgggagttgcgctacctgagcgcggccaccggc cgaccctatcgcgacaccgcggcctatcggtaccttgtgaaggctttccgtgcacatcgg gtcaccagtgaaaagttgtgcagagcccaacatgagcttcatttccaagctgccacctat ctctgcctcctgcgtagcatccggaaacatgtggccctacatcaggaatttcatggcaag ggtgagcgctcggtggaggagtctgctggcttgaaccggacggagagtgagggcacgagg gtcgctgtcgggggctgtcgtcttccacgtacacgtcgtcgtgaggagcgcagtccggac tcttcccgcaacccctccggctccctttccgcacgcctcgaggcggcggcggccaccgag acagcagcgcaccttcccccatcccttccccttatcccccagcccaaaagggcccggtct gcgccccacccccgcccgtccgcccgctacgccgccgccatgtcggcgcaggcccagatg 
cgcgcgatgctggaccagttgatgggcacctcccgggacggtttgtacgctcttaaacgc gttattctaccctctctaactcctcccagcagcaaaagggagacacttcagctggaacaa aatagtggaagaaatggcgtctggggttcagtggggattcgcagagatacaactcgtcaa cgaatcaaattcagtgatgacagagtatgcaagagtcaccttctcaactgttgtcctcat gatgtcctttctggaactagaatggatcttggagaatgtctgaaagtccatgacctggct ttaagagcggattatgaaattgcatccaaagaacaagattttttctttgaacttgatgcc atggatcatctgcagtcattcattgcagattgtgatcgtagaacagaagtggccaagaaa agattagcagaaactcaagaagagattagtgctgaagtagcagcaaaggcagaacgtgtt catgagttaaatgaagaaattggtaaattgttagccaaggtggaacaactaggagctgaa gggaatgtggaggaatcccagaaagtaatggatgaagtagagaaagcacgggcaaagaaa agagaagcagaggaagtttatcggaattctatgccagcttccagttttcagcagcagaaa cttcgagtctgtgaagtctgctctgcctatttaggacttcatgataatgacagacgactg gctgatcattttgggggtaaactgcacctgggatttattgaaataagagagaagcttgaa gaattaaagagagtcgtagctgagaagcaggagaaaagaaaccaggaacggctgaaacga agagaagagagagagagagaagaaagggagaagctgaggagatccaggtccagagagcat cgcagacatcgatctcgctccatgtcacgtgaacgcaagaggagaactcgatccaaatct cgggagaaacgccatcgccacaggtcccgctccagcagccgtagccgcagccgtagccac cagagaagtcggcacagttctagagataggagcagagaacgatccaagaggagatcctca aaagaaagattcagagaccaagacttagcatcatgtgacagagacaggagttcaagagac agatcacctcgtgacagagatcggaaagataagaagcggtcctatgagagtgctaatggc agatcagaagacaggaggagctctgaagagcgcgaagcaggggagatctaa >gi568815591f:139241385_139445701|GENSCAN_predicted_peptide_4|88_aa XVSQAHLSLLLPSSSPSMAEITCFEIICVLDYAPLAGATETAKLSGQGWGSPASPEVDEA AGSGGKGHGCRERQRTGPEMISNTQSLL >gi568815591f:139241385_139445701|GENSCAN_predicted_CDS_4|267_bp nctgtttctcaagcccacctgtcacttctccttccctcttcatctccttccatggctgaa atcacctgctttgaaatcatctgtgtgcttgattatgcgccactagctggagccacagaa acagccaaactcagtgggcaggggtgggggagtccagccagtcctgaagtagatgaagct gcaggctcaggtggcaaagggcatggatgcagggagaggcaaagaactggaccagagatg atctcaaacacacaatccctcctgtga