GENSCAN 1.0    Date run: 8-Nov-116    Time: 13:31:40

Sequence gi568815589r:112779765_112990681 : 210917 bp : 43.06% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init -   3442   3340  103  1  1   94   91    58 0.194   7.10
 1.00 Prom -  13305  13266   40                              -3.76

 2.03 PlyA -  14307  14302    6                               1.05
 2.02 Term -  16804  16707   98  2  2   81   45   167 0.987   9.83
 2.01 Init -  21506  18593 2914  2  1   31   47   678 0.496  49.44
 2.00 Prom -  22080  22041   40                              -8.36

 3.02 PlyA -  22249  22244    6                               1.05
 3.01 Sngl -  23493  22477 1017  0  0   88   43   571 0.991  49.64
 3.00 Prom -  24097  24058   40                             -10.25

 4.00 Prom +  24105  24144   40                             -16.10
 4.01 Init +  24211  24297   87  0  0   68   39   127 0.472   4.44
 4.02 Intr +  25012  25203  192  0  0  121   56   152 0.449  14.99
 4.03 Intr +  37941  38051  111  2  0  121   55    79 0.059   8.48
 4.04 Intr +  50961  51119  159  2  0   67  116   128 0.994  13.68
 4.05 Intr +  56450  56645  196  1  1   61   71   239 0.993  18.49
 4.06 Intr +  58734  58933  200  1  2   83  121   140 0.993  15.87
 4.07 Intr +  66996  67046   51  2  0  102   48    50 0.314   1.50
 4.08 Intr +  71095  71181   87  0  0  109   77   115 0.846  12.67
 4.09 Intr +  84483  84635  153  2  0  112   90   197 0.691  22.57
 4.10 Intr +  89020  89075   56  0  2   78   42    57 0.081  -2.12
 4.11 Intr +  95405  95527  123  2  0  102   53    56 0.138   3.20
 4.12 Term + 101664 101775  112  0  1   70   38   107 0.679   1.93
 4.13 PlyA + 103771 103776    6                               1.05

 5.10 PlyA - 104115 104110    6                               1.05
 5.09 Term - 106286 106238   49  1  1   90   49    44 0.654  -2.52
 5.08 Intr - 106852 106696  157  2  1   83   63   160 0.795  12.07
 5.07 Intr - 107649 107566   84  1  0   52  101   104 0.938   7.89
 5.06 Intr - 110952 109789 1164  1  0   30   97  1260 0.017 109.83
 5.05 Intr - 112571 112465  107  1  2   90   66    34 0.012   1.26
 5.04 Intr - 166226 165929  298  0  1  142   47   114 0.287   8.63
 5.03 Intr - 179687 179586  102  0  0   65  111    55 0.872   5.65
 5.02 Intr - 180488 180464   25  2  1   83  108    10 0.734   0.10
 5.01 Init - 182497 182402   96  1  0   83   75   101 0.933   7.07
 5.00 Prom - 184723 184684   40                              -1.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 110917 109789 1129  1  1   49   97  1259 0.888 115.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:112779765_112990681|GENSCAN_predicted_peptide_1|35_aa
MAPEISLNSYLPDKKPPTTTTFTFASVASGEVPGX

>gi568815589r:112779765_112990681|GENSCAN_predicted_CDS_1|105_bp
atggcgccagagatttcattaaacagttacctaccggacaagaaacccccaactaccacc
acattcacctttgcatctgtggcatctggagaagtgcctggtgnn

>gi568815589r:112779765_112990681|GENSCAN_predicted_peptide_2|1003_aa
MPTRESRKDPKLTPTLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINE
SRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYAN
KLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFY
QRYMEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAK
ILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDA
EKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGC
PLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKL
ISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMGELPFTIASKRIKYLGIQLTRDVKDLF
KENYKPLLKEIKEETNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELE
KTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNR
TEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINS
RWIKDLNVKPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKDKIDKWDLIKLKSFCT
AKETTIRVNRQPTEWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFS
KEDIYAVKKHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLL
HCWWDCKLVQPLWKSVWRFLRDLELEIPFDPAIPLLGIYPNEYKSCCYKDTCTRMFIAAL
FTIAKTWNQPKCPTMIDWIKKMWHIYTMEYYAAIKNDEFISFVGTWMKLETIILSKLSQE
QKTKHRIFSLIGQMDKLISWDVIPSTAPLSVDVTAESNYSSSH

>gi568815589r:112779765_112990681|GENSCAN_predicted_CDS_2|3012_bp
atgcctacaagagaaagcaggaaagatccaaaattgacacccaccctaacatcacaatta
aaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataact
aaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaa
tccaggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaata
aagaaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcacc
accgatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaat
aaactagaaaatctagaagaaatggatacattcctcgacacatacactctcccaagacta
aaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaattgtggcaata
atcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctac
cagaggtacatggaggaactggtaccattccttctgaaactattccaatcaatagaaaaa
gagggaatcctccctaactcattttatgaggccagcatcattctgataccaaagccgggc
agagacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaa
atcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccat
gatcaagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaat
gtaatccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgca
gaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggt
attgatgggacgtatttcaaaataataagagctatctatgacaaacccacagccaatatc
atactgaatgggcaaaaactggaagcattccctttgaaaaccggcacaagacagggatgc
cctctctcaccgctcctattcaacatagtgttggaagttctggccagggcaatcaggcag
gagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgca
gacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctg
ataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattc
ttatacaccaacaacagacaaacagagagccaaatcatgggtgaactcccattcacaatt
gcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttc
aaggagaactacaaaccactgctcaaggaaataaaagaggagacaaacaaatggaagaac
attccatgttcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggta
atttacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaa
aaaactactttaaagttcatatggaaccaaaaaagagcccgcattgccaagtcaatccta
agccaaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggct
acagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacaga
acagagccctcagaaataatgccgcatatctacaactatctgatctttgacaaacctgag
aaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggcta
gccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattca
agatggattaaagatttaaacgttaaacctaaaaccataaaaaccctagaagaaaaccta
ggcattaccattcaggacataggcgtgggcaaggacttcatgtccaaaacaccaaaagca
atggcaacaaaagacaaaattgacaaatgggatctaattaaactaaagagcttctgcaca
gcaaaagaaactaccatcagagtgaacaggcaacctacagaatgggagaaaatttttgca
acctactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttac
aagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctca
aaagaagacatttatgcagtcaaaaaacacatgaagaaatgctcatcatcactggccatc
agagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttagaatggcaatc
attaaaaagtcaggaaacaacaggtgctggagaggatgcggagaaataggaacactttta
cactgttggtgggactgtaaactagttcaaccattgtggaagtcagtgtggcgattcctc
agggatctagaactagaaataccatttgacccagccatcccattactgggtatataccca
aatgagtataaatcatgctgctataaagacacatgcacacgtatgtttattgcggcacta
ttcacaatagcaaagacttggaaccaacccaaatgtccaacaatgatagactggattaag
aaaatgtggcacatatacaccatggaatactatgcagccataaaaaatgatgagttcata
tcctttgtagggacatggatgaaattggaaaccatcattctcagtaaactatcgcaagaa
caaaaaaccaaacaccgcatattctcactcatagggcagatggacaagctcatcagctgg
gacgtcataccttcaacagcccctctcagtgtggacgtgactgcagagtccaactacagc
agcagccactga

>gi568815589r:112779765_112990681|GENSCAN_predicted_peptide_3|338_aa
MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ
TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVST
MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDVENGTKLENTLQD
IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV
TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFIDK
QMLRDFVTTRPALKELLKEALNMERNNRYQPLQNHAKM

>gi568815589r:112779765_112990681|GENSCAN_predicted_CDS_3|1017_bp
atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct
ccaaaggaacgcagttcctcaccagcaacagaacaaagctggatggagaatgattttgac
gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcaaca
atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa
agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg
attggtgtacctgaaagtgatgtggagaatggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata
cagagaacaccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt
accctcaaaggaaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc
agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt
tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttatagacaag
caaatgttgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcg
ctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa

>gi568815589r:112779765_112990681|GENSCAN_predicted_peptide_4|508_aa
MPRPASARARCAHTLACAHCLALPSEMNPDLILPNGGTPAGTSSPASSSSLLNRLQLDDD
IDGETRDLFVIVDDPKKHVCTMETYITYRITTKSTRVEFDLPEYSVRRRYQDFDWLRSKL
EESQPTHLIPPLPEKFVVKGVVDRFSEEFVETRRKALDKFLKRITDHPVLSFNEHFNIFL
TAKDLNAYKKQGIALLTRMGESVKHVTGGYKLRTRPLEFAAIGDYLDTFALKLGTIDRIA
QRIIKEEIEYLVELREYGPVYSTWSALEGELAEPLEGVSACIGNCSTALEELTDDMTEDF
LPVLREYILYSDSMKPCILAPLFHKALKLKNLSVLKKRDQVQAEYEAKLEAVALRKEDRP
KVPADVEKCQDRMECFNADLKADMERWQNNKRQDFRQLLMGMADKNIQYYEKCLMAWESI
IPLLQEKQEAKPSLPSIYPLLHLPHLHQSGEMKKLPSVVAIMTGRQADSQLRAGKWRVKT
WKQPAQGHMILDPCCGPLQHRHKAMASI

>gi568815589r:112779765_112990681|GENSCAN_predicted_CDS_4|1527_bp
atgcctcgccctgcttcggctcgcgcacggtgcgcacacacactggcctgcgcccactgt
ctggcactccctagtgagatgaacccggatctcattttgcccaacggtggtactccagca
ggtacttcaagtccagcttcttcatcttcccttctcaacagacttcagcttgatgatgat
attgatggtgagactagagatctcttcgttatagttgatgatcccaagaagcatgtgtgt
acaatggagacttacatcacctataggatcaccaccaaaagtactcgggtggagtttgac
ctgccagaatattctgttcgtcgaagataccaggattttgactggttgaggagcaaactg
gaagaatcccagcccactcatctcattccccctcttcccgagaagtttgtggtaaaaggt
gttgtggatcgtttttcagaagagtttgtggagaccagaagaaaagctttggataaattt
ctaaaaagaattacggaccatcctgtgctgtctttcaatgaacactttaatattttcctt
actgctaaggacctgaacgcctacaagaagcaagggatagcattgctgaccagaatgggc
gagtcagtcaagcacgtcactggcggctacaagctgaggactcggccgcttgagtttgct
gccataggtgactacttagatacatttgcactcaaactgggaaccattgatcgaatagcc
cagcggatcatcaaagaagaaatagagtaccttgtggagctgagagaatacgggcctgtg
tactccacatggagcgccttggagggtgagctggctgaacccctggagggtgtgtcagct
tgcattgggaactgctctacagccttagaagagctgacagatgacatgacagaagacttc
ctacctgtgctcagggaatatattttatactctgactccatgaagccctgcatcctagct
cccctcttccacaaagctttgaagctgaaaaacttgagtgtattgaagaagagggaccaa
gttcaagcagagtatgaagccaaactggaagctgtggctctgcggaaggaagaccgcccc
aaggtaccggcggacgtcgagaaatgtcaggatcggatggagtgtttcaatgctgacctg
aaagctgacatggagaggtggcagaacaacaagaggcaggacttccggcagctactcatg
gggatggctgacaagaacatccagtattatgagaagtgcctcatggcgtgggagtcgatt
attccactactgcaggagaaacaagaggccaagcccagtttaccctcaatttaccctctt
ctacatcttccacatctgcaccaatctggggagatgaagaaacttcctagtgtagtagcc
atcatgactggcaggcaggccgacagccagctgagggctggcaagtggagagtgaagact
tggaagcagccagcacagggccacatgatccttgacccctgctgtggaccactacagcac
aggcacaaggcgatggccagcatctga

>gi568815589r:112779765_112990681|GENSCAN_predicted_peptide_5|693_aa
MMLLLLCLGLTLVCAQEEENNDAVTSNFDLSKHIAQDIASIRTPDVSSQLKERFVKYCEE
HGIDKENIFDLTKVELRDEGGGHWVRGSWIGPFRSAPVSRGALENALAGSQGLSNALLWL
VSPAGERYPASGQIPGAPGGRLRASQALPAAAARAQFRKSQVSSGAQAEASSKRIKVFGL
FAVASGKIKNGVGEPGLGWVKHVSPKVTLARKPFDPHQRGHMSPEVTCPRRGHLPRFHPR
TWVEPVVASSQVAASLYDAGLLLVVKASYGTGGSSNHSASPSPRGALEDQQQRAISNFYI
IYNLVVGLSPLLSAYGLGWLSDRYHRKISICMSLLGFLLSRLGLLLKVLLDWPVEVLYGA
AALNGLFGGFSAFWSGVMALGSLGSSEGRRSVRLILIDLMLGLAGFCGSMASGHLFKQMA
GHSGQGLILTACSVSCASFALLYSLLVLKVPESVAKPSQELPAVDTVSGTVGTYRTLDPD
QLDQQYAVGHPPSPGKAKPHKTTIALLFVGAIIYDLAVVGTVDVIPLFVLREPLGWNQVQ
VGYGMAAGYTIFITSFLGVLVFSRCFRDTTMIMIGMVSFGSGALLLAFVKETYMFYIARA
VMLFALIPVTTIRSAMSKLIKGSSYGKVFVILQLSLALTGVVTSTLYNKIYQLTMDMFVG
SCFALSSFLSFLAIIPIRALCSVRLKSPSRWTV

>gi568815589r:112779765_112990681|GENSCAN_predicted_CDS_5|2082_bp
atgatgctgctgttgctgtgtctggggttgaccctcgtctgtgcccaggaggaagaaaac
aatgatgctgtgacaagcaacttcgatctgtcaaagcacattgctcaggatatagcatca
attcgaacaccggatgtgagctcacaactcaaggagaggtttgtgaaatattgtgaagaa
catgggattgataaggaaaacatatttgacttgaccaaagttgaacttagagatgaggga
ggtggccactgggtgcggggatcgtggattggtcccttcagatccgccccggtgtccaga
ggagccctagagaatgccttggccggctcgcagggcctctccaacgccctgctctggctc
gtctcacctgcaggagagaggtaccctgcgagtgggcagatacctggcgcccccggtggc
cgtttgcgcgcatctcaggctctgccagccgctgccgcccgggcgcagttcagaaagagc
caggtgtcctccggagcccaggccgaggcctcttccaaaaggatcaaggtgtttggactg
tttgctgtagcctctggaaaaattaagaatggagtgggagagccaggcttgggttgggtg
aaacatgtatctcccaaggtgaccctagccagaaagcccttcgatccccatcagagaggt
cacatgagccccgaggtcacctgcccgcggaggggccacctgcctcgcttccacccgagg
acctgggttgagcccgtggtggcctcgtcccaggtggctgcctccctctacgatgcgggg
ctactcctcgtggtgaaggcgtcctacggaaccggaggctcctccaaccacagtgccagc
ccatcgccccggggggctctagaggaccaacagcagagagccatctccaatttctacatt
atctacaaccttgtggtgggcctgtcccccctgctgtccgcctacgggctgggatggctc
agcgaccgctaccaccgaaagatctccatctgcatgtcgctgctgggcttcctgctctcc
cgcctcgggctgctgctcaaggtgctgctggactggccagtggaggtgctgtacggggcg
gcggcgctgaacgggctattcggcggcttctccgccttctggtccggggtcatggcgctg
ggatcgctgggctcctccgagggccgccgctctgtgcgcctcatcctcattgacctgatg
ctgggcttggcggggttctgcgggagcatggcttccgggcatctcttcaagcagatggct
gggcactctgggcagggcctgatactgacggcctgcagcgtgagctgtgcctcgtttgcc
ctgctctacagccttttggtgctaaaggtccctgagtcggtggccaaacccagccaggag
ctccccgccgtggataccgtgtctggcacggttggcacataccgcactctggatcctgat
cagttggaccaacagtatgcagtggggcaccctccatctcctggaaaagcaaaaccccat
aaaaccaccattgccttgctctttgtgggtgctatcatatatgacctggcggtggtgggc
acagtggacgtgatccctctttttgtgctgagggagcctctcggttggaaccaagtgcag
gtgggctatggtatggctgcagggtacaccatcttcatcaccagcttcctgggtgtcctg
gtcttctcccgctgctttcgggacaccaccatgatcatgattgggatggtctcctttggg
tcaggagccctcctcttggcttttgtgaaagagacatacatgttctatattgctcgagcc
gtcatgctgtttgctctcatccccgtcacaaccatccgatcagctatgtccaaactcata
aagggctcctcttatggaaaggtgttcgtcatactgcagctgtccttggctctgaccggc
gtggtgacatccaccttgtacaacaagatctaccagctcaccatggacatgtttgtgggc
tcctgctttgctctctcctcctttctctccttcctggccatcattccaattagggctctg
tgctcagtgcggctgaaatctcccagccgctggactgtgtga