GENSCAN 1.0	Date run: 4-Nov-116	Time: 23:34:02

Sequence gi568815591r:93569829_93770077 : 200249 bp : 36.62% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 Intr -   5429   5184  246  0  0   87   84   196 0.265  15.61
 1.02 Intr -   5848   5737  112  1  1   62    6   115 0.208  -0.27
 1.01 Init -  19162  17498 1665  1  0   64   86   637 0.085  52.94
 1.00 Prom -  19623  19584   40                              -6.15

 2.02 PlyA -  19792  19787    6                               1.05
 2.01 Sngl -  21036  20020 1017  0  0   88   43   671 0.999  59.37
 2.00 Prom -  60670  60631   40                              -3.25

 3.06 PlyA -  60953  60948    6                               1.05
 3.05 Term -  66503  66374  130  1  1   91   42    89 0.788   1.27
 3.04 Intr -  67294  67250   45  0  0   98   86    38 0.571   1.21
 3.03 Intr -  72531  72416  116  0  2   15  109    93 0.772   2.43
 3.02 Intr -  82233  82154   80  1  2   90   76     9 0.002  -1.75
 3.01 Init -  89289  88929  361  0  1   71   42   146 0.522   5.59
 3.00 Prom -  89343  89304   40                              -5.65

 4.02 PlyA -  89460  89455    6                               1.05
 4.01 Sngl -  90647  89994  654  2  0   43   48   329 0.802  20.32
 4.00 Prom -  91909  91870   40                              -6.15

 5.06 PlyA -  92079  92074    6                               1.05
 5.05 Term -  93256  93134  123  1  0   82   43   141 0.303   6.40
 5.04 Intr - 100295 100097  199  1  1   44   20   157 0.109   2.93
 5.03 Intr - 107048 106911  138  1  0   50   37   154 0.003   5.16
 5.02 Intr - 146557 146411  147  0  0   74   80   115 0.103   7.73
 5.01 Intr - 194331 194188  144  2  0   81   69    56 0.006   1.48

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:93569829_93770077|GENSCAN_predicted_peptide_1|675_aa MQGEIKMFFETNENKDTTYQNLWYAFKAVCRGKFIALNAHKRKQERSKIDALTSQLKQLE KQEQTYSKASRRQELTKIRGELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKR EKHQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEE VESLNRPITGSETVAIINSLPTKKSLGPDGFTAELYQRYKEELVPFLLKLFQSIEKEGIL PNSFYEASIILIPKPGKDTTKKENFRPISLMNIDAKILNKILAKRIQQHIKKLIHHDQVG FIPGMQGWFNIRKSINVIQHINRAKDKNHTIPSIDAEKAFDKIQQPFMLKTLNKLGIDGT YLKIIRAIDDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEI KGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLMSNFSKISGYKINVQKSQAFLYTN NRQTESQIMSEFPFTIASKRIKYLGIQVTRHVKELFKENYKPLLKEIKEDTNKWKNIPCS WVGRINIVKMAILPKGVPASLRCAPPGIGLSRTPADIEGFRGKEEESQCPAPSSLVLSPE WRLKKPGSQCLWDKVLVETHLPPEFAFSGEGTGLPESCLPVQTVAPLSLPTQGLTRVPRL LPVFTRRFSLPDSQX >gi568815591r:93569829_93770077|GENSCAN_predicted_CDS_1|2025_bp atgcagggagaaataaagatgttctttgaaaccaatgagaacaaagacacgacataccag aatctctggtacgcattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatccaaaattgatgccctaacatcacaattaaaacaactagaa aagcaagagcaaacatattcaaaagctagcagaaggcaagaactaactaaaatcagagga gaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatccaggagctgg ttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaagaaaaaaaga gagaagcatcaaatagatgcaataaaaaatgataaaggggatatcaccaccgatcccaca gaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaactagaaaat ctagaagaaatggataaattcctcgacacatacactctcccaagactaaaccaggaagaa gttgaatctctgaatagaccaataacaggctctgaaactgtggcaataatcaatagctta ccaaccaaaaagagtctaggaccagatggattcacagccgaactctaccagaggtacaag gaggaactggtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctc cctaactctttttatgaggccagcatcattctgataccaaagccaggcaaagacacaaca aaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaa atactggcaaaacgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggc ttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcat ataaacagagccaaagacaaaaaccacacgattccctcaatagatgcagaaaaggccttt 
gacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtattgatgggacg tatctcaaaataataagagctatcgatgacaaacccacagccaatatcatactgaatggg caaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctctcacca ctcctattcaacatagtgttggaagttctggccagggcaattaggcaggagaaggaaata aagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgatt gtatatctagaaaaccccattgtctcagcccaaaatcttcttaagctgatgagcaacttc agcaaaatctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaac aacagacaaacagagagccaaatcatgagtgaattcccattcacaattgcttcaaagaga ataaaatacctaggaatccaagttacaaggcatgtgaaggagctcttcaaggagaactac aaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgctca tgggtaggaagaatcaatattgtgaaaatggccatactgcccaagggcgtgcctgccagc ctccgctgcgccccgcctggtattggattgtcccggactcctgctgacatcgagggcttt agggggaaagaagaggagtcgcagtgtccggcaccaagctccctagtgctgtccccggag tggcggctgaagaagccagggtcacaatgtctctgggataaggttcttgtggaaactcac ctccctccggaatttgcattctccggggaggggacagggctcccagaaagctgtctccca gtccagactgtcgcccccctctccctccctactcaaggtctaactcgggtccctcgcctg cttcctgtgtttacgcggcgctttagtctcccggactcgcaggnn >gi568815591r:93569829_93770077|GENSCAN_predicted_peptide_2|338_aa MGKKQNRKTGNSKKQSASPPPKERSSSPATEQSWMEHDFDELREEGFRRSNYSELWEDIQ TKVKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSP MEDEMNEMKGEGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLARLANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRL TLKEKPIRLTTDLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK QMLRDFDTTRPALKELLKEVLNMERNNRYQPLQNHAKM >gi568815591r:93569829_93770077|GENSCAN_predicted_CDS_2|1017_bp atggggaaaaaacagaacagaaaaactggaaactctaaaaagcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagcatgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctatgggaggacattcaa acaaaagtcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcaccg atggaagatgaaatgaatgaaatgaagggagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg 
attggggtacctgaaagcgatggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggctggccaacgttcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaacaccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcggctt accctcaaagagaagcccatcagactaacaacggatctctcggcagaaaccctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaattaagcttcataagcgaaggagaaataaaatactttacagataag caaatgctgagagattttgacaccaccaggcctgccctaaaagagctcctgaaggaagtg ctaaacatggaaaggaacaaccggtaccagcccctgcaaaatcatgccaaaatgtaa >gi568815591r:93569829_93770077|GENSCAN_predicted_peptide_3|243_aa MGNDFMTKTPKAMATKAKIETHDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYPSGKGL ISRIYKELKQIYKKKIKQPHQKVGKEYEQTLFKRRHLCSQQTHEKVLIITGHQRNANQNH SNDIYLSPLSSDFQLDMTISSHGKIWRASEAIGQCQSSAAKPRRSGQESVRQPWTRVPGA LEVAASLVSVHIPSWKRLLLRSSYKLFQPIANKEIFESTYYLEVFHFKLSCLYGLNQCPS YTY >gi568815591r:93569829_93770077|GENSCAN_predicted_CDS_3|732_bp atgggcaatgacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaatagaa acacatgatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttgcaatctacccatctggcaaaggtcta atatccagaatctacaaagaacttaaacaaatttacaagaaaaaaatcaaacaaccccat caaaaagtgggcaaagaatatgaacagacacttttcaaaagaagacatttatgcagccaa cagacacatgaaaaagtgctcatcatcactggtcatcagagaaatgcaaatcaaaaccac agtaatgacatttatttgtctcccttgtcttctgactttcaactagacatgaccattagt agccatggcaaaatttggagggcttctgaggcgatcgggcagtgtcagtcttcagctgct aagccgagaagatctgggcaggagtcagtcagacagccttggaccagagtcccaggggct ctggaagtggctgccagtctggtttctgtccacatcccttcatggaaacgacttttacta aggtcttcctataaactctttcaaccaattgccaataaggaaatctttgaatccacctac tacctggaagttttccacttcaagttgtcctgcctttatggactgaatcagtgtccatct tacacgtattga >gi568815591r:93569829_93770077|GENSCAN_predicted_peptide_4|217_aa MNIDAKILNKILANQNQQHIKKLIHQDQVGFIPGMQGWFNIGKSINIIHHVNRTKDKNHM IISIDAEKVFDKIQQPFMLKTLNKQGIDVTYLKIIRGIYDKPTANIILNGQNLEAFPLKT GTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLAKEEVKLSLFADDMIVYLENPIISA QNLLKLISNFSKVSGCKINVQKSQAFLYTNNRQRAKS 
>gi568815591r:93569829_93770077|GENSCAN_predicted_CDS_4|654_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaccaaaatcagcagcacatc aaaaagcttatccaccaagatcaagttggcttcatccccgggatgcaaggctggttcaac ataggcaaatcaataaacataatccaccatgtaaacagaaccaaagacaaaaaccacatg atcatctcaatagatgcagaaaaggtctttgacaaaattcaacagcccttcatgctaaaa actcttaataaacaaggtattgatgtaacgtatctcaaaataataagaggtatttatgac aaacccacagccaatatcatactgaatggacaaaacctggaagcattccctttgaaaact ggcacaagacaaggatgccctctctcaccactcttattcaacatagtgttggaagttctg gccagggcaatcaggcaggagaaagaaataaagggtattcaattagcaaaagaggaagtc aaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccatcatctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatgcaaaatcaatgtg caaaaatcacaagcattcttatacaccaacaacagacagagagccaaatcatga >gi568815591r:93569829_93770077|GENSCAN_predicted_peptide_5|250_aa XISSTVLNKNGETGHHCLVPVLKENGSSFCLFSTMLAVDLSQMVLIILSFLHKSLHIAGV QEIFVDWMDVQMNDSKEATSDKGGGSWAMRIQGTPVGSGILITPINTKEARDEILMKCKA MRDQVVIAFKHYSDDGAYRDFGTRSKRSFLAVTTYPYENMPLTKDLLHPSPEEKRKHKKK RLVQSPNSYFMDVKCPRCYKITTVFSHAQTDRSSSPAMEQSRTENDFDELTEVGFRKSVI TNVSKLKEDV >gi568815591r:93569829_93770077|GENSCAN_predicted_CDS_5|753_bp nngatttccagtactgtgttgaataagaatggtgagactgggcatcattgcctggttcca gttctcaaggagaatggttccagcttttgcctgttcagtacgatgttggctgtcgatttg tcacaaatggttcttattattttgagcttcttgcacaagagcttgcatatagctggagtt caagaaatatttgtggattggatggatgtacaaatgaatgacagtaaagaggctacatct gacaagggaggaggaagctgggcaatgaggatccaggggacaccagtagggagtggcatc ctaattactccaataaacaccaaggaggcacgggatgaaatcctgatgaagtgcaaagcg atgagagatcaagtggttattgcatttaaacactacagtgatgatggcgcttacagagat tttggcaccagaagtaaaagatcctttctggcagtgacgacctacccatacgagaacatg cctctcacaaaggatctccttcatccctctccagaagagaagaggaaacacaagaagaaa cgcctggtgcagagccccaattcctacttcatggatgtgaaatgcccaagatgctataaa atcaccacggtctttagccatgcacaaacggatcgtagctcctcgccagcaatggaacaa agcaggacagagaacgactttgacgagttgacagaagtaggcttcagaaagtcggtaata acaaacgtctccaagctaaaggaggatgtttga