GENSCAN 1.0    Date run: 5-Nov-116    Time: 16:39:45

Sequence gi568815593f:173044546_173263918 : 219373 bp : 45.07% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -     12      7    6                               1.05
 1.05 Term -  11294  11190  105  2  0   43   42   160 0.812   5.31
 1.04 Intr -  12316  12150  167  2  2  127   23    59 0.680   2.98
 1.03 Intr -  34542  34448   95  2  2   52   86    77 0.319   3.51
 1.02 Intr -  41360  41224  137  2  2   24   91   132 0.350   6.47
 1.01 Init -  43665  43648   18  0  0   62   89    31 0.405   0.65
 1.00 Prom -  44002  43963   40                              -9.85

 2.00 Prom +  44214  44253   40                              -4.96
 2.01 Init +  45779  46856 1078  1  1   69   67   688 0.091  59.36
 2.02 Intr +  64079  64273  195  0  0   33   95   129 0.942   7.49
 2.03 Intr +  65977  66166  190  2  1   92   99    86 0.414   8.74
 2.04 Intr +  67761  67834   74  0  2   79   52    66 0.343   1.15
 2.05 Intr +  78535  78657  123  2  0   90  103    72 0.613   9.46
 2.06 Term +  89085  89200  116  1  2  100   37    15 0.184  -3.67
 2.07 PlyA +  89547  89552    6                              -0.45

 3.00 Prom +  93053  93092   40                              -3.36
 3.01 Init + 100001 100084   84  1  0  115   84   115 0.939  14.62
 3.02 Intr + 102321 102413   93  2  0   70   94    27 0.714   1.66
 3.03 Intr + 109777 109868   92  0  2   74   96    73 0.975   5.49
 3.04 Intr + 115388 115506  119  2  2   97   71   156 0.999  14.91
 3.05 Term + 119180 119376  197  0  2   94   50   290 0.982  23.37
 3.06 PlyA + 119791 119796    6                               1.05

 4.05 PlyA - 119808 119803    6                               1.05
 4.04 Term - 130922 130808  115  1  1   40   43    97 0.299  -1.56
 4.03 Intr - 133058 132951  108  1  0   66   99    45 0.494   2.80
 4.02 Intr - 139433 139279  155  2  2   83  102    67 0.172   6.47
 4.01 Init - 149152 149078   75  1  0   80   79    31 0.021   2.49
 4.00 Prom - 160636 160597   40                              -5.36

 5.03 PlyA - 161173 161168    6                              -0.45
 5.02 Term - 161391 161241  151  2  1   84   43   141 0.944   6.58
 5.01 Init - 164241 164171   71  0  2   49   78    63 0.707   1.92
 5.00 Prom - 164405 164366   40                              -2.16

 6.00 Prom + 168181 168220   40                              -7.76
 6.01 Sngl + 169154 169336  183  1  0   97   48   215 0.488  11.66
 6.02 PlyA + 169730 169735    6                               1.05

 7.09 PlyA - 174016 174011    6                               1.05
 7.08 Term - 181473 181291  183  0  0   62   36   226 0.987  12.34
 7.07 Intr - 184434 184042  393  0  0   32   65   135 0.278   0.45
 7.06 Intr - 185419 185217  203  2  2   98   60   116 0.941   8.80
 7.05 Intr - 187086 186919  168  1  0  -21   65   140 0.079   0.72
 7.04 Intr - 188664 188041  624  1  0  103   23  1145 0.124 101.72
 7.03 Intr - 190680 190205  476  2  2   80  100   362 0.041  29.31
 7.02 Intr - 208058 207965   94  0  1  140  100    20 0.518   7.42
 7.01 Intr - 213977 213841  137  1  2  107   95    43 0.354   7.11

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  62392  62440   49  0  1   77   58    66 0.879   1.61
S.002 Term - 188664 188024  641  1  2  103   50  1143 0.826 106.37

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:173044546_173263918|GENSCAN_predicted_peptide_1|173_aa
MEEELQTPFLLVGTQTDLRDDPSTIEKLAKNKQKPITPETAKKLTHDLKAISLSLCDLMP
SSYNWYVVIIIALILSPDSYEKEGRLPAAAAPGSEAAGKGRAARQDGRPALPIPGLAYCP
PASPIYRRPALLCNRPGPGHGFRGTGINQTACTFEERDPYTTVQPECHYYRSA
>gi568815593f:173044546_173263918|GENSCAN_predicted_CDS_1|522_bp
atggaagaagagctgcagactcctttcttgcttgttgggacccaaactgatctcagagat
gacccctctactattgagaaacttgccaagaacaaacagaagcctatcactccagagact
gctaaaaagctgacccatgacctgaaggccatcagtctctccttatgtgacctcatgccg
tcgtcctacaactggtatgtagtcataatcattgccctaattctctcccctgattcctat
gagaaagaaggccgactgccggcagcagcggccccgggctcggaggcagcggggaagggc
cgggcggcccggcaggacggacgcccggcgctgcccatccccggcctagcctactgcccg
cccgcgagtcccatctaccgccgccccgcgcttttatgtaaccgtcccgggccggggcac
ggattccgaggcacagggatcaatcagactgcgtgcacctttgaagagagagacccgtat
acgactgtacaaccagaatgccactactacagaagcgcctag
>gi568815593f:173044546_173263918|GENSCAN_predicted_peptide_2|591_aa
MNYQQNPRDNFLSLEDCKDIENLESFTDVLDNEGALTSNWEQWDTYCEDLTKYTKLTSCD
IWGTKEVDYLGLDDFSSPYQDEEVISKTPTLAQLNSEDSQSVSDSLYYPDSLFSVKQNPL
PSSFPGKKITSRAAAPVCSSKTLQAEVPLSDCVQKASKPTSSTQIMVKTNMYHNEKVNFH
VECKDYVKKAKVKINPVQQSRPLLSQIHTDAAKENTCYCGAVAKRQEKKGMEPLQGHATP
ALPFKETQELLLSPLPQEGPGSLAAGESSSLSASTSVSDSSQKKEEHNYSLFVSDNLGEQ
PTKCSPEEDEEDEEDVDDEDHDEGFGSEHELSENEEEEEEEEDYEDDKDDDISDTFSEPG
YENDSVEDLKEVTSISSRKRGKRRYFWEYSEQLTPSQQERMLRPSEWNRDTLPSNMYQKN
GLHHGKYAVKKSRRTDVEDLTPNPKKLLQIGNELRKLNKVISDLTPVSELPLTARPRSRK
EKNKLASRACRLKKKAQYEANKVKLWGLNTEYDNLLFVINSIKQEIVNRVQNPRDERGPN
MGQKLEILIKDTLGLPVAGQTSEFVNQVLEKTAEGNPTGGLVGLRIPTSKV
>gi568815593f:173044546_173263918|GENSCAN_predicted_CDS_2|1776_bp
atgaactaccaacagaatcctagagacaactttctttctttggaggactgcaaagacatt
gaaaatctggagtctttcacagatgtcctggataatgagggtgctttaacctcaaactgg
gaacagtgggatacatactgtgaagacctaacgaaatataccaaactaaccagctgtgac
atctggggaacaaaagaagtggattacttgggtcttgatgacttttctagtccttaccaa
gatgaagaggttataagtaaaactccaactttagctcaacttaatagtgaggactcacag
tctgtttctgattccctttattaccccgattcacttttcagtgtcaaacaaaatccctta
ccctcttcattccctggtaaaaagatcacaagcagagcagctgctcctgtgtgttcttct
aagactctgcaggctgaggtccctttgtcagactgtgtccaaaaagcaagtaaacccact
tcaagcacacaaatcatggtgaagaccaacatgtatcataatgaaaaggtgaactttcat
gttgaatgtaaagactatgtaaaaaaggcaaaggtaaagatcaacccagtgcaacagagc
cggcccttgttgagccagattcacacagatgcagcaaaggagaacacctgctactgtggt
gcagtggcaaagagacaagagaaaaaagggatggagcctcttcaaggtcatgccactccc
gctttgccttttaaagaaacccaggaactattactaagtcccctgccccaggaaggtcct
gggtcacttgcagcaggagagagcagcagtctttctgccagtacatcagtctcagattca
tcccagaaaaaagaagagcacaattattctctttttgtctccgacaacttgggtgaacag
ccaactaaatgcagtcctgaagaagatgaggaggacgaggaggatgttgatgatgaggac
catgatgaaggattcggcagtgagcatgaactgtctgaaaatgaggaggaggaagaagag
gaagaggattatgaagatgacaaggatgatgatattagtgatactttctctgaaccaggc
tatgaaaatgattctgtagaagacctgaaggaggtgacttcaatatcttcacggaagaga
ggtaaaagaagatacttctgggagtatagtgaacaacttacaccatcacagcaagagagg
atgctgagaccatctgagtggaaccgagatactttgccaagtaatatgtatcagaaaaat
ggcttacatcatggaaaatatgcagtaaagaagtcacggagaactgatgtagaagacctg
actccaaatcctaaaaaactcctccagataggcaatgaacttcggaaactgaataaggtg
attagtgacctgactccagtcagtgagcttcccttaacagcccgaccaaggtcaaggaag
gaaaaaaataagctggcttccagagcttgtcggttaaagaagaaagcccagtatgaagct
aataaagtgaaattatggggcctcaacacagaatatgataatttattgtttgtaatcaac
tccatcaagcaagagattgtaaaccgggtacagaatccaagagatgagagaggacccaac
atggggcagaagcttgaaatcctcattaaagatactctcggtctaccagttgctgggcaa
acctcagaatttgttaaccaagtgttagagaagactgcagaagggaatcccactggaggc
cttgtaggattaaggataccaacatcaaaggtgtaa
>gi568815593f:173044546_173263918|GENSCAN_predicted_peptide_3|194_aa
MAAPQDVHVRICNQEIVKFDLEVKALIQDIRDCSGPLSALTELNTKVKEKFQQLRHRIQD
LEQLAKEQDKESEKQLLLQEVENHKKQMLRKTTKESLAQTSSTITESLMGISRMMAQQVQ
QSEEAMQSLVTSSRTILDANEEFKSMSGTIQLGRKLITKYNRRELTDKLLIFLALALFLA
TVLYIVKKRLFPFL
>gi568815593f:173044546_173263918|GENSCAN_predicted_CDS_3|585_bp
atggcggctccccaagacgtccacgtccggatctgtaaccaagagattgtcaaatttgac
ctggaggtgaaggcgcttattcaggatatccgtgattgttcaggacccttaagtgctctt
actgaactgaatactaaagtaaaagagaaatttcaacagttgcgtcacagaatacaggac
ctggagcagttggctaaagagcaagacaaagaatcagagaaacaacttctactccaggaa
gtggagaatcacaaaaagcagatgctcaggaaaaccaccaaagagagcctggcccagaca
tccagtaccatcactgagagcctcatggggatcagcaggatgatggcccagcaggtccag
cagagcgaggaggccatgcagtctctagtcacttcttcacgaacgatcctggatgcaaat
gaagaatttaagtccatgtcgggcaccatccagctgggccggaagcttatcacaaaatac
aatcgccgggagctgacggacaagcttctcatcttccttgcgctagccctgtttcttgct
acggtcctctatattgtgaaaaagcggctctttccatttttgtga
>gi568815593f:173044546_173263918|GENSCAN_predicted_peptide_4|150_aa
MDTDVDLHSIGGMSKDLQPRFKTTTGGHWRILKEAMACASEREGLTQGTNLNEESPAALG
PVRSCQNDPSKAQGGPSHQWPFHMDLLQMTSLAEGPGGRDMTPFSQPSSQSVRNWVALQE
VSRQRALPPELYPLSDQQRHEILIGARTLL
>gi568815593f:173044546_173263918|GENSCAN_predicted_CDS_4|453_bp
atggacacagacgtagacctgcactccataggaggaatgtcaaaggatttgcagccacgt
tttaaaaccaccacaggaggccactggagaattctcaaggaggccatggcatgtgcctct
gaaagagaggggctgacgcagggcaccaacttgaatgaggaaagtccggctgcattgggc
ccagtcagaagctgccagaatgacccaagcaaggcacagggaggcccaagccaccagtgg
cctttccacatggacttgctgcaaatgaccagtctggctgagggccctgggggccgggac
atgacacccttctctcagccctcatcacagtctgtgaggaactgggtggccctgcaggag
gtcagccgccagcgagcattaccgcctgagctctacccactgtcggatcagcagcggcat
gagattctcataggagcgagaaccctattgtga
>gi568815593f:173044546_173263918|GENSCAN_predicted_peptide_5|73_aa
MEQLEEDLRREFKTEGAARAKALRNRRGFVLSCASEERHPLRGHWMPHAFDVKVPFENGP
TLYQQVPETPPGY
>gi568815593f:173044546_173263918|GENSCAN_predicted_CDS_5|222_bp
atggagcagctggaggaagatctgagaagagagtttaagacagagggagcagctcgtgca
aaggccctgagaaacaggagaggtttcgtcctgagctgtgcaagtgaggagcgtcaccct
ctgagagggcactggatgccacacgctttcgatgtcaaggtcccttttgaaaatggccct
actctgtaccaacaagttccagagacaccaccaggctactaa
>gi568815593f:173044546_173263918|GENSCAN_predicted_peptide_6|60_aa
MPEPAPDAVGSCAAPASPTSAAPCSTAPGPIDCPRAEECRRKHHGTGRQLRLRPGSRTTE
>gi568815593f:173044546_173263918|GENSCAN_predicted_CDS_6|183_bp
atgcctgagcctgcccctgacgccgtgggctcctgcgcggccccagcctccccgacgagc
gccgccccctgctccacggcgcccggtcccatcgactgcccaagggctgaggagtgccgg
cgcaagcaccacgggactggcaggcagctccgcctgcggcccgggtccaggaccactgag
tga
>gi568815593f:173044546_173263918|GENSCAN_predicted_peptide_7|759_aa
XKLPNQPPTWVTPADKATPLQGWCLHGNMKTQGYPCVPEKHQTRGHGEAPRAAKEREAEK
KWAGEPLSGAPRYGGRSSATCCPDTSRAGRRVRGRAAAPCREAARGRGQRRFLPPTWRCE
TGAATMFPSPALTPTPFSVKDILNLEQQQRSLAAAGELSARLEATLAPSSCMLAAFKPEA
YAGPEAAAPGLPELRAELGRAPSPAKCASAFPAAPAFYPRAYSDPDPAKDPRAEKKELCA
LQKAVELEKTEADNAERPRARRRRKPRVLFSQAQVYELERRFKQQRYLSAPERDQLASVL
KLTSTQVKIWFQNRRYKCKRQRQDQTLELVGLPPPPPPPARRIAVPVLVRDGKPCLGDSA
PYAPAYGVGLNPYGYNAYPAYPGYGGAACSPGYSCTAAYPAGPSPAQPATAAANNNFVNF
GVGDLNAVQSPGIPQSNSGVSTLHACFQMEKPRTQALGKRSQAPQMPPDATSGPHQLPVG
LSAEAGPQRGVLRMVLTSSCDRETEEVTWVVKIIQSLSGNGSWETHRTHTGESVPEARAR
HTVDVSPVKMQTEWLQAALSRTQTLASGDARKPQVSPLTRKHHAVPRGSSTVFPAAPAPR
ILREAFRAQAQRFYAAGNSLVASKPCRLPYQAELLEKGACGLPLIYHSQSLSRARHCSEL
FTSEPIESLFIPLGYREETDEGFILNPERRSTLPRVTQLVNSEDKRALAKLVEAIRTNYN
DKYDEIHRHWGGSVLGPKSVVHIAKLEKAKAKELATKLG
>gi568815593f:173044546_173263918|GENSCAN_predicted_CDS_7|2280_bp
nngaaattaccaaatcagcctccaacctgggtaacaccagctgataaagcgactcctttg
caaggctggtgtctgcatgggaacatgaaaacccaaggctatccctgtgtgccggaaaag
catcagaccagaggccatggagaagccccacgagcagcgaaggaaagagaagcagaaaaa
aagtgggcaggtgaacctttgtcaggggcaccccgctacggaggaaggtcaagcgctacc
tgctgcccggacacatccagagctggccgacgggtgcgcgggcgggcggcggcaccatgc
agggaagctgccaggggccgtgggcagcgccgctttctgccgcccacctggcgctgtgag
actggcgctgccaccatgttccccagccctgctctcacgcccacgcccttctcagtcaaa
gacatcctaaacctggaacagcagcagcgcagcctggctgccgccggagagctctctgcc
cgcctggaggcgaccctggcgccctcctcctgcatgctggccgccttcaagccagaggcc
tacgctgggcccgaggcggctgcgccgggcctcccagagctgcgcgcagagctgggccgc
gcgccttcaccggccaagtgtgcgtctgcctttcccgccgcccccgccttctatccacgt
gcctacagcgaccccgacccagccaaggaccctagagccgaaaagaaagagctgtgcgcg
ctgcagaaggcggtggagctggagaagacagaggcggacaacgcggagcggccccgggcg
cgacggcggaggaagccgcgcgtgctcttctcgcaggcgcaggtctatgagctggagcgg
cgcttcaagcagcagcggtacctgtcggcccccgaacgcgaccagctggccagcgtgctg
aaactcacgtccacgcaggtcaagatctggttccagaaccggcgctacaagtgcaagcgg
cagcggcaggaccagactctggagctggtggggctgcccccgccgccgccgccgcctgcc
cgcaggatcgcggtgccagtgctggtgcgcgatggcaagccatgcctaggggactcggcg
ccctacgcgcctgcctacggcgtgggcctcaatccctacggttataacgcctaccccgcc
tatccgggttacggcggcgcggcctgcagccctggctacagctgcactgccgcttacccc
gccgggccttccccagcgcagccggccactgccgccgccaacaacaacttcgtgaacttc
ggcgtcggggacttgaatgcggttcagagccccgggattccgcagagcaactcgggagtg
tccacgctgcatgcctgtttccagatggaaaagccaagaacccaagcccttggcaagcgt
tctcaggctcctcagatgcccccagatgccacgtcggggcctcatcagctgcccgtggga
ctgagtgccgaggctggaccccagagaggtgtcctgcggatggtgctcacctccagctgt
gaccgggaaactgaggaggttacgtgggtggtcaagatcatccagtctctaagcgggaac
gggagctgggaaacacacaggacacatactggggagagtgttcctgaagccagagcccgc
cacactgtggatgtctctccagtaaagatgcagacagaatggctccaggccgcgctgagc
agaacccagactcttgccagtggggacgcgcggaagccgcaggtttcaccgctgactcgg
aaacatcacgcggtcccgcgcgggagtagcaccgtcttccccgcagcgcccgcccctcgc
atcctccgggaagcattccgagctcaggcccagcgcttctacgccgcaggcaacagcctt
gtggcctctaagccttgccgactcccctaccaggctgagctcctggagaaaggggcctgt
ggcctgcctttaatttatcactcacagagtttatcgcgtgccaggcactgttctgagctc
tttacaagtgaacccattgaatccctttttatacctctgggttatcgtgaggaaacagat
gaaggattcattttaaacccagaaagaaggagtacgttgcccagggtcacacagctggtt
aactcagaagacaaaagagctttggctaagctggtggaagctatcaggaccaattacaac
gacaaatatgatgagatccaccgtcactggggaggcagtgtcctgggtcccaagtctgtg
gttcacatcgccaagcttgaaaaggcaaaggctaaagaacttgccaccaaactgggttaa