GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:40:48 Sequence gi568815576f:40851399_41067887 : 216489 bp : 44.91% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 993 988 6 1.05 1.03 Term - 4382 4352 31 1 1 56 48 75 0.615 -2.37 1.02 Intr - 5231 5033 199 0 1 44 78 187 0.595 11.71 1.01 Init - 10596 9729 868 0 1 75 83 456 0.556 38.53 1.00 Prom - 12161 12122 40 -2.46 2.00 Prom + 21598 21637 40 -3.16 2.01 Init + 29838 29886 49 2 1 77 58 40 0.789 -0.99 2.02 Intr + 30372 30779 408 1 0 89 87 176 0.189 11.84 2.03 Intr + 34915 35117 203 2 2 53 80 216 0.152 16.30 2.04 Intr + 56189 56251 63 1 0 122 100 61 0.784 9.61 2.05 Intr + 57724 57837 114 0 0 78 102 54 0.986 6.44 2.06 Intr + 62841 62926 86 2 2 113 91 34 0.975 4.82 2.07 Intr + 70935 71115 181 0 1 96 75 116 0.975 10.97 2.08 Intr + 72964 73084 121 0 1 99 107 -10 0.724 2.07 2.09 Term + 74871 75037 167 1 2 95 43 182 0.979 12.48 2.10 PlyA + 76085 76090 6 1.05 3.00 Prom + 93610 93649 40 -2.46 3.01 Init + 100001 100078 78 1 0 93 53 150 0.814 13.06 3.02 Intr + 102157 102235 79 0 1 99 78 45 0.606 3.72 3.03 Intr + 112649 112719 71 0 2 113 83 -23 0.203 -1.40 3.04 Intr + 114939 115042 104 2 2 100 97 26 0.340 3.67 3.05 Term + 115129 115246 118 1 1 92 38 29 0.317 -3.69 3.06 PlyA + 115830 115835 6 1.05 4.06 PlyA - 117153 117148 6 1.05 4.05 Term - 126041 125843 199 1 1 116 47 107 0.674 6.27 4.04 Intr - 145207 145096 112 2 1 73 110 20 0.393 2.14 4.03 Intr - 151982 151945 38 1 2 105 86 49 0.352 4.31 4.02 Intr - 158290 158196 95 1 2 102 49 56 0.215 1.86 4.01 Init - 159598 159569 30 1 0 34 102 18 0.241 -2.33 4.00 Prom - 165077 165038 40 -4.96 5.00 Prom + 170210 170249 40 -2.06 5.01 Init + 178368 178415 48 2 0 65 97 29 0.308 2.45 5.02 Intr + 183123 183233 111 2 0 50 79 92 0.394 5.08 5.03 Term + 197977 198204 228 0 0 87 49 102 0.201 2.83 5.04 PlyA + 202728 202733 6 1.05 6.00 Prom + 207395 207434 40 -3.76 6.01 Sngl + 212654 213238 585 1 0 99 46 737 0.999 66.79 6.02 PlyA + 213286 213291 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:40851399_41067887|GENSCAN_predicted_peptide_1|365_aa MVDYYEVLGLQRYASPEDIKKAYHKVALKWHPDKNPENKEEAERKFKEVAEAYEVLSNDE KRDIYDKYGTEGLNGGGSHFDDECEYGFTFHKPDDVFKEIFHERDPFSFHFFEDSLEDLL NRPGSSYGNRNRDAGYFFSTASEYPIFEKFSSYDTGYTSQGSLGHEGLTSFSSLAFDNSG MDNYISVTTSDKIVNGRNINTKKIIESDQEREAEDNGELTFFLVNSVANEEGFAKECSWR TQSFNNYSPNSHSSKHVSQYTFVDNDEGGISWVTSNRDPPIFSAGVKEGAAPFCAVTPSQ RLGLEPGRSPPSFAHHLPTMDPRKVNELRAFVKMCKQDPSVLHTEEMRFLREWVERVLRY VQGVN >gi568815576f:40851399_41067887|GENSCAN_predicted_CDS_1|1098_bp atggtggattactatgaagttctaggactgcaaagatatgcttcacctgaggacattaaa aaagcttatcataaagtggcacttaaatggcaccctgataaaaatccagaaaataaagaa gaagcagagagaaaattcaaagaagtagctgaggcatacgaggtattatcaaatgatgag aaacgggacatttatgataaatatggcacagaaggattaaacggaggtggaagtcatttt gatgatgaatgtgagtacggcttcacattccataagccagatgatgtttttaaagaaatt tttcatgaaagggatccattttcttttcacttctttgaagactcgcttgaggacctgtta aatcgtccaggaagctcctatggaaacagaaacagagatgcaggatactttttctccact gccagtgaatatccaatttttgagaaattttcttcatatgatacaggatatacatcacag ggttcattggggcatgaaggccttacttctttctcttccctggcttttgataatagtggg atggacaactacatatctgttacaacttcagacaaaatcgttaatggcagaaatattaat acaaagaaaattattgaaagtgatcaagaaagagaagctgaagataatggagagttgaca ttttttcttgtaaatagtgtggccaatgaagagggctttgcaaaagaatgcagctggaga acacagtcattcaacaactattcaccaaattctcacagctccaaacatgtatctcaatat actttcgtggacaatgatgagggaggtatatcttgggttaccagcaacagagatccccct attttctcagcaggagtcaaagagggtgccgcccccttctgcgcggtcacgccgagccag cgcctgggcctggaaccgggccgtagcccccccagtttcgcccaccacctccctaccatg gacccccgcaaagtgaacgagcttcgggcctttgtgaaaatgtgtaagcaggatccgagc gttctgcacaccgaggaaatgcgcttcctgagggagtgggtggagagagtacttcgctac gtgcaaggagttaattga >gi568815576f:40851399_41067887|GENSCAN_predicted_peptide_2|463_aa MRFYHVGRAGLELLTSGEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLSNPT YYMSNDIPYTFHQDNNFLYLCGFQEPDSILVLQSLPGKQLPSHKAILFVPRRDPSRELWD GPRSGTDGAIALTGVDEAYTLEEFQHLLPKMKAETNMVWYDWMRPSHAQLHSDYMQPLTE AKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQIAGKLTSQAFIETMFTSKAPVEEAFLYA KFEFECRARGADILAYPPVVAGGNRSNTLHYVKNNQLIKDGEMVLLDGGCESSCYVSDIT RTWPVNGRFTAPQAELYEAVLEIQRDCLALCFPGTSLENIYSMMLTLIGQKLKDLGIMKN IKENNAFKAARKYCPHHVGHYLGMDVHDTPDMPRSLPLQPGMVITIEPGIYIPEDDKDAP EKFRGLGVRIEDDVVVTQDSPLILSADCPKEMNDIEQICSQAS >gi568815576f:40851399_41067887|GENSCAN_predicted_CDS_2|1392_bp atgaggttttaccatgttggccgggctggtcttgaactcctgacctcaggggaggtaact ccaggactatctcaggtggaatatgcacttcgcagacacaaactaatgtctctgatccag aaggaagctcaagggcagagtgggacagaccagacagtggttgtgctctccaaccctaca tactacatgagcaacgatattccctatactttccaccaagacaacaatttcctgtaccta tgtggattccaagagcctgatagcattcttgtccttcagagcctccctggcaaacaatta ccatcacacaaagccatactttttgtgcctcggcgagatcccagtcgagaactttgggat ggtccgcgatctggcactgatggagcaatagctctaactggagtagacgaagcctatacg ctagaagaatttcaacatcttctaccaaaaatgaaagctgagacgaacatggtttggtat gactggatgaggccctcacatgcacagcttcactctgactatatgcagcccctgactgag gccaaagccaagagcaagaacaaggttcggggtgttcagcagctgatacagcgcctccgg ctgatcaagtctcctgcagaaattgaacgaatgcagattgctgggaagctgacatcacag gctttcatagaaaccatgttcaccagtaaagcccctgtggaagaagcctttctttatgct aagtttgaatttgaatgccgggctcgtggcgcagacattttagcctatccacctgtggtg gctggtggtaatcggtcaaacactttgcactatgtgaaaaataatcaactcatcaaggat ggggaaatggtgcttctggatggaggttgtgagtcttcctgctatgtgagtgacatcaca cgtacgtggccagtcaatggcaggttcaccgcacctcaggcagaactctatgaagccgtt ctagagatccaaagagattgtttggccctctgcttccctgggacaagcttggagaacatc tacagcatgatgctgaccctgataggacagaagcttaaagacttggggatcatgaagaac attaaggaaaataatgccttcaaggctgctcgaaaatactgtcctcatcatgttggccac tacctcgggatggatgtccatgacactccagacatgccccgttccctccctctgcagcct gggatggtaatcacaattgagcccggcatttatattccagaggatgacaaagatgcccca gagaagtttcggggtcttggtgtacgaattgaggatgatgtagtggtgactcaggactca cctctcatcctttctgcagactgtcccaaagagatgaatgacattgaacagatatgcagc caggcttcttga >gi568815576f:40851399_41067887|GENSCAN_predicted_peptide_3|149_aa MAAAMDVDTPSGTNSGAGKKRFEVKKWNAVALWAWDIVVDNCAICRNHIMDLCIECQANQ ASATSEECTVAWGVCNRVVCSDLVGLYQDTVLLQSAQWVRWEREKLSNTLRELYKSQLKR DLPKGCHSKRIFVLLGIPSVLAVDSTLSY >gi568815576f:40851399_41067887|GENSCAN_predicted_CDS_3|450_bp atggcggcagcgatggatgtggataccccgagcggcaccaacagcggcgcgggcaagaag cgctttgaagtgaaaaagtggaatgcagtagccctctgggcctgggatattgtggttgat aactgtgccatctgcaggaaccacattatggatctttgcatagaatgtcaagctaaccag gcgtccgctacttcagaagagtgtactgtcgcatggggagtctgtaaccgcgtggtatgc tctgacttagttgggctctaccaggacaccgtactgctgcaaagtgctcagtgggtgagg tgggagagggagaagctgtcgaatacccttcgtgagttatacaagagtcagctgaaaagg gatcttccaaaaggctgtcactctaagagaatatttgtcctcttgggtatccctagtgta ctagctgtggattccactcttagttactga >gi568815576f:40851399_41067887|GENSCAN_predicted_peptide_4|157_aa MPLKDSVLQKPLRGQLGPEPVPTPMASLFQACLPGEVQDAAGFSTWYKKSGPEKDLFTWL FESWLLVNKLVLNPALHTTVFPCLGFARGPSRAPTTLEQVIVCPGDLVWDSPPMSDDCLH QVSQLACHIPKEDFPAPLLEVATQFYSMTPQPSSLST >gi568815576f:40851399_41067887|GENSCAN_predicted_CDS_4|474_bp atgcctctgaaagacagtgtcctgcaaaagcctttgcgaggtcagcttggccctgagcct gtgcccacacccatggcctccctctttcaggcctgtttacctggtgaggtgcaggatgct gcagggttctccacctggtacaagaagagtggcccagagaaagacctttttacttggctc ttcgaaagctggctcttggtcaacaaacttgtcctcaaccccgcccttcacaccacagtc ttcccctgcctggggtttgctcgggggcccagcagagctccaaccacactggagcaagtc atcgtctgccccggggacttggtctgggactccccacctatgtcagatgactgtcttcat caagtctcacaacttgcctgtcatatccccaaagaggacttccctgcccctctgttagaa gtggccacacagttttactctatgacacctcaaccctcttcactgtctacgtga >gi568815576f:40851399_41067887|GENSCAN_predicted_peptide_5|128_aa MLTFYQTSSFKVMGYKHFLTARVQAEHVPVARESPAGFESWCMQQEALGYTGTPTHHGEE QPLTMPTCLVSPVIYGPVGPSCWPWTQQLLKVKQALCEGCKSQDGPSAVVPSWGQGGDHV ASLKTNAS >gi568815576f:40851399_41067887|GENSCAN_predicted_CDS_5|387_bp atgctaacattctatcagaccagcagcttcaaagtgatgggctataagcacttccttact gcccgtgttcaggctgagcatgttcctgtggccagagaaagccctgcaggattcgagtcc tggtgcatgcagcaggaagccttggggtacacaggaacacccacccaccatggcgaggag cagccgctgactatgccgacctgtttagtgtcgccagtgatttatggcccagtggggccc agctgttggccgtggacacagcagctgctcaaggttaagcaggccctgtgtgagggatgc aagtcacaggatggacccagtgcagttgttccctcttggggtcaagggggcgatcatgtt gccagtctgaaaaccaacgcaagctaa >gi568815576f:40851399_41067887|GENSCAN_predicted_peptide_6|194_aa MPVARSWVCRKTYVTPRRSFEKSRLDQELKLIEEYGLRNKREVWRVKFTLAKIRKAAREL LTLDQKDPRRLFEGNALLWRLVCIGVLDEGKMKLDYILGLKIEDFLERRLQTQVFKLGLA KSIHHARVLIRQRHFRVRKQVVNIPSFIVRLDSQKHIDFSLCSPYGGGRPGRVKRKNAKK GQGGAGAGDHKEED >gi568815576f:40851399_41067887|GENSCAN_predicted_CDS_6|585_bp atgccagtggcccggagctgggtttgtcgcaaaacttacgtgaccccgcggagatccttt gagaaatctcgtctcgaccaagagctgaagctgatcgaagagtatgggctccggaataaa cgtgaggtctggagggtcaaatttaccctggccaagatccgcaaggccgcccgggaactg ctgacgcttgatcagaaggacccacggcgtctgttcgaaggcaatgccctgctgtggcgg ctggtctgcattggggtgctggatgagggcaagatgaagctggattacatcctgggcctg aagatagaggatttcttagagagacgcctacagacccaggtcttcaagctgggcttggcc aagtccatccaccacgctcgcgtgctgatccgccagcgccatttcagggtccgcaagcag gtggtgaacatcccgtccttcattgtccgcctggattcccagaagcacatcgacttctct ctgtgctctccctatgggggtggccgcccgggccgcgtgaagaggaagaatgccaagaag ggccagggtggggctggggctggagaccacaaggaggaggattaa