GENSCAN 1.0    Date run: 4-Nov-116    Time: 20:44:57

Sequence gi568815580r:36696808_36928767 : 231960 bp : 40.23% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  12288  12584  297  1  0   67   59   270 0.668  17.92
 1.02 Intr +  21025  21908  884  2  2  107   79   808 0.100  71.91
 1.03 Intr +  23871  24125  255  2  0   26   57   174 0.004   4.82
 1.04 Intr +  28710  28895  186  2  0   90   29    87 0.165   1.96
 1.05 Intr +  33161  33331  171  1  0  122  100    21 0.962   5.92
 1.06 Intr +  33839  33997  159  1  0   34   78   220 0.996  14.86
 1.07 Intr +  35783  35859   77  1  2   12   64   129 0.402   0.29
 1.08 Intr +  40671  40794  124  0  1   55   36    97 0.109   0.87
 1.09 Intr +  43849  44031  183  0  0   49  121   175 0.309  15.96
 1.10 Intr +  45930  46049  120  2  0   43  101   103 0.976   6.87
 1.11 Intr +  47225  47386  162  1  0   47   95   123 0.516   8.15
 1.12 Intr +  50138  50328  191  1  2   87  111   106 0.489  10.26
 1.13 Intr +  51260  51345   86  2  2   80   20    38 0.550  -5.26
 1.14 Intr +  51799  52001  203  2  2   44   34   155 0.437   3.78
 1.15 Intr +  56416  56495   80  0  2   85   64    68 0.415   1.53
 1.16 Intr +  58312  58504  193  1  1   93   95   109 0.961  10.67
 1.17 Intr +  60964  61085  122  0  2   30   64   109 0.353   1.07
 1.18 Intr +  61933  62077  145  1  1   18   67   132 0.460   3.46
 1.19 Intr +  63801  63975  175  2  1   92   78   247 0.985  22.69
 1.20 Intr +  72458  72619  162  0  0   82   70   304 0.677  27.03
 1.21 Term +  76848  76915   68  1  2  124   46    57 0.273   2.22
 1.22 PlyA +  81186  81191    6                               1.05

 2.13 PlyA -  81275  81270    6                               1.05
 2.12 Term -  82634  82388  247  1  1   64   44   157 0.214   3.18
 2.11 Intr - 100243 100002  242  1  2   98   55   104 0.043   3.53
 2.10 Intr - 101802 101642  161  1  2   83  116   152 0.996  16.29
 2.09 Intr - 103504 103391  114  2  0   46  110    94 0.990   6.90
 2.08 Intr - 108695 108567  129  0  0   76   97    85 0.963   7.95
 2.07 Intr - 111127 111040   88  1  1   58  101   102 0.902   7.12
 2.06 Intr - 114168 114105   64  2  1  114  113    37 0.626   6.60
 2.05 Intr - 120320 120244   77  2  2   86   40    54 0.414  -2.21
 2.04 Intr - 122166 122087   80  1  2   94  108    80 0.486   8.95
 2.03 Intr - 124322 124205  118  2  1   46   57    82 0.456   0.02
 2.02 Intr - 127064 126996   69  2  0   63   96    70 0.478   3.76
 2.01 Init - 133339 133094  246  1  0   60   50   189 0.801   9.94
 2.00 Prom - 134321 134282   40                              -7.05

 3.00 Prom + 135207 135246   40                              -8.85
 3.01 Init + 136801 136876   76  0  1   90   56    24 0.749   0.70
 3.02 Intr + 138427 138569  143  2  2   26   92   172 0.796  10.55
 3.03 Term + 175147 175305  159  0  0   82   37    94 0.041   0.66
 3.04 PlyA + 175735 175740    6                               1.05

 4.02 PlyA - 176890 176885    6                               1.05
 4.01 Sngl - 179133 178744  390  0  0   67   43   161 0.848   5.47
 4.00 Prom - 180573 180534   40                              -6.15

 5.02 PlyA - 180764 180759    6                               1.05
 5.01 Sngl - 182336 182034  303  2  0   68   54   283 0.809  18.48
 5.00 Prom - 223377 223338   40                              -3.35

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  24010  23807  204  1  0   63   43   251 0.904  13.04
S.002 Term - 100243  99998  246  1  0   98   53   120 0.885   4.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580r:36696808_36928767|GENSCAN_predicted_peptide_1|1347_aa
XSAGDPEPESEAEPEAEAGAGQVADEAGQDIASAHEGAETEVEQALEQEPEERASLSEKE
RQNEGVNERDNCSASSVSSSSSTLEREEKEDKLSRDRTTGLWPAGVQDAGVNGQCGDILT
NKRFMLDMLYAHNRKSPDDEEKGDGEAGRTQQEAEAVASLATRISTLQANSQTQDESVRR
VDVGCLDNRGSVKAFAEKFNSGDLGRGSISPDAEPNDKVPETAPVQPKTESDYIWDQLMA
NPRELRIQDMDFTDLGEEDDIDVLDVDLGHREAPGPPPPPPPTFLGLPPPPPPPLLDSIP
PPPVPGNLLVPPPPVFNAPQGLGWSQVPRGQPTFTKKKKTIRLFWNEVRPFDWPCKNNRR
CREFLWSKLEPIKVDTSRLEHLFESKSKELSVSKNIPPPPPSPPPPPSLPPSSSPPPSSP
PAPSPPPPLPPPLPPPLPPLIAHLAGCTSRLFVVPDQVAGKKLPLDSVLGGIKWSKSMEG
SASWTVTENVLRLQAAGWYHTQVSKGISWLWCTCAVLTLGHSHISLGSCFPLAFSLPEKE
HASKLVRPLFILTSACPFLSFPWKAPLHSSTRQLHRGLSAQERWRCSGIRGMECLNGKKT
AADGKRQEIIVLDSKRSNAINIGLTVLPPPRTIKIAILNFDEYALNKEGIEGPLESRNPA
YRKSEGTRQTGAGDMKEWDTCQVVGKQGTLPPPISPDAQTPPEASVIGALVKGKLGREKI
LTMIPTDEEKQKIQEAQLANPEIPLGSAEQFLLTLSSISELSARLHLWAFKMDYETTEKE
VAEPLLDLKEGIDQLENNKTLGFILSTLLAIGNFLNGTNAKAFELSYLEKVPEVKDTVHK
QSLLHHVCTMVVENFPDSSDLYSEIGAITRSAKVDFDQLQDNLCQMERRCKASWDHLKAI
AKHEMKPVLKQRMSEFLKDCAERIIILKIVHRRIINRPLRKWQVVFKVGRDKNKKQNLPS
RLTTIPGHPDDGCSRLTCPHPTGDDQEEPEVHCGFGVGSKRGVAKGRWYPEVASPKYLNR
RTLCSNEVFSWSAALQALSSLASVSLQGSCTLHMVSGALRFHSFLLFMGHPPYAIREVNI
NKFCRIISEFALEYRTTRERVLQQKQKRANHRERNKTRGKMITDARITEQLIPHPVYKVG
PRANGDIDKATRGTYSLSAAFTLVRMWPAKPYGSQSRTLSARCTAASDVGGGQGIELADC
YLRDSPRRLARSRSGKFSGSSPAPPSQPQGLSYAEDAAEHENMKAVLKTSSPSVEDATPA
LGVRTRSRASRGSTSSWTMGTDDSPNVTDDAADEIMDRIVKSATQVPSQRVVPRERKRSR
ANRKSCCVSLLPKGQVADGNSLLPEGA

>gi568815580r:36696808_36928767|GENSCAN_predicted_CDS_1|4044_bp
ncaagtgccggggatcctgaacccgaatcagaggcagaaccggaagcagaggcaggggcg
gggcaggttgctgatgaagctggccaggacatagcctctgcccacgagggtgcagagact
gaagtggagcaggcactagagcaagagccggaagaaagagcctccctcagtgaaaaagag
aggcagaacgagggggtgaacgagagggacaactgctctgcctccagcgtctcgtcctcc
agcagcacgttggagagggaggagaaggaggacaagctctccagggacaggacaactggt
ttgtggcccgcaggtgtccaggatgcaggtgtaaatggacagtgtggcgacatcctcacc
aacaaacggttcatgcttgacatgctgtatgcccataacaggaagtctccggatgatgag
gagaagggggatggggaggctgggaggacccagcaggaggcagaggcggtagccagcctt
gctaccaggatatccaccctgcaggccaactctcagacccaggatgagagtgtcaggagg
gtggatgtcggctgtttggacaatcggggcagtgtgaaagcatttgctgagaaattcaac
agtggggacctggggagaggttccatctcccctgatgctgagcccaatgacaaggtccca
gaaacagcgccggtgcagccgaagacagagtctgattacatctgggaccagctcatggcc
aatccaagagagctcagaatccaagacatggatttcactgacctgggggaggaggatgac
attgatgtcctagatgtggacctgggtcacagggaggcccctgggccacctcccccaccc
ccacccacctttctgggtttgccgcccccaccccctccgcccctgttggacagcattcct
ccccctcctgtccctggtaatttattggttcctcctcctccagtgttcaacgctcctcag
ggcttagggtggtcccaggtacccaggggtcagcccacattcactaagaaaaagaagacc
atccgtttgttctggaatgaagttcggccttttgactggccatgtaaaaacaaccgacgc
tgcagagaattcctgtggtcaaaactggaacccattaaggtggacacttccagactggag
cacctgtttgagtctaaatccaaggaactgtctgtctcaaagaatattcctcctccccct
ccttctcctcctcctcctccttctcttcctccttcttcttctcctcctccttcttctcct
cctgctccttctcctcctcctcctcttcctcctcctcttcctcctcctcttcctcccctc
attgcccacttggctggctgcacctctaggcttttcgtggttcccgatcaggttgctggg
aagaagctgccccttgactcagtattgggaggaataaaatggagtaaaagtatggagggt
tcagcatcatggaccgtcactgagaatgtcttgagactccaggcagcagggtggtatcat
acccaagtcagcaaggggatctcctggctgtggtgtacttgtgccgtgctgacacttggc
cattcccacatcagcctgggttcctgctttcccctggccttttcccttcctgagaaggaa
catgcctctaagctggttcgtcccctattcatcctcaccagtgcatgccctttcctttct
tttccttggaaggctcctcttcattcatctaccaggcagctgcacagaggcctaagtgca
caggaaaggtggaggtgctctggtataaggggcatggaatgtttgaatggcaagaaaact
gctgcagatggaaaaaggcaagagatcattgttctggattccaagaggagtaacgccatc
aatattggtctgacggtgctgccccctccaaggacgattaagatcgccattttgaatttt
gatgagtatgccttaaacaaagaaggaatcgagggcccactcgaatccaggaatccagcc
tatcgaaagagtgagggcacacggcagaccggagcaggggacatgaaggagtgggacacg
tgccaggttgtgggcaagcaaggaacactgccaccacctatctctccagatgctcaaact
ccacctgaagcctcagtgattggagccctggtgaagggaaagttgggaagggagaaaatt
ctaacgatgattcccaccgatgaggagaagcagaaaatccaggaagctcagctggccaac
cctgaaatccccctgggcagtgcagagcagttcctcctcaccctgtcctccatcagcgag
ctctctgcacgacttcacctctgggcattcaaaatggattatgaaactacagaaaaggaa
gtagcagaaccactcctggacctgaaggaaggaatagaccagttggagaacaataaaacc
ttgggctttatcctgtctactctcttagccattgggaactttctaaatggaactaatgcc
aaagcgtttgagttaagctacctcgagaaggttccagaagtcaaagacacagtgcacaag
cagtcgcttctccaccatgtgtgcaccatggtggtagaaaacttcccagacagctccgat
ctgtactcggagatcggggccatcaccaggtcagccaaggttgactttgatcaacttcag
gataatttatgtcagatggagagaagatgcaaagcttcatgggatcacctcaaggcaatt
gcaaaacatgaaatgaaaccagttttaaaacaacggatgtcagagttcctgaaagactgt
gcagagcgaattataattttaaagattgtccatagaaggataatcaacagaccactgaga
aaatggcaagtagtttttaaagttggacgggacaaaaataagaagcagaatcttccctcc
aggttgacaactatccctggtcacccagatgatggctgctcccgcctcacctgtccacat
cctactggggatgaccaggaggagccagaagttcactgtggttttggtgtgggtagtaaa
aggggcgtggctaaagggaggtggtacccggaagtggccagtccaaagtatctaaaccgt
agaaccctgtgttcaaatgaagtctttagctggagcgctgccctgcaggctctcagttcc
ttggcctctgtgtccctccagggctcctgcacactacacatggtgtctggggctctgaga
ttccactcctttttactctttatgggccatccaccttatgcaattcgggaagtgaacata
aacaaattctgcaggattattagtgaatttgcactagagtatcgcacaaccagggaaagg
gttttgcagcagaaacagaaacgggccaaccacagagagagaaataagaccagagggaag
atgatcaccgatgccaggatcactgagcaactaattcctcatcctgtgtacaaagttggt
cctagagcaaatggtgacatagacaaggcaacacgaggcacctacagcctttctgccgca
ttcacactcgtgaggatgtggccagccaagccatatggttcacagagcaggacactctca
gcacgatgcacagctgcgagtgacgttggtggtggtcagggaatcgagttagctgactgt
tacctcagggacagccccagaagattggcccggagccgttctggcaagttctccggcagt
tctccggcgcccccaagccagccgcagggtctgagctatgcggaggacgcggctgagcac
gagaacatgaaggctgtgctgaaaacctcgtccccctccgtggaggacgccacccccgcg
ctgggcgtccgcacacgcagccgagcaagccgaggatccactagttcctggactatggga
actgatgactcgcccaatgtcacagatgatgcagctgatgagatcatggaccgcatcgtc
aagtcagccacccaagtgcccagtcagcgagtggtgccgagggagaggaaacgatcccgg
gccaaccggaaatcttgctgtgtctcactgctgcccaagggacaagttgctgatgggaat
tcgttgctgcctgaaggtgcttag

>gi568815580r:36696808_36928767|GENSCAN_predicted_peptide_2|544_aa
MPSDSAKGNSNASINAITNPSKHKKSCLCKHSVTQIAVPLSSNPSKEREVNYQDQSSSEK
SLPTPRKETLLSHKHLITAAELDCEDDEIPHLDYVTLNGKSKIIQGKALTGQLQTQEVGE
IDSPLNEKCYGHAGKEGIDSTHLWKSSPGVTEVTIIEKPPAERHMISSWEQPIIGKENSR
TVAMKLLLADPVNTCSCWEVASKSEGNNGVQDCPGVEKKNNCVMPEDVKNFYLMTNGFHM
TWSVKLDEHIIPLGSMAINSISKLTQLTQSSMYSLPNAPTLADLEDDTHEASDDQPEKPH
FDSRSVIFELDSCNGSGKVCLVYKSGKPALAEDTEIWFLDRALYWHFLTDTFTAYYRLLI
THLGLPQWQYAFTSYGISPQAKQWFSMYKPITYNTNLLTEETDSFVNKLDPSKVFKSKNK
IVIPKKKGPVQPAGGQKGPSGPSGPSTSSTSKSSSGSGNPTRNGGAGHTQRDEKKRKYEQ
QGTGQSRLQSPLLPRDFQRSQRVHTLYLPALAEVPCSLSARTKVWGYTGWPNNATGADLE
GREE

>gi568815580r:36696808_36928767|GENSCAN_predicted_CDS_2|1635_bp
atgcccagtgattctgccaaaggtaacagcaacgcctcaattaatgccatcaccaatcct
agcaaacacaaaaagagctgcctctgcaagcactcagtaacacagattgccgtccctctt
tcaagtaacccctctaaggaacgtgaagtaaattatcaggaccaaagctcaagcgagaag
agtctacctactccaaggaaagaaacactcctgtcacacaaacatctcatcactgctgct
gaactggactgtgaagatgatgagataccacaccttgattatgttacattaaatggcaaa
agcaagattatccagggcaaggcactcacaggccaactccaaactcaagaagttggagaa
atagactcacctctcaatgagaagtgctatgggcatgcagggaaggaaggaattgacagc
acccacctttggaaatcttccccaggtgtgactgaggtgaccatcatagaaaagcctcct
gctgaacgtcatatgatttcttcctgggaacaacccatcataggaaaagaaaattccaga
acagtggctatgaaactcttattggctgatcctgtcaacacctgttcctgttgggaagtg
gcctctaaaagtgaggggaataatggggtgcaggattgtccaggagtagaaaagaagaat
aactgtgtgatgcctgaagatgtgaagaacttttacctgatgaccaatggcttccacatg
acatggagtgtgaagctggatgagcacatcattccactgggaagcatggcaattaacagc
atctcaaaactgactcagctcacccagtcttccatgtattcacttcctaatgcacccact
ctggcagacctggaggacgatacacatgaagccagtgatgatcagccagagaagcctcac
tttgactctcgcagtgtgatatttgagctggattcatgcaatggcagtgggaaagtttgc
cttgtctacaaaagtgggaaaccagcattagcagaagacactgagatctggttcctggac
agagcgttatactggcattttctcacagacacctttactgcctattaccgcctgctcatc
acccacctgggcctgccccagtggcaatatgccttcaccagctatggcattagcccacag
gccaagcaatggttcagcatgtataaacctatcacctacaacacaaacctgctcacagaa
gagaccgactcctttgtgaataagctagatcccagcaaagtgtttaagagcaagaacaag
atcgtaatcccaaaaaagaaagggcctgtgcagcctgcaggtggccagaaagggccctca
ggaccctccggtccctccacttcctccacttctaaatcctcctctggctctggaaacccc
acccggaatggtggtgcaggtcacacccaaagggatgagaagaaaaggaagtatgagcag
caaggcacagggcagtcaagactccagtccccccttctccccagggacttccagagaagc
cagagggttcacactctgtaccttcctgctcttgctgaagtgccctgcagtctgtctgcc
aggacaaaagtttggggctacacaggctggcctaacaatgccacaggtgctgatttggag
ggccgagaggagtga

>gi568815580r:36696808_36928767|GENSCAN_predicted_peptide_3|125_aa
MHRSDREQLGAYCNNPGRGKVELDQGISAEGNVRSRHKLMSPKADVKLKTSRVTDASISM
ESLKGTGDSVDEQAPLGESPDQASILLDAWGKETVVAMEKKQSSWVLLLYLPCVTKNIVS
GRRAT

>gi568815580r:36696808_36928767|GENSCAN_predicted_CDS_3|378_bp
atgcacagaagtgacagagagcagttaggagcttattgcaataatcctgggagaggaaaa
gttgagttggatcaaggaatttctgctgaaggaaatgtcagatcaagacacaagctgatg
agtccaaaagctgatgttaaacttaagacttccagggtgactgatgcttcaatctccatg
gagtccttaaaaggcacaggagattcagtagatgaacaggccccacttggtgagagccct
gatcaagcatcaatcttgctcgatgcctgggggaaggaaacagtagttgccatggaaaag
aaacaaagttcctgggtccttctcctctatcttccctgtgtaacaaaaaacatagtttca
ggaagacgagcaacataa

>gi568815580r:36696808_36928767|GENSCAN_predicted_peptide_4|129_aa
MTKTTITSIDAEKAFDKIQHPFMLKTLNTLGTDGMYLKIIRAIYDKPIANIILIGEKLEA
FPLKTCTRQGCPLSPLLFNIVLEVLARAIRQEKEINGIRREEEEVKLSLFADDRIAYLLN
PHCLSPKTP

>gi568815580r:36696808_36928767|GENSCAN_predicted_CDS_4|390_bp
atgacaaaaaccacgattacctcaatagatgcagaaaaggccttcgataaaattcaacac
cctttcatgctaaaaacactcaatacactaggtaccgatggaatgtatctcaaaataata
agggctatttatgacaaacccatagccaatatcatactgattggggaaaagctggaagca
ttccctttgaaaacctgcacaagacaaggatgccctctctcaccactcctattcaacata
gtattggaagttctggccagggcaatcaggcaagagaaagaaataaatggtattcgaaga
gaagaagaggaagtcaaattgtctctgtttgcagatgacaggattgcatatttattaaac
ccccattgtctcagcccaaaaactccttaa

>gi568815580r:36696808_36928767|GENSCAN_predicted_peptide_5|100_aa
MRKNQHKKAENSKNQNASSPPKDHNSSPARQQNRTKNEFDELTEVGFRRWVITNSSELKE
HVLTQCKEAKNLDKRLQELLTRITSSEKNINDLMELKNTA

>gi568815580r:36696808_36928767|GENSCAN_predicted_CDS_5|303_bp
atgaggaaaaaccagcacaaaaaggctgaaaattccaaaaaccagaatgcctcttctcct
ccaaaggatcacaactcctcaccagcaaggcaacaaaaccggacaaagaatgagtttgac
gaattgacagaagtaggcttcagaaggtgggtaataacaaactcctctgagctaaaggag
catgttctaacccaatgcaaggaagctaagaaccttgataaaaggttacaggaactgcta
actagaataaccagctcagagaagaacataaatgacctgatggagctgaaaaacacagca
tga