GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:43:13 Sequence gi568815589r:114803934_115030303 : 226370 bp : 41.41% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.10 Intr - 1934 1870 65 0 2 135 115 43 0.028 9.22 1.09 Intr - 2111 1957 155 1 2 53 1 198 0.083 6.39 1.08 Intr - 5522 5321 202 0 1 38 61 168 0.359 6.52 1.07 Intr - 7542 7444 99 1 0 68 95 91 0.700 6.96 1.06 Intr - 9808 9570 239 0 2 63 85 77 0.421 1.24 1.05 Intr - 13712 13639 74 2 2 99 66 76 0.716 3.79 1.04 Intr - 15303 15087 217 2 1 50 68 162 0.796 7.98 1.03 Intr - 24208 24116 93 0 0 88 73 54 0.266 2.06 1.02 Intr - 31919 31474 446 2 2 62 93 194 0.821 8.57 1.01 Init - 36022 35927 96 1 0 97 92 79 0.545 9.56 1.00 Prom - 40580 40541 40 -6.55 2.03 PlyA - 41306 41301 6 1.05 2.02 Term - 41721 41499 223 2 1 85 54 157 0.026 7.41 2.01 Init - 61709 61636 74 2 2 64 58 111 0.351 6.29 2.00 Prom - 62285 62246 40 -7.45 3.00 Prom + 63366 63405 40 -8.35 3.01 Sngl + 66786 67250 465 2 0 77 48 324 0.874 23.19 3.02 PlyA + 67670 67675 6 1.05 4.00 Prom + 67979 68018 40 -1.45 4.01 Init + 71696 71746 51 1 0 68 99 44 0.438 4.72 4.02 Term + 72128 72211 84 1 0 114 42 40 0.219 -1.33 4.03 PlyA + 74242 74247 6 1.05 5.07 PlyA - 74270 74265 6 1.05 5.06 Term - 79540 79430 111 1 0 59 47 104 0.366 0.98 5.05 Intr - 81165 81004 162 0 0 16 27 149 0.399 0.85 5.04 Intr - 84908 84783 126 2 0 110 74 44 0.787 5.16 5.03 Intr - 101568 101391 178 2 1 107 110 23 0.829 5.40 5.02 Intr - 101966 101895 72 1 0 102 86 42 0.801 3.00 5.01 Init - 103395 103256 140 0 2 47 47 211 0.689 12.56 5.00 Prom - 104271 104232 40 -5.15 6.07 PlyA - 104665 104660 6 -0.45 6.06 Term - 105390 105223 168 0 0 86 36 120 0.492 3.50 6.05 Intr - 111135 110864 272 1 2 14 66 157 0.394 2.44 6.04 Intr - 114205 114163 43 1 1 136 94 36 0.516 5.89 6.03 Intr - 116349 116256 94 2 1 79 92 30 0.425 1.55 6.02 Intr - 121135 121041 95 1 2 66 73 90 0.922 3.14 6.01 Init - 126337 126176 162 1 0 67 127 189 0.750 18.82 6.00 Prom - 126386 126347 40 -3.75 7.03 PlyA - 127522 127517 6 1.05 7.02 Term - 128787 128428 360 0 0 74 47 194 0.846 7.45 7.01 Init - 130150 130094 57 1 0 81 81 48 0.962 4.76 7.00 Prom - 139779 139740 40 -6.65 8.00 Prom + 139999 140038 40 -5.85 8.01 Init + 143322 143471 150 2 0 52 41 121 0.074 3.89 8.02 Intr + 156743 156923 181 1 1 46 30 184 0.106 6.92 8.03 Intr + 171264 172368 1105 1 1 46 53 327 0.008 12.98 8.04 Intr + 184890 184934 45 0 0 120 78 20 0.077 0.81 8.05 Intr + 185938 186210 273 1 0 138 50 219 0.713 18.83 8.06 Term + 196222 196501 280 1 1 32 48 157 0.242 0.03 8.07 PlyA + 197694 197699 6 1.05 9.07 PlyA - 199265 199260 6 1.05 9.06 Term - 217334 217224 111 2 0 143 32 87 0.990 6.18 9.05 Intr - 220203 220040 164 1 2 104 109 99 0.996 12.37 9.04 Intr - 221745 221626 120 1 0 52 32 102 0.583 0.55 9.03 Intr - 222762 222601 162 1 0 102 92 359 0.932 36.73 9.02 Intr - 225354 225199 156 1 0 113 -2 109 0.798 3.46 9.01 Intr - 225523 225427 97 1 1 65 23 111 0.643 0.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 184890 184924 35 2 2 120 52 31 0.824 -1.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:114803934_115030303|GENSCAN_predicted_peptide_1|562_aa MGDRGAMSSRAAGSCQRVHTNRAHQQGSRMPWVHSGWQQVNATLGQSFQRKEQAAIVDVS QPSLMIPPGTKNTEATRVWSRSPANHSHPTEEWPDCKKKKQTENNNIDKNDLTKTPFKGQ QPQKSKVDKPTNMWKIQHKNAENSKSQSTSSPPHDHNTSPARAQNWAEAEMAELTEVGFK RMGTAPTYKCGYEVDEIMYSSWHCELPTIIKIGENGNPERKSGLCLEEHLDGEEEDPANC KLAVWNLNSKDVSVSFLLFVQGELLGQLFSNLFQGERGQSVVPWPSTDTQCLLNEELSTN STSGSLIASPKHPASPLLKPYLLLFPHICPWTYLMAPAQDPVGVLALTPAAYTFQREQST VPATRNLGSTSAPALPLVSPFWVCFKIEGLCHCDDGATVKLLVPHTYLLLNICEGFFRVY ALVSKWVSQTSGCSQHSKQDDLFKVKSEPATSPPITFQRTPTPLTTTSLPIKLSEEEAKS YLDPGGHTRSQRCLQEQQEHGRGSGTELWGNSQCGNAARARQLQAQGQEQQRTLGSHLLP GLTTYLLVSQLRAQGEACVQFQ >gi568815589r:114803934_115030303|GENSCAN_predicted_CDS_1|1686_bp atgggtgacaggggagccatgagcagcagggcagcagggagctgccagcgagtgcacacc aacagggcacaccaacagggtagcaggatgccatgggtacattcaggctggcaacaggtc aatgctaccctgggacagagcttccagaggaaggaacaggctgccattgttgatgtttca cagccttcactgatgatacctccaggtacaaaaaacactgaggcaactagggtctggagc agatccccagcaaaccacagccaccctacagaagagtggcctgactgtaaaaagaaaaaa caaacagaaaacaacaacatcgacaaaaatgacctcacaaaaaccccattcaaaggtcaa caacctcaaaaatctaaggtagataagcccacaaatatgtggaagattcaacacaaaaat gctgaaaactcaaaaagccagagtacctcttctccaccacatgaccacaacacatctcca gcaagagcacagaactgggctgaggctgagatggctgaattgacagaagtaggcttcaaa agaatggggacagcacctacctacaagtgtggttatgaggttgatgagataatgtattcc agctggcattgtgaactgccaaccattataaaaataggagaaaatggaaatccagaaagg aaaagcggcttgtgtctcgaggaacatttggatggagaggaagaagaccctgcaaattgc aagctggctgtctggaaccttaacagtaaagatgtttctgtatccttcttgctctttgtt cagggagagttgctggggcaactattttctaacctctttcaaggggaaagaggccagtca gtggtgccctggccttccacagatactcagtgtctgttgaatgaagaactgagcaccaac tccacatcaggctctctgattgccagccctaaacaccctgctagccctcttctaaagcca tatctcctgctcttcccccacatttgcccctggacctatctcatggctcctgctcaagac ccggtaggcgtcttggctctgacccctgctgcctacaccttccagagagaacagagcaca gtcccagcaactaggaacctgggctcaacttcagcaccagccctgccacttgtgtcaccc ttctgggtctgtttcaaaattgaaggtttgtgccactgtgatgacggtgctacggttaag ctccttgttcctcatacatacctcctgttgaacatatgtgagggtttctttcgggtatat gccttggtctccaaatgggtctcccagacttcaggctgttctcaacacagcaaacaggat gatcttttcaaagtaaaatcagagcctgccacctctccgcccatcaccttccaaagaacc cctactccactcaccaccacctctcttcccatcaaactcagtgaagaagaagccaagtcc tacctggacccaggaggacacacacggagtcagaggtgcctccaggagcagcaggagcat ggccgaggatctgggactgagctttggggaaacagccagtgtggaaatgctgccagagca cggcagctgcaggcccaaggccaggagcagcagcgcacgctgggctctcacctgctgcct ggactcaccacatacctgcttgtcagccagctccgggcccagggagaggcctgtgtgcag ttccag >gi568815589r:114803934_115030303|GENSCAN_predicted_peptide_2|98_aa MSDGFLRSSPEGDAGAMLLVQPAEPIISIFHLWLFLQQLGLSGGHLELPDSAAAVVVVAP WTVKGSASGRRRKRAAVHVPIVPFFVFMCTQCLAPTYE >gi568815589r:114803934_115030303|GENSCAN_predicted_CDS_2|297_bp atgagtgacggcttcctgaggtcctcaccagaaggagatgctggtgccatgcttcttgta cagcctgcagaaccaattatcagcatcttccatctctggctgttcctgcagcaactgggt ctcagtggcggccatcttgaacttcctgactccgctgccgctgtggtggtggtggctccc tggaccgtgaaaggcagtgctagtggaagaagaaggaaaagagctgccgtacatgtgcct attgttcccttctttgtgttcatgtgtactcaatgtttagctcccacttatgaatga >gi568815589r:114803934_115030303|GENSCAN_predicted_peptide_3|154_aa MGRNQHKKAENSKNQNASSPPKDHNSSPARKQNWTENEFDKSTEVGFRRWIINSSKLKEC VLTQCKEAKNLEKRLDELLTRITSLEKNLNDLMELQNTAPELREAYTSINSQIDQEEERI SEIEDQLNEIKREDKIREKRMKGVNKASKKYGTM >gi568815589r:114803934_115030303|GENSCAN_predicted_CDS_3|465_bp atggggagaaaccagcacaaaaaggctgaaaattccaaaaaccagaatgcctcttctcct ccaaaggatcacaactcctcgccagcaaggaaacaaaactggacagagaatgagtttgac aaatcgacagaagtaggcttcagaaggtggataataaactcctccaagctaaaggagtgt gttctaacccaatgcaaggaagctaagaaccttgaaaaaaggttagacgaattgctaact agaataaccagtttagagaagaacctaaatgacctgatggagctgcaaaacacagcacca gaacttcgtgaagcatacacaagtatcaatagccaaatcgatcaagaggaagaaaggata tcagagattgaagatcaacttaatgaaataaagcgagaagacaagattagagaaaaaaga atgaaaggagtgaacaaagcctccaagaaatatgggactatgtga >gi568815589r:114803934_115030303|GENSCAN_predicted_peptide_4|44_aa MISFDFMSHIKVTLMQEDQHHIEAAKAWGLHPLKPWSELYCGSF >gi568815589r:114803934_115030303|GENSCAN_predicted_CDS_4|135_bp atgatctcctttgacttcatgtctcacatcaaggttacgctgatgcaagaggatcaacac cacatagaagcagccaaggcttggggcttacaccctctgaagccatggtctgagctgtac tgtggctccttttga >gi568815589r:114803934_115030303|GENSCAN_predicted_peptide_5|262_aa MHEDETKGMTADPLALTTKRALDQKLYVDAADKEENCPCFCPECLSQKLLRRPLMYPEKG SIQEVMGLPPSYLPISAKYTVLFLMQGRQALNLCLYEVIHKGTIKLPCPQIDSLMNMSAQ GRFAYCSVLQHPEKLSTPRLLQGGGGVTSAIQSCFFYLLSASFSNLKLEPGTKTPETSDQ QLPGVGKVDSSMCMQTAQPKGRIKGEVTQDPGSMPTYKAKVKHKTKPWKQGEEQPVCNFR QGVTKTKLKQLTMDTSKEFRRT >gi568815589r:114803934_115030303|GENSCAN_predicted_CDS_5|789_bp atgcatgaggatgaaaccaaggggatgacagctgacccactggctctcaccaccaagaga gctttggaccagaagctctatgtggatgcagcagataaggaggaaaactgcccctgcttc tgcccggagtgcctctcgcagaaattgctcagaagacctcttatgtatcctgaaaagggc tccattcaagaagtcatgggcctacctccaagctacctaccaatttcagcaaagtacaca gtccttttcctaatgcaagggaggcaggcacttaatctctgcctctatgaagttattcac aagggaaccataaaattgccttgtccccagattgacagcttaatgaacatgtcagcacag ggccgatttgcatattgcagtgtgcttcagcacccagagaaactctccacaccacggctg ctgcagggtgggggaggggtgacatcagcaattcagagctgttttttctatctcctgagt gcctctttcagcaatttgaagttagaaccaggtactaaaacacctgaaactagtgatcag cagcttcctggagttgggaaagtggactcaagcatgtgcatgcagacagcccaacccaag ggaagaatcaaaggagaagtgacgcaagaccccggaagtatgccaacatataaagccaaa gttaaacataaaaccaaaccttggaaacaaggtgaagaacagcctgtatgcaacttcaga cagggggttaccaaaacaaagctgaagcagttgaccatggacacctcaaaagaatttcgg agaacctga >gi568815589r:114803934_115030303|GENSCAN_predicted_peptide_6|277_aa MAPPGDTAMHVPAGSVASHLGTTSRSYFYLTTATLALCLVFTVATIMVLVVQRTPLEHGA FPVELIGVRHKKKLFGMPHDTSHYPSLMNLMIVVRHFNADIFLSAFAHHANQWSRLADSI PNSPDNVPLKGERKRAEARHLLGSPFSLAVENLPPSAENQQWLRAWDVELCAHAKEKTQD TLRYRGNTKNQTLLGNERISESYHKLPIPGQDSPQPTDTSQAPSEFQGHCRISAACNATI YKSRFSVFLTKKESYIIDSQGDEKTQATRLMTHEMFI >gi568815589r:114803934_115030303|GENSCAN_predicted_CDS_6|834_bp atggcccctcctggagacacagccatgcatgtgccggcgggctccgtggccagccacctg gggaccacgagccgcagctatttctatttgaccacagccactctggctctgtgccttgtc ttcacggtggccactattatggtgttggtcgttcagaggacgcctcttgaacatggtgcc ttccctgttgagctaataggagttcggcacaagaagaagttatttggaatgccccatgac acttcccactacccaagcttgatgaatttgatgattgtggtaagacatttcaatgcagat atttttcttagtgcatttgcacatcatgctaatcaatggtcacgcttggcagactccatt cccaactcacctgacaacgtccccctcaaaggagaaaggaaaagggcagaagctagacac cttctagggagccctttttcactggctgttgaaaatcttccaccatcagcagaaaatcag cagtggctcagagcctgggatgtagagctatgtgcacatgctaaagaaaaaacacaggat acacttcgctacagaggaaacaccaaaaatcaaacccttttgggtaatgaaaggatttct gagtcctatcacaagcttcccattcctggccaggatagcccccagccaactgacacctca caggcaccttcagaattccagggacattgcagaatttcagctgcatgtaatgccactatt tataagtcacgtttttctgtcttcctaaccaaaaaagaaagttatatcatagactcacag ggtgatgagaagactcaggccacacggctgatgacccatgaaatgttcatctag >gi568815589r:114803934_115030303|GENSCAN_predicted_peptide_7|138_aa MKHRLKVNPAYMTSSKKRKDIVLYAKRVTVRQTFFSTALLSPTGLQEDPNFKQSLFIFQH PQLPFNNQKFSTLIKEAEQEGQGSSWFSATLGRPQSPCYPLTVRAHQRTQPSPQISQVHA PENTGLPRSHLNYRTKRT >gi568815589r:114803934_115030303|GENSCAN_predicted_CDS_7|417_bp atgaagcatagattgaaggtcaaccctgcatatatgaccagcagcaagaaacgcaaggac attgtattgtatgcaaaacgggtaactgtaagacaaacatttttttccactgcccttctc agccccacaggccttcaggaagacccgaacttcaagcaaagcctttttatttttcaacac ccacagcttccattcaacaatcagaagttttccactttgatcaaagaggctgaacaagag ggccaagggagcagctggttctcagcaactctgggcagaccacagagcccttgctaccca ctcactgtccgtgcccaccagaggacacagccttctccccaaatcagccaggtacatgcc ccagaaaacactggcttgcctcgttcccaccttaattaccggaccaaacgaacgtga >gi568815589r:114803934_115030303|GENSCAN_predicted_peptide_8|677_aa MSEIVDQMINGRKWVLEERKNMYEGPPGWKELCMFKEHTEEGPTYRDKGQFIVVSSHVMA PVTQWPDFLVVLVPMATVDSKAVVLKQECASESPCYNRVLDPITSVSDAAEPVIGLFRDS TSSWFSLEGLFEENYKPLLNEINEDTNKWKNIPCSWIGRINIMKTAILPKVIYRFNAIPI KLPMTFFSELEKTTLKFIWKQKRARIAKSILSQKNKAGGITLPDFKLYYKPTVTKTAWYW YQNRDIDQWNRTQPSEIIPHIYNHMIFDKPDKNKKWGKDSLFNKWFWENWLAICGKLKLD PFLTPYTKINSRWIKDLNVRPKTMKTLEENLGNTIQDIGMGKDFMSKTPKAMATKAKIDK WDLIKLTSFCTAKETTIRVNRQPTEREKIFAIYSSDKGLISRIFKELKQTYKKKTNNPIK NWAKDMNRHFSKEDIYAAKRHMKKCSSSLAIREMQIKTTVRCHLTPVRMAIIKKSGNNRI SVLQNIQYLIGTALCIINQEFTVKAQEELSDVFSSGKYLVPQGAGGASRAPASRAVAPED FQCRPGNPTLAFDVALTVLNAQAAQIQNRGKACCFLSLTLETDQRPGLQIVSAFGDPVFI GSPLANTLRERTGNESSVLIPVFSSVLLRCKGTAPVVLLYHHHFGFSQNVCSEKAQSLAS VHFGLPGEFPVLLGLHG >gi568815589r:114803934_115030303|GENSCAN_predicted_CDS_8|2034_bp atgtcggaaattgtggaccagatgattaatgggaggaagtgggttttagaagaaagaaag aatatgtatgaaggccccccaggatggaaagaactttgtatgttcaaggaacatacagag gagggacctacatatagggacaaagggcagtttatagttgtttcttctcatgtaatggcc cctgtaacgcagtggccagatttcctcgttgtccttgtgcccatggccacagttgattct aaggcagttgttctcaaacaggagtgtgcatcagaatctccttgttacaatagagttctg gatcccatcaccagtgtgtctgacgcagcagagcctgttattggtctattcagagattca acttcttcctggtttagtcttgaaggactcttcgaggagaactacaaaccactgctcaat gaaataaacgaggacacaaacaaatggaagaacattccatgctcatggataggaagaatc aatatcatgaaaacagccatattgcccaaggtaatttatagattcaatgccatccccatc aagctaccgatgactttcttctcagaattggaaaaaactactttaaagttcatatggaag caaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatc acgctacctgacttcaaactatactacaagcctacagtaaccaaaacagcatggtactgg taccaaaacagagatatagaccaatggaacagaacacagccctcagaaataataccacac atctacaaccatatgatctttgacaaacctgacaaaaacaagaaatggggaaaggattcc ctatttaataaatggttctgggaaaactggctagccatatgtggaaagctgaaactggat cccttccttacaccttatacaaaaattaattcaagatggattaaagacttaaatgttaga cctaaaaccatgaaaaccctagaagaaaacctaggcaataccattcaggacataggcatg ggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaa tgggatctaattaaactaacaagcttctgcacagcaaaagaaactaccatcagagtgaac aggcaacctacagaacgggagaaaatttttgcaatctactcatctgacaaagggctaata tccagaatcttcaaagaactcaaacaaacttacaagaaaaaaacaaacaaccccatcaaa aattgggcaaaggatatgaacagacacttttcaaaagaagacatttatgcagccaaaaga cacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacagtg agatgccatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacaggatt tctgtgcttcagaatatccagtatttgataggtactgctttgtgcatcataaaccaagag tttacagtaaaagcacaggaagagctgagtgatgtgttctcttcaggaaagtacctggtc ccccagggagctgggggcgcctccagagcccctgcctcccgagctgtggctcctgaggac tttcagtgtagacctggcaacccaaccttagcctttgatgtggctctgacagtgctcaat gcacaagcagctcaaattcagaaccgtggaaaggcttgttgctttctgtcactgaccctg gagactgatcagaggcccggtctgcaaattgtcagcgcttttggagatccggtatttatt gggtcgccactagcaaacactctcagggaaaggactggcaatgaatcatctgtactaatc cctgtgttttcatcagtcttgcttcggtgcaaggggacagccccggttgttcttctatat catcatcactttggtttctctcaaaatgtttgctcagaaaaagcccagagtttggcctcg gttcactttggacttccaggggaatttcccgtcctccttggactccatgggtga >gi568815589r:114803934_115030303|GENSCAN_predicted_peptide_9|269_aa VFLRRKNGRENFYQNWKAYAAGFGDRREEFWLGTYGRERDEEEELLLVADTECQILNEKD YHLNALNHQNVNQKVSETDLNQFRGLDNLNKITAQGQYELRVDLRDHGETAFAVYDKFSV GDAKTRYKLKVEGYSGTAEEERSSEELSDLSKVKQLVEGGCNPGCRKLHPTMFSVTKKGD SMAYHNGRSFSTFDKDTDSAITNCALSYKGAFWYRNCHRVNLMGRYGDNNHSQGVNWFHW KGHEHSIQFAEMKLRPSNFRNLEGRRKRA >gi568815589r:114803934_115030303|GENSCAN_predicted_CDS_9|810_bp gtgttcctgagacgcaaaaacggacgcgagaacttctaccaaaactggaaggcatatgct gctggatttggggaccgcagagaagaattctggcttgggacctatgggagagaaagagat gaggaagaagaactacttctggtggcagacacagaatgtcaaattcttaatgaaaaagac tatcatttaaatgctctgaatcatcaaaatgtaaaccaaaaagtatctgagacagatctc aatcaatttagagggctggacaacctgaacaaaatcacagcccaggggcagtacgagctc cgggtggacctgcgggaccatggggagacagcctttgctgtctatgacaagttcagcgtg ggagatgccaagactcgctacaagctgaaggtggaggggtacagtgggacagcagaggag gaaagaagttcagaagagctcagtgatttgtccaaggtaaaacagcttgtagaaggtgga tgcaatcctggctgcagaaaactgcatccgacaatgttctcagtgacaaagaagggtgac tccatggcctaccacaatggcagatccttctccacctttgacaaggacacagattcagcc atcaccaactgtgctctgtcctacaaaggggctttctggtacaggaactgtcaccgtgtc aacctgatggggagatatggggacaataaccacagtcagggcgttaactggttccactgg aagggccacgaacactcaatccagtttgctgagatgaagctgagaccaagcaacttcaga aatcttgaaggcaggcgcaaacgggcataa