GENSCAN 1.0 Date run: 3-Nov-116 Time: 07:15:36 Sequence gi568815588r:104214411_104416051 : 201641 bp : 43.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 424 419 6 1.05 1.05 Term - 1133 986 148 1 1 52 55 63 0.031 -3.23 1.04 Intr - 11147 11051 97 0 1 55 111 33 0.064 1.37 1.03 Intr - 16433 16180 254 1 2 56 61 214 0.350 12.58 1.02 Intr - 18008 17900 109 0 1 34 86 98 0.634 3.54 1.01 Init - 23656 23575 82 1 1 35 103 45 0.293 1.83 1.00 Prom - 30280 30241 40 -3.66 2.00 Prom + 31477 31516 40 -6.96 2.01 Init + 31694 31916 223 1 1 76 33 185 0.712 10.78 2.02 Intr + 40753 40861 109 2 1 107 80 93 0.989 9.74 2.03 Intr + 45166 45388 223 1 1 106 76 287 0.969 27.43 2.04 Intr + 48569 48667 99 1 0 77 76 50 0.803 3.01 2.05 Intr + 52842 52921 80 2 2 141 38 15 0.077 0.15 2.06 Intr + 54295 54634 340 1 1 54 67 235 0.014 13.48 2.07 Intr + 60368 60539 172 1 1 52 113 33 0.303 1.72 2.08 Intr + 60816 60924 109 1 1 110 94 113 0.849 13.44 2.09 Intr + 63484 63706 223 1 1 67 71 207 0.959 15.03 2.10 Intr + 64960 65061 102 0 0 68 72 69 0.455 3.67 2.11 Intr + 76807 76933 127 0 1 43 105 53 0.441 2.75 2.12 Intr + 77078 77259 182 0 2 54 36 97 0.195 0.69 2.13 Intr + 83168 83274 107 1 2 92 46 88 0.948 4.01 2.14 Term + 84718 84874 157 1 1 131 47 117 0.991 9.31 2.15 PlyA + 84985 84990 6 1.05 3.07 PlyA - 88549 88544 6 1.05 3.06 Term - 101654 99998 1657 1 1 114 45 2741 0.016 261.31 3.05 Intr - 105222 105124 99 2 0 104 87 31 0.001 3.83 3.04 Intr - 117122 117049 74 2 2 135 36 26 0.047 0.30 3.03 Intr - 123962 123836 127 1 1 31 97 128 0.447 8.68 3.02 Intr - 131853 131805 49 1 1 100 80 58 0.244 3.94 3.01 Init - 140478 140427 52 0 1 91 65 28 0.258 2.24 3.00 Prom - 142286 142247 40 -8.16 4.00 Prom + 142393 142432 40 -3.06 4.01 Init + 142964 143002 39 1 0 59 109 -26 0.482 -3.21 4.02 Intr + 143931 144212 282 2 0 105 86 295 0.964 28.62 4.03 Intr + 147613 147761 149 0 2 62 58 139 0.938 7.33 4.04 Intr + 150323 150479 157 2 1 111 46 110 0.999 9.11 4.05 Intr + 151404 151598 195 2 0 67 113 357 0.997 35.71 4.06 Intr + 154013 154150 138 1 0 85 31 98 0.875 4.46 4.07 Intr + 156485 156644 160 1 1 102 82 108 0.996 11.16 4.08 Term + 156854 156885 32 0 2 92 45 48 0.862 -1.08 4.09 PlyA + 157358 157363 6 1.05 5.00 Prom + 159680 159719 40 -2.16 5.01 Init + 162445 162483 39 0 0 66 113 9 0.859 1.38 5.02 Intr + 165619 165810 192 0 0 88 121 309 0.953 33.89 5.03 Intr + 177823 177984 162 0 0 118 99 82 0.970 12.47 5.04 Intr + 178919 179065 147 1 0 64 79 162 0.999 13.33 5.05 Intr + 184950 185090 141 2 0 56 80 91 0.980 5.65 5.06 Intr + 186270 186493 224 2 2 126 90 250 0.981 25.93 5.07 Intr + 189842 189935 94 2 1 85 50 39 0.773 -0.23 5.08 Intr + 192279 192383 105 2 0 87 69 180 0.778 16.41 5.09 Intr + 196241 196293 53 1 2 86 2 61 0.066 -4.99 5.10 Term + 199231 199486 256 1 1 92 45 233 0.291 14.46 5.11 PlyA + 200236 200241 6 -3.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 52842 52995 154 0 1 141 49 111 0.830 9.89 S.002 Sngl - 101641 99998 1644 1 0 107 45 2725 0.809 265.39 S.003 Intr + 107461 107598 138 1 0 67 69 121 0.900 7.68 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:104214411_104416051|GENSCAN_predicted_peptide_1|229_aa MHTETIPNTAHSARVKERIIQRYLLKHEGSGPKRRANGRGAYGLRDTGVHSSGVAARSPA AAERWVQGFPKQNVHFVNDNTICYPCGNYVIFINIETKKKTVLQCSNGIVGVMATNIPCE VVAFSDRKLKPLIYVYSFPGLTRRTKLKGNILLDYTLLSFSYCGTYLASYSSLPEFELAL CPGFQLSPSRDDGLCQVPDAKSQRIKGTRDSSKSVVDVLGRAFVIGNEG >gi568815588r:104214411_104416051|GENSCAN_predicted_CDS_1|690_bp atgcacacagagaccatccctaacactgcccattcagctcgtgttaaagaacgaattatt caacggtacttgctaaagcacgaagggagcgggccaaagcgcagagccaatgggcgcggg gcttacggtctccgggatacgggtgtacacagcagcggcgtcgcggcccgcagccccgcg gcggcggaaagatgggtgcaaggattccctaagcagaatgttcattttgtcaacgacaac accatttgctacccttgtgggaattatgtaatatttattaatattgaaaccaagaaaaag actgtactgcagtgtagtaatggaattgtgggcgtcatggcaactaacatcccctgtgaa gttgtggctttttctgaccggaagctaaaacctctcatctacgtatacagctttccagga ttgaccagaaggaccaaattgaaaggcaacattctcctggactacactttactttcattc agttactgtggcacctacctggctagttactcctctctcccagaatttgaactggccctt tgcccaggctttcagttgtctccatccagggatgacggtctttgccaggtacctgatgca aaaagtcaaaggatcaagggaaccagagattccagcaagagtgtggttgatgtgttgggc agggcatttgtgatagggaatgaaggatga >gi568815588r:104214411_104416051|GENSCAN_predicted_peptide_2|750_aa MNPAGGTNDSRCAALRAVTLIVKVRSFTLEASETTNPPERTLNTSEHQKEQLPDTPPLRT VTLTARVYGFILEVRSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLK NKPEWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMIL ELFSKVPSLVGSFIRSQNKEDYAGLKEEFRKEFTKLEEVCRPHSKTETVDGSHEGRSHSL SPAYYRLGWVAKSVFQMSPSTQVTQAVWGPHPEGDLEPTDFTKASCRKPSAATKAPAAGG GPPLLPRAYRSRLALVLSSGFPLRCGAGCRGPTANHYRFLFRQPPAPAGPANPDLQEPSP VVQKLNLGQGVRARAQLFLEPAAAVASHRAATVSSGSCANHLETMSGDATRTLGKGSQPP GPVPEGLIRIYSMRFCPYSHRTRLVLKAKDIRHEVVNINLRNKPEWYYTKHPFGHIPVLE TSQCQLIYESVIACEYLDDAYPGRKLFPYDPYERARQKMLLELFCKVPHLTKECLVALRC GRECTNLKAALRQEFSNLEENDFHEEKLSSSSSKYSSPHWEPPHCFEIGGLGFLGNGVED GEDFFILRLNVLGEPRPTRLALMATRGSPARFLLPLVFQGALQNADVLLRPSGGARGALA EDEILEYQNTTFFGGTCISMIDYLLWPWFERLDVYGILDCVSHTPALRLWISAMKWDPTV CALLMDKSIFQGFLNLYFQNNPNAFDFGLC >gi568815588r:104214411_104416051|GENSCAN_predicted_CDS_2|2253_bp atgaacccagcgggaggaacaaacgactccagatgtgcggccttaagagctgtaacactc atcgtgaaggtccgcagcttcactcttgaagccagcgagacaacgaacccaccagaaaga actctgaacacatcggaacatcagaaggaacaacttccagacacgccgcctttaagaact gtaacactcaccgcaagggtctatggcttcattcttgaagtcagaagcgcgcccccgggg ccggtcccggagggctcgatccgcatctacagcatgaggttctgcccgtttgctgagagg acgcgtctagtcctgaaggccaagggaatcaggcatgaagtcatcaatatcaacctgaaa aataagcctgagtggttctttaagaaaaatccctttggtctggtgccagttctggaaaac agtcagggtcagctgatctacgagtctgccatcacctgtgagtacctggatgaagcatac ccagggaagaagctgttgccggatgacccctatgagaaagcttgccagaagatgatctta gagttgttttctaaggtgccatccttggtaggaagctttattagaagccaaaataaagaa gactatgctggcctaaaagaagaatttcgtaaagaatttaccaagctagaggaggtgtgt agaccacactccaaaactgaaactgtggatggcagccatgaaggaagatcccacagtctc agccctgcttactacaggctggggtgggtagcgaagtctgtatttcaaatgagcccctca acccaggtgacgcaggcggtgtggggaccgcatccggagggcgacctggagccgactgac ttcacaaaggcctcctgccgcaaaccttcagcggccaccaaagccccggctgccggcggc ggaccacctctgctgccgcgcgcctaccggagccgcttggccctagtgctttccagcgga tttcccctcaggtgcggagccgggtgccggggtcccacagccaaccactaccggttcctc tttcgtcagccaccggcgccggcaggacccgcgaatcccgatctccaggagcctagccca gtagttcaaaaattaaatttggggcaaggggtgcgcgccagagcgcagctgtttctggag cctgcggcagcggtggcgagccacagggcggcgaccgtgagctccgggagctgcgcaaac cacctggagaccatgtctggggatgcgaccaggaccctggggaaaggaagccagccccca gggccagtcccggaggggctgatccgcatctacagcatgaggttctgcccctattctcac aggacccgcctcgtcctcaaggccaaagacatcagacatgaagtggtcaacattaacctg agaaacaagcctgaatggtactatacaaagcacccttttggccacattcctgtcctggag accagccaatgtcaactgatctatgaatctgttattgcttgtgagtacctggatgatgct tatccaggaaggaagctgtttccatatgacccttatgaacgagctcgccaaaagatgtta ttggagctattttgtaaggtcccacatttgaccaaggagtgcctggtagcgttgagatgt gggagagaatgcactaatctgaaggcagccctgcgtcaggaattcagcaacctggaagag aacgattttcatgaggaaaaactttcttcctcctccagcaagtacagcagcccccactgg gagccaccacactgtttcgagattgggggcttgggatttcttggcaatggtgtagaggat ggtgaagatttcttcatcctccggttgaacgttcttggggagccgcggccgacgcgcctc gcactgatggccaccagggggagccccgcgcgcttcctccttccccttgtgttccagggc gcacttcaaaacgctgacgttctccttcgtccctctgggggcgcccgaggggccctcgca gaagatgagattcttgagtatcagaacaccaccttctttggtggaacctgtatatccatg attgattacctcctctggccctggtttgagcggctggatgtgtatgggatactggactgt gtgagccacacgccagccctgcggctctggatatcagccatgaagtgggaccccacagtc tgtgctcttctcatggataagagcattttccagggcttcttgaatctctattttcagaac aaccctaatgcctttgactttgggctgtgctga >gi568815588r:104214411_104416051|GENSCAN_predicted_peptide_3|685_aa MVLKDDGAWAGFMGVKTGVRVMGGRSLMNGLVQSAGPQAPPQPPASSGASALDLRAKPVL EPSSRGGDRDRGPKPIVISLKLMQESPQLSPPPPVSKQQGRVLQTEPACAEFNREPNDSW DPIRGSGEMDYVRKKAPTMAMGLFRVCLVVVTAIINHPLLFPRENATVPENEEEIIRKMQ AHQEKLQLEQLRLEEEVARLAAEKEALEQVAEEGRQQNETRVAWDLWSTLCMILFLMIEV WRQDHQEGPSPECLGGEEDELPGLGGAPLQGLTLPNKATLGHFYERCIRGATADAARTRE FLEGFVDDLLEALRSLCNRDTDMEVEDFIGVDSMYENWQVDRPLLCHLFVPFTPPEPYRF HPELWCSGRSVPLDRQGYGQIKVVRADGDTLSCICGKTKLGEDMLCLLHGRNSMAPPCGD MENLLCATDSLYLDTMQVMKWFQTALTRAWKGIAHKYEFDLAFGQLDSPGSLKIKFRSGK FMPFNLIPVIQCDDSDLYFVSHLPREPSEGTPASSTDWLLSFAVYERHFLRTTLKALPEG ACHLSCLQIASFLLSKQSRLTGPSGLSSYHLKTALLHLLLLRQAADWKAGQLDARLHELL CFLEKSLLQKKLHHFFIGNRKVPEAMGLPEAVLRAEPLNLFRPFVLQRSLYRKTLDSFYE MLKNAPALISEYSLHVPSDQPTPKS >gi568815588r:104214411_104416051|GENSCAN_predicted_CDS_3|2058_bp atggtgctgaaagatgatggggcttgggctggcttcatgggagtgaaaactggtgttcgg gtcatggggggtagatccctcatgaatggcttggtgcagagtgccgggccgcaagctcca ccgcagccgcctgcaagcagcggcgcctcggccctcgacctgcgcgcaaagcctgtgctg gagccgtcctcccgcggcggggaccgggaccggggacccaagccaatcgtcatttccctg aagctgatgcaggagtccccccaactctcgccccctcctcctgtatcaaagcagcaggga agggtgctgcagacagagccagcctgtgcagagtttaacagggagccaaatgacagctgg gatccgatccggggaagcggggagatggattacgtgagaaagaaagctccaaccatggcc atggggctcttccgcgtgtgtctggtggtggtgacggccatcatcaaccacccgctgctg ttcccgcgggagaacgccacagtccccgagaacgaggaggagatcatccgcaagatgcag gcgcaccaggagaagctgcagctggagcagttgcgcctggaggaggaggtggctcggctg gcggccgaaaaggaggcactggagcaggtggcggaggagggcaggcagcagaacgagaca cgcgtggcctgggacctctggagcaccctctgcatgatcctcttcctgatgatcgaggtg tggcggcaggaccaccaggaggggccctcacctgagtgcctgggcggtgaggaggatgag ctgcctgggctggggggcgcccccttgcagggcctcaccctgcccaacaaggccacgctt ggccacttttatgagcgctgcatccggggggccacggccgatgcagcccgtacccgggag ttcctggaaggcttcgtggatgacttgctggaagccctgaggagcctctgcaaccgggac accgacatggaggtggaggacttcattggcgtggacagcatgtacgagaactggcaggtg gacaggccactgctgtgccaccttttcgtgcccttcacaccccccgagccctaccgcttc cacccagagctctggtgctccggccgctcagtgcccctggatcgccagggctacggccag atcaaggtggtccgcgccgatggggacacattgagctgcatctgcggcaagaccaagctc ggggaagacatgctgtgtctcctgcacggcaggaacagcatggcgcctccctgcggcgac atggagaacctgctgtgtgccacagattccctgtacctggacacgatgcaggtcatgaag tggttccagacggccctcaccagagcctggaagggcatcgcccacaagtacgagttcgac ctggcctttggccagctggacagcccggggtccctgaagatcaagttccgttcagggaag ttcatgcccttcaacctgattcctgtgatccagtgtgatgactcggacctgtactttgtc tcccaccttcccagggagccctctgagggcaccccagcctccagcacagactggctcctg tcctttgctgtctatgagcgacacttcctcaggacgacactaaaggcactgcccgagggc gcctgccacctcagctgcctgcagatagcatccttcctgctctccaagcagagccgcctg accggtcccagcgggctcagcagctaccacctgaagacggccctactgcacctcctactc ctccggcaggccgccgactggaaggcggggcagctggacgctcgtctgcacgagttgctg tgcttcctggagaagagcttgctccagaagaagctccaccacttcttcatcggcaaccgc aaggtgcctgaggccatgggactccctgaggccgtgctcagggccgagcccctcaacctc ttccggcccttcgtcctgcagcgaagcctttaccgtaagacactggactccttctatgag atgctcaagaatgccccagcgctcattagcgagtattccctacatgtcccctcagaccag cctaccccaaaaagctga >gi568815588r:104214411_104416051|GENSCAN_predicted_peptide_4|383_aa MMESLLGDKCIFQEKGGKQVLEESAFEEMERDFQGVLHELSGDKSLEKFRIEYERLHAVM KKSYDNEKRLMAKCRELNAEIVVNSAKVATALKLSQDDQTTIASLKKEIEKAWKMVDSAY DKEQKAKETILALKEEIVNLTKLVEQGSGLSMDQHSNIRDLLRFKEEVTKERDQLLSEVV KLRESLAQTTEQQQETERSKEEAEHAISQFQQEIQQRQNEASREFRKKEKLEKELKQIQA DMDSRQTEIKALQQYVQKSKEELQKLEQQLKEQKILNERAAKELEQFQMRNAKLQQENEQ HSLVCEQLSQENQQKALELKAKEEEVHQMRLDIGKLNKIREQIHKKLHHTEDQKAEVEQH KETLKNQIVGLERGSIGKNDNPT >gi568815588r:104214411_104416051|GENSCAN_predicted_CDS_4|1152_bp atgatggaatctcttttgggagataagtgcatatttcaggaaaagggtggaaagcaagtc ctggaagaatctgcatttgaagaaatggaaagagattttcagggagttctccatgaactt tctggagacaaaagtttggaaaaatttcggattgaatatgagaggcttcatgctgtcatg aaaaagtcttatgacaatgaaaagcgtctgatggccaaatgcagagagctaaatgcagag attgtagtgaattctgcgaaggtcgccactgcccttaagctctctcaggatgatcagacc accattgcatccctaaagaaggaaattgaaaaggcctggaagatggtggactcagcctat gacaaagagcagaaggccaaggagacgattcttgctctgaaagaggaaatagtgaacctg accaaactagtggagcaggggtctggactgtcaatggaccagcatagcaacatccgagat ttactgaggttcaaagaagaagtgacaaaggagagagaccagctcttatcagaagtggta aaattacgagaatccctagctcagaccactgaacagcagcaggaaacagagcgatcaaaa gaggaggctgaacatgccatcagtcagttccaacaagaaatccagcaacgtcagaacgaa gcttcccgggagttccggaagaaggaaaaactagagaaagagctcaagcagattcaggca gacatggacagcaggcagacagaaataaaagccctgcagcagtatgtgcagaagagcaag gaggagcttcagaagctggagcagcagctgaaggagcagaagatattgaatgagagagct gcaaaggaactcgagcaatttcagatgagaaatgctaaacttcagcaagagaatgaacag cacagtttggtctgtgagcagctatcccaggaaaaccaacagaaggcgttggagctcaaa gccaaagaggaagaagtccatcaaatgcgccttgacatcgggaagctcaacaaaatcaga gaacaaattcataagaaattgcaccacaccgaagatcaaaaggcagaagtcgaacagcac aaagaaaccctaaaaaatcagattgtgggattagagagaggttccattggcaaaaatgac aatccaacctga >gi568815588r:104214411_104416051|GENSCAN_predicted_peptide_5|470_aa MDELLRERDILNKNMLKAVNATQKQTDLVKLHEQAKRNLEGEIQNYKDEAQKQRKIIFHL EKERDRYINQASDLTQKVLMNMEDIKVRETQIFDYRKKIAESEIKLKQQQNLYEAVRSDR NLYSKNLVEAQDEITDMKRKLKIMIHQVDELKEDISAKESALVKLHLEQQRIEKEKETLK AELQKLRQQALETKHFIEKQEAEERKLLRIIAEADGERLRQKKELDQVISERDILGSQLV RRNDELALLYEKIKIQQSVLNKGESQYNQRLEDMRILRLEIKKLRREKGILARSMANVEE LRKYGFVNCPLFESIALAKPFLLESGDTAKVPLASDPNAYELIQKIHTLQKRLISKTEEV VEKELLLQLDLLFAFDVSEFQHNAFGHLWDEAGLYALSNCYPLPGYHYIIIIIIIIIIII IIKMAFLIMGIRHDSIGIIIIIKMAFLIMGIRHDSIGVGSSNLLLEAFKT >gi568815588r:104214411_104416051|GENSCAN_predicted_CDS_5|1413_bp atggacgagcttctaagagaaagggacatactaaataagaacatgcttaaggcggtcaat gcgacccagaagcagacagacttggtaaagctccatgaacaagccaagaggaacctggag ggagaaatccagaactacaaggatgaggctcagaagcagagaaagatcatctttcatctg gaaaaggagcgtgaccggtacatcaaccaagccagtgaccttacgcaaaaggtccttatg aacatggaagacataaaagttcgtgaaacacagatttttgactacaggaaaaaaatagct gaatcagagattaaattaaaacagcaacagaacctatatgaagctgtgagatcagacaga aatctgtatagcaaaaatctggttgaggctcaggatgaaataacagatatgaagagaaag ttaaagattatgatccatcaggtagatgagctgaaagaagacatctctgccaaagagtcc gcacttgtgaagctgcacctggaacagcagcgaatagaaaaggaaaaggaaacattgaag gctgagctgcagaagctgagacaacaagccctggagacaaaacactttattgaaaagcaa gaagctgaagagagaaaactcctgcgaataattgctgaggctgacggggagaggttgaga cagaagaaggaattagaccaggtcatcagtgagagagatatcctggggtctcagcttgtt cggcgcaatgatgagttagctttgctctatgagaagatcaagatccaacagtctgtgctg aataaaggggagagccagtacaaccagaggttggaggacatgagaatcctcagacttgag atcaagaagcttcgccgggaaaaggggattcttgccaggagtatggctaatgttgaagaa ctcagaaaatacggttttgtgaactgccctctttttgaaagcattgcgttagccaagccc tttctcttagaatctggggacacagctaaagtcccattggccagcgaccccaatgcatat gagctgatacagaaaattcacaccctgcagaagcgtctcatcagcaagactgaagaggtg gttgaaaaagagctgctcctccagttagatcttctctttgcctttgatgtttcagagttt cagcacaacgcatttgggcatctgtgggatgaggcaggtctatatgccttgagtaactgc tatccattacctgggtatcattacatcatcatcatcatcatcatcatcatcatcatcatc atcatcaagatggcatttctaataatgggcataaggcatgatagcattggcatcatcatc atcatcaagatggcatttctaataatgggcataaggcatgatagcattggagtaggaagc tctaacctcttgctagaggcatttaaaacatga