GENSCAN 1.0 Date run: 3-Nov-116 Time: 01:03:35 Sequence gi568815588r:71962860_72188328 : 225469 bp : 46.88% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 574 814 241 0 1 68 9 127 0.346 0.84 1.02 Intr + 1379 1835 457 0 1 120 98 219 0.215 18.38 1.03 Intr + 12886 13040 155 1 2 72 94 62 0.076 4.92 1.04 Term + 14483 14490 8 0 2 145 42 0 0.096 -0.87 1.05 PlyA + 15507 15512 6 -0.45 2.03 PlyA - 16207 16202 6 1.05 2.02 Term - 16405 16331 75 1 0 97 49 80 0.786 2.84 2.01 Init - 22468 22454 15 1 0 99 95 22 0.786 2.60 2.00 Prom - 23002 22963 40 -2.76 3.04 PlyA - 26768 26763 6 1.05 3.03 Term - 32097 31866 232 2 1 68 48 176 0.955 7.55 3.02 Intr - 35587 35407 181 2 1 42 28 118 0.045 0.13 3.01 Init - 39105 39021 85 0 1 92 11 72 0.128 0.88 3.00 Prom - 42500 42461 40 -4.06 4.00 Prom + 42706 42745 40 -4.86 4.01 Init + 42984 43123 140 2 2 68 100 92 0.898 8.01 4.02 Term + 44313 45612 1300 0 1 143 50 2911 0.993 283.87 4.03 PlyA + 49193 49198 6 1.05 5.00 Prom + 64981 65020 40 -3.06 5.01 Init + 81617 81911 295 1 1 95 82 108 0.678 8.49 5.02 Term + 93134 93360 227 0 2 89 52 97 0.639 3.14 5.03 PlyA + 94986 94991 6 1.05 6.11 PlyA - 96196 96191 6 1.05 6.10 Term - 100046 99901 146 0 2 105 42 269 0.238 22.07 6.09 Intr - 100303 100166 138 2 0 106 64 255 0.999 25.34 6.08 Intr - 101381 101319 63 0 0 94 94 134 0.801 13.29 6.07 Intr - 104261 104043 219 0 0 65 70 363 0.998 30.47 6.06 Intr - 104873 104754 120 0 0 107 58 131 0.999 12.47 6.05 Intr - 105442 105328 115 1 1 86 94 144 0.999 14.82 6.04 Intr - 107567 107453 115 1 1 89 65 77 0.865 5.95 6.03 Intr - 109399 109285 115 2 1 112 94 217 0.999 24.21 6.02 Intr - 109689 109644 46 0 1 140 87 49 0.918 7.98 6.01 Init - 125469 125281 189 0 0 85 86 589 0.968 55.41 6.00 Prom - 129720 129681 40 -6.46 7.06 PlyA - 130426 130421 6 1.05 7.05 Term - 134591 134475 117 2 0 94 51 126 0.946 8.04 7.04 Intr - 165308 165223 86 0 2 40 86 45 0.010 -0.96 7.03 Intr - 170322 170198 125 2 2 73 115 47 0.098 6.13 7.02 Intr - 190129 190010 120 0 0 86 76 102 0.659 8.41 7.01 Intr - 198815 198679 137 2 2 72 78 99 0.647 6.67 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 87954 88104 151 2 1 99 82 41 0.908 4.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:71962860_72188328|GENSCAN_predicted_peptide_1|286_aa MPSKWNPRRSRLLGDRAITRIRKGTGRFGLCSSSEPFTSSLEKLAFAPRSRCCPADHAMH GRRLGPHLPGSPAGDPAAWEEARPARGPSSANFANGSQVAGRAGVGAGWGEEEEEEEEKG SRTPWTRFRNPPPLAGSGLSGEGARQDLSQARCAGRRPPGAASLPACRPAPQRLHPSGPP RRRRTARAARAPGPRNRFCRDLPGGKRSCERMLPAPDPSRGGSGPPVESRRPGCQPLQLL LQLTALSRCRPQSGQLEWRGQWPCPVQNDILSPSQGKRDYPCFPGQ >gi568815588r:71962860_72188328|GENSCAN_predicted_CDS_1|861_bp atgcccagcaagtggaatccaagacgttctcgccttctcggggacagggccatcaccagg attcggaaaggaacagggaggttcggtttgtgttccagttctgagccctttacctcatcc ctcgagaagctggcctttgcaccccgcagccggtgctgcccagccgaccacgccatgcac gggaggcgactgggaccccacttgcctggcagcccggccggagacccagctgcctgggag gaggcccggccggctcgcggcccgagcagcgccaacttcgccaacggttcccaagttgcc gggcgcgcaggggtgggagctgggtggggtgaggaagaggaggaggaggaggagaagggc tcccggacgccttggacgcggttcaggaatccgccgccgctagccggctcgggcctgagc ggggagggcgccaggcaggacctctcgcaggctcgctgcgcaggacggcgcccgcctggc gccgcttcccttccagcgtgccgaccggccccgcagcgcctccatccctccggcccgccc cggagaagacgcacagctcgggccgcgcgggcgccggggccgcggaaccgcttctgccgg gatcttccaggaggaaagcgaagttgcgagcggatgctgcccgcgccggaccccagccgc ggagggtcggggccgccggtggagtctcggcggccgggatgccagcctttgcagctgctg ctgcaactcacagctctctcccggtgccgccctcagtcaggccagctggagtggaggggc cagtggccgtgccctgtgcagaatgacatcctctcaccttcacaagggaagcgtgattat ccttgctttccagggcagtag >gi568815588r:71962860_72188328|GENSCAN_predicted_peptide_2|29_aa MVWVLMRNRGPDTLSDPPKATGWRFEARM >gi568815588r:71962860_72188328|GENSCAN_predicted_CDS_2|90_bp atggtgtgggttctgatgagaaacagaggtccagacacgctgagcgacccgcccaaggcc accggctggaggtttgaggccaggatgtga >gi568815588r:71962860_72188328|GENSCAN_predicted_peptide_3|165_aa MQEALYKKQACDTCDIRYTFVSLLSHFHGAVAEAGVQDALNHVAGQPREESAGSFRTDLD IDQDSCPRTWPKSSMGEAWRASYQSRQAGKAERDKEAAENLEAASVGWFMRFKERRHLHN IKVQGKAGSADGELAASYPKDLAKITDEGGYNNQQVFDVDQTAFY >gi568815588r:71962860_72188328|GENSCAN_predicted_CDS_3|498_bp atgcaggaggccctttacaagaagcaggcctgtgacacctgtgacatcagatacaccttt gtatcacttctctcccacttccatggagctgtggctgaagctggtgtccaggatgccctc aaccatgtagccgggcagccccgggaggagtctgctggctcattcagaaccgacctggac atagaccaagattcatgcccaaggacgtggcccaagtccagcatgggagaagcatggcga gcatcttatcagagccggcaggctgggaaagctgagagagacaaggaagctgcagaaaat ttggaagctgctagcgttggttggttcatgaggtttaaggaaagaagacatctccataac ataaaagtgcaaggtaaagcaggaagtgctgatggagaattggcagcaagttatcccaag gatctagctaagatcactgacgaaggtggctacaataatcaacaggttttcgatgtagac caaacagccttctattga >gi568815588r:71962860_72188328|GENSCAN_predicted_peptide_4|479_aa MEKGLTLPQDCRDFVHSLKMRSKYALFLVFVVIVFVFIEKENKIISRVSDKLKQIPQALA DANSTDPALILAENASLLSLSELDSAFSQLQSRLRNLSLQLGVEPAMEAAGEEEEEQRKE EEPPRPAVAGPRRHVLLMATTRTGSSFVGEFFNQQGNIFYLFEPLWHIERTVSFEPGGAN AAGSALVYRDVLKQLFLCDLYVLEHFITPLPEDHLTQFMFRRGSSRSLCEDPVCTPFVKK VFEKYHCKNRRCGPLNVTLAAEACRRKEHMALKAVRIRQLEFLQPLAEDPRLDLRVIQLV RDPRAVLASRMVAFAGKYKTWKKWLDDEGQDGLREEEVQRLRGNCESIRLSAELGLRQPA WLRGRYMLVRYEDVARGPLQKAREMYRFAGIPLTPQVEDWIQKNTQAAHDGSGIYSTQKN SSEQFEKWRFSMPFKLAQVVQAACGPAMRLFGYKLARDAAALTNRSVSLLEERGTFWVT >gi568815588r:71962860_72188328|GENSCAN_predicted_CDS_4|1440_bp atggagaaaggactcactttgccccaggactgccgggactttgtgcacagcctgaagatg agaagcaaatacgcccttttcttggtttttgtggtgatagtttttgtcttcatcgaaaag gaaaataaaatcatatcaagggtctcagacaagctgaagcagattccccaagctctagca gatgccaacagcaccgacccagccctgatcttagctgagaacgcatctctcttgtccctg agcgagctcgattcagccttctcccagcttcagagccgtctccgcaacctcagcttgcag ctgggcgtggagccagccatggaggccgcaggggaggaagaggaagagcagagaaaggag gaggagccgcccagaccggccgtggcggggccccggcgccacgtgctgctcatggccacc acgcgcaccggctcctcgttcgtgggcgagttcttcaaccagcagggcaacatcttctac ctcttcgagccgctgtggcacatcgagcgcacagtgtccttcgagccggggggcgccaac gccgcgggctcggccctggtgtaccgcgacgtgctcaagcagctcttcctgtgcgacctg tacgtgctggagcacttcatcacgccgctgcccgaggaccacctgactcagttcatgttc cgccggggctccagccgctccctgtgcgaggaccccgtctgtacgcccttcgtcaagaag gtcttcgagaagtaccactgcaagaaccgccgctgcggccccctcaacgtgacgctggcc gcagaggcctgccgccgcaaggagcacatggccctcaaggcggtgcgcatccggcagctg gagttcctgcagccgctggccgaggacccccgcctggacctgcgcgtcatccagctggtg cgcgacccccgggccgtgctggcctcgcgcatggtggccttcgccggcaagtataagacc tggaagaagtggctggacgacgagggccaggacggcctgagggaagaggaggtgcagcgg ctgcggggcaactgcgagagcatccgcctgtccgcggagctggggctgcggcagcccgcc tggctgcggggccgctacatgctggtgcgctacgaggacgtggcacgcgggccgctgcag aaggcccgcgagatgtaccgcttcgccggcatccccctgaccccgcaggtggaagactgg atccaaaagaacacgcaggcggcccacgacggcagcggcatctactccacgcagaagaac tcctcggagcagttcgagaagtggcgcttcagcatgcccttcaagctggcccaggtggtg caggccgcctgcggccctgccatgcgcctcttcggctacaaactggcgcgggacgccgcc gccctcaccaaccgctcagtcagcctgctggaggagaggggcaccttctgggtcacgtag >gi568815588r:71962860_72188328|GENSCAN_predicted_peptide_5|173_aa MPGRPGAAPAGHFPGRTVLASSLQSSSRPPLAIKAPFSGGCLCWALPTSTGSLQTLLAGT RTQSLPAAEGPPLWTHEPAGWCTLALGSQLLPPPAHPGDGETEAHGEVLTQVQKQVLEES PCLTFFCTSPVPTLPSTFNHDLLMKTQDFPPWALSQCGRHARSGYSPGQLNLQ >gi568815588r:71962860_72188328|GENSCAN_predicted_CDS_5|522_bp atgccaggacggcctggagcagcacctgctggccacttccctggccgcacggtgctggca tcctccctgcagtccagcagccgtcctccactggccataaaggcccctttctctggtggc tgtttgtgttgggccctacccacctccacgggcagcctccagacactgctggcaggcacg aggacacagtccctgcccgctgctgaagggccgcctttgtggacacatgaaccagccggc tggtgcaccctggcccttggaagccagctgcttccccctcctgctcatccaggagatggg gaaactgaggcccatggagaggtgctgacccaagttcagaagcaggtcctggaagaatct ccatgtcttaccttcttctgtacttccccagtacccacgctcccatccacattcaaccat gacttactgatgaaaacacaggattttccaccttgggcattatctcagtgcgggaggcat gccaggagtggctacagccctgggcagctgaacttgcagtga >gi568815588r:71962860_72188328|GENSCAN_predicted_peptide_6|421_aa MRAPGCGRLVLPLLLLAAAALAEGDAKGLKEGETPGNFMEDEQWLSSISQYSGKIKHWNR FRDDDYIKSWEDNQQGDEALDTTKDPCQKVKCSRHKVCIAQGYQRAMCISRKKLEHRIKQ PTVKLHGNKDSICKPCHMAQLASVCGSDGHTYSSVCKLEQQACLSSKQLAVRCEGPCPCP TEQAATSTADGKPETCTGQDLADLGDRLRDWFQLLHENSKQNGSASSVAGPASGLDKSLG ASCKDSIGWMFSKLDTSADLFLDQTELAAINLDKYEVCIRPFFNSCDTYKDGRVSTAEWC FCFWREKPPCLAELERIQIQEAAKKKPGIFIPSCDEDGYYRKMQCDQSSGDCWCVDQLGL ELTGTRTHGSPDCDDIVGFSGDFGSGVGWEDEEEKETEEAGEEAEEEEGEAGEADDGGYI W >gi568815588r:71962860_72188328|GENSCAN_predicted_CDS_6|1266_bp atgcgcgccccgggctgcgggcggctggtgctgccgctgctgctcctggccgcggcagcc ctggccgaaggcgacgccaaggggctcaaggagggcgagacccccggcaatttcatggag gacgagcaatggctgtcgtccatctcgcagtacagcggcaagatcaagcactggaaccgc ttccgagacgatgactatatcaagagctgggaggacaatcagcaaggagatgaagccctg gataccaccaaggacccctgccagaaggtgaagtgcagccgccacaaggtgtgcattgcc cagggctaccagcgggccatgtgcatcagtcgcaagaagctggagcacaggatcaagcag ccgaccgtgaaactccatggaaacaaagactccatctgcaagccctgccacatggcccag cttgcctctgtctgcggctcagatggccacacttacagctctgtgtgtaagctggagcaa caggcgtgcctgagcagcaagcagctggcggtgcgatgcgagggcccctgcccctgcccc acggagcaggctgccacctccaccgccgatggcaaaccagagacttgcaccggtcaggac ctggctgacctgggagatcggctgcgggactggttccagctccttcatgagaactccaag cagaatggctcagccagcagtgtagccggcccggccagcgggctggacaagagcctgggg gccagctgcaaggactccattggctggatgttctccaagctggacaccagtgctgacctc ttcctggaccagacggagctggccgccatcaacctggacaagtacgaggtctgcatccgt cccttcttcaactcctgtgacacctacaaggatggccgggtctctactgctgagtggtgc ttctgcttctggagggagaagcccccctgcctggcagagctggagcgcatccagatccag gaggccgccaagaagaagccaggcatcttcatcccgagctgcgacgaggatggctactac cggaagatgcagtgtgaccagagcagcggtgactgctggtgtgtggaccagctgggcctg gagctgactggcacgcgcacgcatgggagccccgactgcgatgacatcgtgggcttctcg ggggactttggaagcggtgtcggctgggaggatgaggaggagaaggagacggaggaagca ggcgaggaggccgaggaggaggagggcgaggcaggcgaggctgacgacgggggctacatc tggtag >gi568815588r:71962860_72188328|GENSCAN_predicted_peptide_7|194_aa DHGVDSSIFQNPKKLHLTIGMLVLLSEEEIQQTCEMLQQCKEEFINDISGGKPLEVEMAG IEYMNDDPGMVDVLYAKVHMKDGSNRLQELVDRVLERFQASGLIVKEWNSVKLHATVMNT LFRKDPNAEGRYNLYTAEGKYIFKERESFDGRNILKLFENFYFGSLKLNSIHISQRFTVD SFGNYASCGQIDFS >gi568815588r:71962860_72188328|GENSCAN_predicted_CDS_7|585_bp gatcatggggttgacagcagcattttccagaatcctaaaaagcttcatctaactattggg atgttggtgcttttgagtgaggaagagatccagcagacatgtgagatgctacagcagtgt aaagaggaattcattaatgatatttctgggggtaaacccctagaagtggagatggcaggg atagaatacatgaatgatgatcctggcatggtggatgttctttacgccaaagtccatatg aaagatggctccaacaggctacaagaattagttgatcgagtgctggaacgttttcaggca tctggactaatagtgaaagagtggaatagtgtgaaactgcatgctacagttatgaataca ctattcaggaaagaccccaatgctgaaggcaggtacaatctctacacagcggaaggcaaa tatatcttcaaggaaagagaatcatttgatggccgaaatattttaaagttgtttgagaac ttctactttggctccctaaagctgaattcaattcacatctctcagaggttcaccgtagac agctttggaaactacgcttcctgtggacaaattgacttctcctga