GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:37:58 Sequence gi568815588r:71997337_72313298 : 315962 bp : 46.87% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3592 3718 127 0 1 73 94 60 0.219 5.42 1.02 Intr + 8457 8646 190 1 1 17 100 116 0.193 4.34 1.03 Term + 9836 11135 1300 2 1 143 50 2911 0.993 283.87 1.04 PlyA + 14716 14721 6 1.05 2.00 Prom + 30504 30543 40 -3.06 2.01 Init + 47140 47434 295 0 1 95 82 108 0.678 8.49 2.02 Term + 58657 58883 227 2 2 89 52 97 0.639 3.14 2.03 PlyA + 60509 60514 6 1.05 3.11 PlyA - 61719 61714 6 1.05 3.10 Term - 65569 65424 146 2 2 105 42 269 0.238 22.07 3.09 Intr - 65826 65689 138 1 0 106 64 255 0.999 25.34 3.08 Intr - 66904 66842 63 2 0 94 94 134 0.801 13.29 3.07 Intr - 69784 69566 219 2 0 65 70 363 0.998 30.47 3.06 Intr - 70396 70277 120 2 0 107 58 131 0.999 12.47 3.05 Intr - 70965 70851 115 0 1 86 94 144 0.999 14.82 3.04 Intr - 73090 72976 115 0 1 89 65 77 0.865 5.95 3.03 Intr - 74922 74808 115 1 1 112 94 217 0.999 24.21 3.02 Intr - 75212 75167 46 2 1 140 87 49 0.918 7.98 3.01 Init - 90992 90804 189 2 0 85 86 589 0.968 55.41 3.00 Prom - 95243 95204 40 -6.46 4.12 PlyA - 95949 95944 6 1.05 4.11 Term - 100114 99998 117 1 0 94 51 126 0.946 8.04 4.10 Intr - 130831 130746 86 2 2 40 86 45 0.010 -0.96 4.09 Intr - 135845 135721 125 1 2 73 115 47 0.098 6.13 4.08 Intr - 155652 155533 120 2 0 86 76 102 0.656 8.41 4.07 Intr - 164338 164202 137 1 2 72 78 99 0.643 6.67 4.06 Intr - 199653 199475 179 1 2 106 115 171 0.999 21.24 4.05 Intr - 206188 206091 98 0 2 90 77 8 0.421 -0.45 4.04 Intr - 213495 213396 100 1 1 55 78 123 0.555 7.17 4.03 Intr - 215995 215851 145 1 1 34 84 152 0.043 9.26 4.02 Intr - 226688 226512 177 2 0 32 44 117 0.006 1.82 4.01 Init - 229295 229254 42 2 0 62 66 38 0.008 -2.58 4.00 Prom - 229783 229744 40 -4.76 5.00 Prom + 229834 229873 40 -5.16 5.01 Init + 230777 230825 49 1 1 67 70 -3 0.180 -3.09 5.02 Intr + 233030 233104 75 0 0 101 103 33 0.972 5.59 5.03 Term + 235665 235780 116 1 2 96 43 175 0.966 12.53 5.04 PlyA + 236388 236393 6 1.05 6.00 Prom + 240057 240096 40 -2.96 6.01 Init + 246404 246583 180 1 0 84 32 75 0.037 0.68 6.02 Intr + 263471 263605 135 1 0 157 47 13 0.060 4.86 6.03 Intr + 276821 277085 265 1 1 112 105 298 0.994 30.99 6.04 Term + 277359 277852 494 1 2 112 43 799 0.999 72.57 6.05 PlyA + 278679 278684 6 1.05 7.00 Prom + 281500 281539 40 -4.76 7.01 Init + 283347 283424 78 2 0 96 73 107 0.563 11.07 7.02 Intr + 288398 288452 55 1 1 48 103 31 0.353 -0.85 7.03 Intr + 294555 294623 69 1 0 101 94 60 0.707 7.05 7.04 Intr + 297051 297088 38 1 2 118 53 3 0.421 -2.22 7.05 Term + 298564 298854 291 0 0 115 47 156 0.892 9.54 7.06 PlyA + 299662 299667 6 -0.45 8.02 PlyA - 299757 299752 6 1.05 8.01 Sngl - 303535 303020 516 1 0 88 43 235 0.926 13.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 53477 53627 151 1 1 99 82 41 0.908 4.80 S.002 Init - 215962 215851 112 1 1 56 84 115 0.902 8.27 S.003 Sngl + 218306 218551 246 1 0 53 48 197 0.829 7.28 S.004 Term + 263471 263538 68 0 2 157 49 29 0.811 4.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:71997337_72313298|GENSCAN_predicted_peptide_1|538_aa MEDEKPTPATLGPQEIVDAVGSIIRWRGAGQKPDETAGRWQAAECPRLARGAPTAPPFPM EKGLTLPQDCRDFVHSLKMRSKYALFLVFVVIVFVFIEKENKIISRVSDKLKQIPQALAD ANSTDPALILAENASLLSLSELDSAFSQLQSRLRNLSLQLGVEPAMEAAGEEEEEQRKEE EPPRPAVAGPRRHVLLMATTRTGSSFVGEFFNQQGNIFYLFEPLWHIERTVSFEPGGANA AGSALVYRDVLKQLFLCDLYVLEHFITPLPEDHLTQFMFRRGSSRSLCEDPVCTPFVKKV FEKYHCKNRRCGPLNVTLAAEACRRKEHMALKAVRIRQLEFLQPLAEDPRLDLRVIQLVR DPRAVLASRMVAFAGKYKTWKKWLDDEGQDGLREEEVQRLRGNCESIRLSAELGLRQPAW LRGRYMLVRYEDVARGPLQKAREMYRFAGIPLTPQVEDWIQKNTQAAHDGSGIYSTQKNS SEQFEKWRFSMPFKLAQVVQAACGPAMRLFGYKLARDAAALTNRSVSLLEERGTFWVT >gi568815588r:71997337_72313298|GENSCAN_predicted_CDS_1|1617_bp atggaagatgagaagcccacaccagccacactaggcccccaggagattgtggacgccgtc gggagcattattcggtggaggggtgcggggcagaagccagatgagacagctgggaggtgg caggcagctgagtgtccaaggctggcccgaggagcccccacggccccacctttccccatg gagaaaggactcactttgccccaggactgccgggactttgtgcacagcctgaagatgaga agcaaatacgcccttttcttggtttttgtggtgatagtttttgtcttcatcgaaaaggaa aataaaatcatatcaagggtctcagacaagctgaagcagattccccaagctctagcagat gccaacagcaccgacccagccctgatcttagctgagaacgcatctctcttgtccctgagc gagctcgattcagccttctcccagcttcagagccgtctccgcaacctcagcttgcagctg ggcgtggagccagccatggaggccgcaggggaggaagaggaagagcagagaaaggaggag gagccgcccagaccggccgtggcggggccccggcgccacgtgctgctcatggccaccacg cgcaccggctcctcgttcgtgggcgagttcttcaaccagcagggcaacatcttctacctc ttcgagccgctgtggcacatcgagcgcacagtgtccttcgagccggggggcgccaacgcc gcgggctcggccctggtgtaccgcgacgtgctcaagcagctcttcctgtgcgacctgtac gtgctggagcacttcatcacgccgctgcccgaggaccacctgactcagttcatgttccgc cggggctccagccgctccctgtgcgaggaccccgtctgtacgcccttcgtcaagaaggtc ttcgagaagtaccactgcaagaaccgccgctgcggccccctcaacgtgacgctggccgca gaggcctgccgccgcaaggagcacatggccctcaaggcggtgcgcatccggcagctggag ttcctgcagccgctggccgaggacccccgcctggacctgcgcgtcatccagctggtgcgc gacccccgggccgtgctggcctcgcgcatggtggccttcgccggcaagtataagacctgg aagaagtggctggacgacgagggccaggacggcctgagggaagaggaggtgcagcggctg cggggcaactgcgagagcatccgcctgtccgcggagctggggctgcggcagcccgcctgg ctgcggggccgctacatgctggtgcgctacgaggacgtggcacgcgggccgctgcagaag gcccgcgagatgtaccgcttcgccggcatccccctgaccccgcaggtggaagactggatc caaaagaacacgcaggcggcccacgacggcagcggcatctactccacgcagaagaactcc tcggagcagttcgagaagtggcgcttcagcatgcccttcaagctggcccaggtggtgcag gccgcctgcggccctgccatgcgcctcttcggctacaaactggcgcgggacgccgccgcc ctcaccaaccgctcagtcagcctgctggaggagaggggcaccttctgggtcacgtag >gi568815588r:71997337_72313298|GENSCAN_predicted_peptide_2|173_aa MPGRPGAAPAGHFPGRTVLASSLQSSSRPPLAIKAPFSGGCLCWALPTSTGSLQTLLAGT RTQSLPAAEGPPLWTHEPAGWCTLALGSQLLPPPAHPGDGETEAHGEVLTQVQKQVLEES PCLTFFCTSPVPTLPSTFNHDLLMKTQDFPPWALSQCGRHARSGYSPGQLNLQ >gi568815588r:71997337_72313298|GENSCAN_predicted_CDS_2|522_bp atgccaggacggcctggagcagcacctgctggccacttccctggccgcacggtgctggca tcctccctgcagtccagcagccgtcctccactggccataaaggcccctttctctggtggc tgtttgtgttgggccctacccacctccacgggcagcctccagacactgctggcaggcacg aggacacagtccctgcccgctgctgaagggccgcctttgtggacacatgaaccagccggc tggtgcaccctggcccttggaagccagctgcttccccctcctgctcatccaggagatggg gaaactgaggcccatggagaggtgctgacccaagttcagaagcaggtcctggaagaatct ccatgtcttaccttcttctgtacttccccagtacccacgctcccatccacattcaaccat gacttactgatgaaaacacaggattttccaccttgggcattatctcagtgcgggaggcat gccaggagtggctacagccctgggcagctgaacttgcagtga >gi568815588r:71997337_72313298|GENSCAN_predicted_peptide_3|421_aa MRAPGCGRLVLPLLLLAAAALAEGDAKGLKEGETPGNFMEDEQWLSSISQYSGKIKHWNR FRDDDYIKSWEDNQQGDEALDTTKDPCQKVKCSRHKVCIAQGYQRAMCISRKKLEHRIKQ PTVKLHGNKDSICKPCHMAQLASVCGSDGHTYSSVCKLEQQACLSSKQLAVRCEGPCPCP TEQAATSTADGKPETCTGQDLADLGDRLRDWFQLLHENSKQNGSASSVAGPASGLDKSLG ASCKDSIGWMFSKLDTSADLFLDQTELAAINLDKYEVCIRPFFNSCDTYKDGRVSTAEWC FCFWREKPPCLAELERIQIQEAAKKKPGIFIPSCDEDGYYRKMQCDQSSGDCWCVDQLGL ELTGTRTHGSPDCDDIVGFSGDFGSGVGWEDEEEKETEEAGEEAEEEEGEAGEADDGGYI W >gi568815588r:71997337_72313298|GENSCAN_predicted_CDS_3|1266_bp atgcgcgccccgggctgcgggcggctggtgctgccgctgctgctcctggccgcggcagcc ctggccgaaggcgacgccaaggggctcaaggagggcgagacccccggcaatttcatggag gacgagcaatggctgtcgtccatctcgcagtacagcggcaagatcaagcactggaaccgc ttccgagacgatgactatatcaagagctgggaggacaatcagcaaggagatgaagccctg gataccaccaaggacccctgccagaaggtgaagtgcagccgccacaaggtgtgcattgcc cagggctaccagcgggccatgtgcatcagtcgcaagaagctggagcacaggatcaagcag ccgaccgtgaaactccatggaaacaaagactccatctgcaagccctgccacatggcccag cttgcctctgtctgcggctcagatggccacacttacagctctgtgtgtaagctggagcaa caggcgtgcctgagcagcaagcagctggcggtgcgatgcgagggcccctgcccctgcccc acggagcaggctgccacctccaccgccgatggcaaaccagagacttgcaccggtcaggac ctggctgacctgggagatcggctgcgggactggttccagctccttcatgagaactccaag cagaatggctcagccagcagtgtagccggcccggccagcgggctggacaagagcctgggg gccagctgcaaggactccattggctggatgttctccaagctggacaccagtgctgacctc ttcctggaccagacggagctggccgccatcaacctggacaagtacgaggtctgcatccgt cccttcttcaactcctgtgacacctacaaggatggccgggtctctactgctgagtggtgc ttctgcttctggagggagaagcccccctgcctggcagagctggagcgcatccagatccag gaggccgccaagaagaagccaggcatcttcatcccgagctgcgacgaggatggctactac cggaagatgcagtgtgaccagagcagcggtgactgctggtgtgtggaccagctgggcctg gagctgactggcacgcgcacgcatgggagccccgactgcgatgacatcgtgggcttctcg ggggactttggaagcggtgtcggctgggaggatgaggaggagaaggagacggaggaagca ggcgaggaggccgaggaggaggagggcgaggcaggcgaggctgacgacgggggctacatc tggtag >gi568815588r:71997337_72313298|GENSCAN_predicted_peptide_4|441_aa MRSQLTAASTSQAQVKRAFRGGARSETLKPDPVTELPLTPPAEEDDDEAAISSLAEFLLH YRERKTAIGKFMEGIIGISFGESVMEVLRPQLIRIDGRNYRKNPVQEQTYQHEEDEEDFY QGSMECADEPCDAYEVEQTPQGFRSTLRAPSLLYKHIVGKRGDTRKKIEMETKTSISIPK PGQDGEIVITGQHRNGVISARTRIDVLLDTFRRKQPFTHFLAFFLNEVEVQEGFLRFQEE VLAKCSMDHGVDSSIFQNPKKLHLTIGMLVLLSEEEIQQTCEMLQQCKEEFINDISGGKP LEVEMAGIEYMNDDPGMVDVLYAKVHMKDGSNRLQELVDRVLERFQASGLIVKEWNSVKL HATVMNTLFRKDPNAEGRYNLYTAEGKYIFKERESFDGRNILKLFENFYFGSLKLNSIHI SQRFTVDSFGNYASCGQIDFS >gi568815588r:71997337_72313298|GENSCAN_predicted_CDS_4|1326_bp atgcgatcacagctcactgcagcctccacctcccaggctcaggtgaaaagggctttccgt ggtggggcaaggtctgagacactgaaaccagatccagtgacagaacttccactgacccca ccagctgaggaggatgatgatgaagcagccatttcctctctagcagagttcttacttcac tacagagaaagaaaaacagcaattggcaagtttatggaaggcataattggaatatcattt ggagaaagtgtcatggaagttctgcgtccacagcttataagaattgatggccggaattac aggaagaatccagtccaagaacagacctatcaacatgaagaagatgaagaggacttctat caaggctccatggagtgtgctgatgagccctgtgatgcctacgaggtggagcagacccca caaggattccggtctactttgagggcccccagcttgctctataagcatatagttggaaag agaggggacactaggaagaaaatagaaatggagaccaaaacttctattagcattcctaaa cctggacaagacggggaaattgtaatcactggccagcatcgaaatggtgtaatttcagcc cgaacacggattgatgttcttttggacacttttcgaagaaagcagcccttcactcacttc cttgcctttttcctcaatgaagttgaggttcaggaaggattcctgagattccaggaggaa gtactggcgaagtgctccatggatcatggggttgacagcagcattttccagaatcctaaa aagcttcatctaactattgggatgttggtgcttttgagtgaggaagagatccagcagaca tgtgagatgctacagcagtgtaaagaggaattcattaatgatatttctgggggtaaaccc ctagaagtggagatggcagggatagaatacatgaatgatgatcctggcatggtggatgtt ctttacgccaaagtccatatgaaagatggctccaacaggctacaagaattagttgatcga gtgctggaacgttttcaggcatctggactaatagtgaaagagtggaatagtgtgaaactg catgctacagttatgaatacactattcaggaaagaccccaatgctgaaggcaggtacaat ctctacacagcggaaggcaaatatatcttcaaggaaagagaatcatttgatggccgaaat attttaaagttgtttgagaacttctactttggctccctaaagctgaattcaattcacatc tctcagaggttcaccgtagacagctttggaaactacgcttcctgtggacaaattgacttc tcctga >gi568815588r:71997337_72313298|GENSCAN_predicted_peptide_5|79_aa MDAVFNLKRFNLMLHPYGSERFLCESVFSYQVASTLKQVKHDQQVARMEKLAGLVEELEA DEWRFKPIEQLLGFTPSSG >gi568815588r:71997337_72313298|GENSCAN_predicted_CDS_5|240_bp atggatgctgtttttaatctcaaaaggttcaacttgatgttacatccttatggctctgag agattcctctgcgaatctgtttttagctatcaagtggcatccacgcttaaacaggtgaaa catgatcagcaagttgctcggatggaaaaactagctggtttggtagaagagctggaggct gacgagtggcggtttaagcccatcgagcagctgctgggattcaccccctcttcaggttga >gi568815588r:71997337_72313298|GENSCAN_predicted_peptide_6|357_aa MEAKRTWEWECRGDDKEPFLEKASLLLNMMMRDSAPHTKGKMPKKRSTHREQHVQSQELQ GSHQKGANPAPCPHPAAGAPNAELSISVAWLPSSRAVGGASFLPRRLLRSGTLSSSANAL ASVLTMPSLWDRFSSSSTSSSPSSLPRTPTPDRPPRSAWGSATREEGFDRSTSLESSDCE SLDSSNSGFGPEEDTAYLDGVSLPDFELLSDPEDEHLCANLMQLLQESLAQARLGSRRPA RLLMPSQLVSQVGKELLRLAYSEPCGLRGALLDVCVEQGKSCHSVGQLALDPSLVPTFQL TLVLRLDSRLWPKIQGLFSSANSPFLPGFSQSLTLSTGFRVIKKKLYSSEQLLIEEC >gi568815588r:71997337_72313298|GENSCAN_predicted_CDS_6|1074_bp atggaagccaagaggacttgggagtgggaatgccgaggagatgacaaggagcctttccta gagaaggcgagcctgctcttgaacatgatgatgagagacagcgcgccacacacaaaggga aagatgccaaagaagcgttccacgcacagggaacagcatgtgcaaagtcaggaactccag ggctctcaccagaaaggcgccaaccctgcgccctgcccccacccagccgccggggcccca aatgctgagctcagcatctctgtggcctggctgccctcctcccgggcagtgggaggagcc tccttcctcccgaggcggcttctacgctccggcactctgagttcatcagcaaacgccctg gcgtctgtcctcaccatgcctagcctttgggaccgcttctcgtcgtcgtccacctcctct tcgccctcgtccttgccccgaactcccaccccagatcggccgccgcgctcagcctggggg tcggcgacccgggaggaggggtttgaccgctccacgagcctggagagctcggactgcgag tccctggacagcagcaacagtggcttcgggccggaggaagacacggcttacctggatggg gtgtcgttgcccgacttcgagctgctcagtgaccctgaggatgaacacttgtgtgccaac ctgatgcagctgctgcaggagagcctggcccaggcgcggctgggctctcgacgccctgcg cgcctgctgatgcctagccagttggtaagccaggtgggcaaagaactactgcgcctggcc tacagcgagccgtgcggcctgcggggggcgctgctggacgtctgcgtggagcagggcaag agctgccacagcgtgggccagctggcactcgaccccagcctggtgcccaccttccagctg accctcgtgctgcgcctggactcacgactctggcccaagatccaggggctgtttagctcc gccaactctcccttcctccctggcttcagccagtccctgacgctgagcactggcttccga gtcatcaagaagaagctgtacagctcggaacagctgctcattgaggagtgttga >gi568815588r:71997337_72313298|GENSCAN_predicted_peptide_7|176_aa MGSFMETSGAADAACGYVELAALNVESHSKYLNRQVLLHPLLQGDLVNQQVGEGDLLLPE AVGAGARGHQGTDSRLSSYQPIGAGEAAGVGPVNRFANQHRVIPALPLVSVWSHDPNCPV TVKPGTCVTIPKDRASFYSTGTWDDVPLQLQQVLMEPKHFPRQQRQSTNKYKDSGK >gi568815588r:71997337_72313298|GENSCAN_predicted_CDS_7|531_bp atgggcagcttcatggaaacttctggggcagcagatgcagcctgtggctatgtggagctg gcggctctgaatgtagagtcccactccaaatacctcaacaggcaagtcctgctgcaccca ctcctacaaggtgacttggttaatcagcaggttggggagggggacctgctgctcccagag gccgtaggtgctggagcacgaggacaccagggaactgactcaaggctctcctcttaccag cccattggagcaggtgaggctgcaggtgtggggcctgtgaacagatttgccaatcaacac agggttattcctgccttgcccctggtatctgtgtggtcacatgaccccaactgtccagtt acagtgaaacccgggacatgtgtcacaattcccaaggacagagcctctttctactcaact ggaacttgggatgatgtgcccctgcagctgcagcaggtgctcatggagcccaaacacttc ccaagacagcaaaggcaaagtacaaacaagtacaaagactctgggaaatga >gi568815588r:71997337_72313298|GENSCAN_predicted_peptide_8|171_aa MAGCRSRALPRGEAAKARQEIERSGPALLGGPAVLEDRVHPPQPLARVLSPPLPEAAPNA GPAEPTPTGNSRWPTSTARRPGCHRRLSLHTSLQAEGAGSGLGQPRKGLPQCSGRLKGSS AAKVGAQAEEVPRASEGCEDCQHAVTSQNDSQCRASFSQRYTCGNNPCSYM >gi568815588r:71997337_72313298|GENSCAN_predicted_CDS_8|516_bp atggcgggatgcaggtcccgagccctgccccgcggggaggcagctaaggcccggcaagaa atcgagcgcagtgggccggcactgctgggtgggccggcagtgctggaggaccgagtacac cctccgcagccgctggcccgggtgctaagccccccattgcccgaggccgctcctaatgcg gggcccgccgagcccacgcccaccgggaactcgcgctggcccacaagcaccgcgcgcaga cccggttgccaccggcgcctctccctccacacctccctgcaagctgagggagccggctct ggccttggccagcccagaaaggggctcccacagtgcagcggcaggctgaagggctcaagt gccgccaaagtgggagcccaggcagaggaggtgccgagagcgagcgagggctgtgaggac tgccagcatgctgtcacctctcagaatgactctcagtgtcgggcctcattctctcaacgt tacacctgtgggaataatccatgcagttatatgtag