GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:27:59 Sequence gi568815593r:76977177_77177878 : 200702 bp : 41.08% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 21400 21567 168 0 0 59 92 65 0.373 3.58 1.02 Intr + 24597 24632 36 2 0 89 91 76 0.662 5.54 1.03 Intr + 32663 32795 133 1 1 82 86 113 0.833 9.80 1.04 Intr + 53323 53431 109 2 1 107 89 14 0.113 1.82 1.05 Intr + 53512 53800 289 1 1 51 85 414 0.642 33.73 1.06 Intr + 57242 57344 103 1 1 98 68 47 0.766 2.53 1.07 Intr + 58365 58567 203 1 2 86 90 161 0.868 14.18 1.08 Intr + 59380 59544 165 0 0 54 84 175 0.999 12.94 1.09 Intr + 62355 62543 189 2 0 33 90 145 0.991 8.06 1.10 Intr + 69171 69501 331 2 1 111 113 321 0.996 31.18 1.11 Intr + 70985 71096 112 0 1 119 93 -3 0.998 1.82 1.12 Intr + 71760 71811 52 0 1 55 121 103 0.996 8.29 1.13 Intr + 75530 75631 102 1 0 35 109 84 0.951 4.65 1.14 Intr + 76789 76954 166 0 1 80 76 170 0.999 13.61 1.15 Intr + 78338 78420 83 0 2 91 94 79 0.999 7.24 1.16 Intr + 82440 82567 128 2 2 39 88 164 0.996 10.06 1.17 Intr + 84527 84626 100 2 1 37 111 98 0.994 6.19 1.18 Term + 85876 86076 201 0 0 74 49 206 0.908 11.71 1.19 PlyA + 86246 86251 6 1.05 2.03 PlyA - 86505 86500 6 1.05 2.02 Term - 99972 99663 310 2 1 71 49 265 0.995 14.45 2.01 Init - 100297 100002 296 1 2 52 17 466 0.818 30.61 2.00 Prom - 106422 106383 40 -5.75 3.02 PlyA - 106632 106627 6 1.05 3.01 Sngl - 107249 106914 336 2 0 63 53 156 0.490 5.38 3.00 Prom - 109137 109098 40 -5.35 4.03 PlyA - 109165 109160 6 1.05 4.02 Term - 110081 109404 678 2 0 120 34 138 0.252 4.30 4.01 Init - 118673 118494 180 2 0 67 78 174 0.960 13.53 4.00 Prom - 121023 120984 40 -4.05 5.00 Prom + 123732 123771 40 -6.15 5.01 Init + 132917 132924 8 1 2 103 100 0 0.034 3.25 5.02 Intr + 135754 135919 166 1 1 45 86 98 0.033 4.34 5.03 Intr + 140420 140548 129 1 0 67 63 60 0.023 1.37 5.04 Term + 141144 141260 117 2 0 97 51 116 0.173 6.36 5.05 PlyA + 142675 142680 6 1.05 6.00 Prom + 152896 152935 40 -6.05 6.01 Init + 157716 157767 52 2 1 105 71 100 0.874 11.37 6.02 Term + 160400 160521 122 0 2 60 48 122 0.791 3.06 6.03 PlyA + 162223 162228 6 1.05 7.05 PlyA - 162817 162812 6 1.05 7.04 Term - 167299 166735 565 0 1 105 43 150 0.224 5.08 7.03 Intr - 177394 177296 99 0 0 29 82 116 0.196 3.41 7.02 Intr - 192051 191977 75 2 0 51 94 62 0.016 0.81 7.01 Init - 198279 198203 77 0 2 35 97 53 0.103 1.51 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:76977177_77177878|GENSCAN_predicted_peptide_1|889_aa MGTHGHKDSNDRYWGLQEEGGKERGRVGNLTTVYHAQYLGDGITRTPDLSIIQYTQVVDV FLIDESSLAKVKVQCALIHIGPPSVTIGVPLSALGHHLPTCYRKTCSVKTVKAGEGSFQE SFPVCRPRTSGRGHPRSPCSLLAAPFGYNGLWVRRRPFRPERSPSAATSSLAPELMASEA PSPPRSPPPPTSPEPELAQLRRKVEKLERELRSCKRQVREIEKLLHHTERLYQNAESNNQ ELRTQVEELSKILQRGRNEDNKKSDVEVQTENHAPWSISDYFYQTYYNDVSLPNKVTELS DQQDQAIETSILNSKDHLQVENDAYPGTDRTENVKYRQVDHFASNSQEPASALATEDTSL EGSSLAESLRAAAEAAVSQTGFSYDENTGLYFDHSTGFYYDSENQLYYDPSTGIYYYCDV ESGRYQFHSRVDLQPYPTSSTKQSKDKKLKKKRKDPDSSATNEEKDLNSEDQKAFSVEHT SCNEEENFANMKKKAKIGIHHKNSPPKVTVPTSGNTIESPLHENISNSTSFKDEKIMETD SEPEEGEITDSQTEDSYDEAITSEGNVTAEDSEDEDEDKIWPPCIRVIVIRSPVLQIGSL FIITAVNPATIGREKDMEHTLRIPEVGVSKFHAEIYFDHDLQSYVLVDQGSQNGTIVNGK QILQPKTKCDPYVLEHGDEVKIGETVLSFHIHPGSDTCDGCEPGQVRAHLRLDKKDESFV GPTLSKEEKELERRKELKKIRVKYGLQNTEYEDEKTLKNPKYKDRAGKRREQVGSEGTFQ RDDAPASVHSEITDSNKGRKMLEKMGWKKGEGLGKDGGGMKTPIQLQLRRTHAGLGTGKP SSFEDVHLLQNKNKKNWDKARERFTENFPETKPQKDDPGTMPWVKGTLE >gi568815593r:76977177_77177878|GENSCAN_predicted_CDS_1|2670_bp atgggtactcatggacataaagatagcaacgatagatactggggactacaagaggaggga gggaaggagaggggaagggttggaaatctaactactgtgtaccatgctcaatacctgggt gatggaatcactcgtaccccagacctcagcattatacaatatacccaggttgttgatgtt tttctcatcgacgagagctctttggctaaggtgaaggtgcaatgtgcccttatccacatt gggccaccatcagttacaatcggtgtgcccttatccgcactgggccaccatcttcctact tgttacagaaaaacctgctctgtaaagacagttaaggctggagaaggtagtttccaggaa agttttccggtttgcaggccgcgcacatcgggcaggggccatcctcggtccccttgctcg ttgctcgcagccccgttcggctacaatggtctctgggtccgccggcgtccgtttcggcct gaacgcagcccctccgcggcgacgagcagtctcgcgccggagctcatggcctcggaggcg ccgtccccgccgcggtcgccgccgccgcccacctcccccgagcctgagctggcccagcta aggcggaaggtggagaagttggaacgtgaactgcggagctgcaagcggcaggtgcgggag atcgagaagctgctgcatcacacagaacggctgtaccagaacgcagaaagcaacaaccag gagctccgcacgcaggtggaagaactcagtaaaatactccaacgtgggagaaatgaagat aataaaaagtctgatgtagaagtacaaacagagaaccatgctccttggtcaatctcagat tatttttatcagacgtactacaatgacgttagtcttccaaataaagtgactgaactgtca gatcaacaagatcaagctatcgaaacttctattttgaattctaaagaccatttacaagta gaaaatgatgcttaccctggtaccgatagaacagaaaatgttaaatatagacaagtggac cattttgcctcaaattcacaggagccagcatctgcattagcaacagaagatacctcctta gaaggctcatcattagctgaaagtttgagagctgcagcagaagcggctgtatcacagact ggatttagttatgatgaaaatactggactgtattttgaccacagcactggtttctattat gattctgaaaatcaactctattatgatccttccactggaatttattactattgtgatgtg gaaagtggtcgttatcagtttcattctcgagtagatttgcaaccttatccgacttctagc acaaaacaaagtaaagataaaaaattgaagaagaaaagaaaagatccagattcttctgca acaaatgaggaaaaggatttgaactcagaggatcaaaaagccttcagtgttgaacataca agctgcaatgaggaagaaaatttcgcaaatatgaaaaagaaggccaaaataggcattcat cacaaaaatagtccccccaaagtcactgttccaactagtggaaatactatagagtctcct cttcatgaaaacatctctaattcaacatcatttaaagatgagaaaatcatggagactgat agtgaaccagaggaaggtgaaattacagactctcagactgaggatagttatgacgaagcc attaccagtgaaggcaatgtaactgcagaagatagtgaggatgaagatgaagacaaaatt tggcccccatgtattagagtaattgtcattagatcacctgtgttgcagataggatcactc tttatcattactgctgtaaaccctgctacaattggaagagaaaaggatatggaacatact ctccgaatccctgaagttggtgtcagtaagtttcatgcagaaatttattttgaccatgac ttacaaagttatgtccttgtggatcaaggcagtcaaaatggcacaattgttaatggaaaa cagattcttcagccgaaaactaaatgtgacccttacgtacttgagcatggagatgaagtc aaaattggagaaactgtcttatcctttcacattcatcctggcagtgatacctgtgatggc tgtgaaccagggcaggttagagcccaccttcgccttgataagaaagatgaatcttttgtt ggtccaacactaagtaaggaggaaaaagagttggaaagaagaaaagaattaaagaaaata cgagtaaaatatggtttacagaatacagaatacgaagatgaaaagacattgaagaatcca aaatataaagatagagctggaaaacgtagggagcaggttggaagtgaaggaactttccaa agagatgatgctcctgcatctgttcattctgaaattactgatagcaacaaaggtcggaag atgttggagaagatgggttggaagaaaggagagggcctggggaaggatggtggaggaatg aaaacgccgatccagcttcagcttcggcgaacacatgcaggcttggggacaggcaaacca tcctcatttgaagatgttcaccttctccaaaacaagaacaaaaaaaactgggacaaagca cgagagcggtttactgaaaacttcccagaaactaagcctcaaaaagatgacccagggacc atgccttgggtaaaagggactttagagtga >gi568815593r:76977177_77177878|GENSCAN_predicted_peptide_2|201_aa MGALAVRGSRRERELERRELAVEQGERALERRRRALQEEERAAAQARRELQAEREALQAR LRDVSRREGALGWAPAAPPPLKDDPEGDRDGCVITKVLLTALLRPMPMPSDPAGTEAVTE ARPAAATHSEAPANCRIFHTLRLQLEKHFEARAEPKLTSMGSPGVPGRPFVGMHARNWGG TRDTRELQEPREPRPPSAMAT >gi568815593r:76977177_77177878|GENSCAN_predicted_CDS_2|606_bp atgggcgcgctggccgtgcgcggcagccggcgggagcgggagctggagcggcgcgagctg gccgtggagcagggcgagcgcgccctggagcggaggcggagggcgctgcaggaggaagag cgcgccgcggcccaggcgcgccgggaactgcaggccgagcgggaggcgctgcaggcgcgg ctgcgggatgtgagccgccgtgagggcgccctgggctgggcccccgctgcgccgccgccg ctcaaggacgaccccgagggtgacagggacggctgcgtcatcacaaaggtcctcctgaca gcgcttctccgtccaatgccaatgccttcagaccccgctgggaccgaagccgtaaccgaa gccaggcccgcagccgctactcactcggaagctccagctaactgtaggatcttccacacc ctaaggcttcagcttgagaagcacttcgaagccagagcagaaccaaaactcacttccatg gggtcaccgggggtgcctgggcggcctttcgtgggcatgcacgcaaggaattggggtggc acacgggacacccgagagctccaggagccccgtgaacccagaccacccagtgccatggcc acttag >gi568815593r:76977177_77177878|GENSCAN_predicted_peptide_3|111_aa MRVKRKGLLVYKTIRSREIYSLAQEQYGRNRARDSIISHQFHPTTRGNYGSRIQDEIWVG TQPNHITQVSELLLEASLSQPDRTLPEGKDYVTFLLECLAESTHLTLAGAQ >gi568815593r:76977177_77177878|GENSCAN_predicted_CDS_3|336_bp atgagggtcaagcgaaaagggcttctcgtttataaaaccatcagatctcgtgagatttat tcactagcacaagaacagtatgggagaaaccgcgctcgtgattcaattatctctcaccag ttccatcccacaacacgtgggaattatgggagtaggattcaggatgagatttgggtgggg acacagccaaaccatatcacacaggtatcagaattattgttggaagcatctctttcccaa ccagaccgtacgcttcctgagggcaaagactatgtgacgttcctcttggaatgcttggca gaaagcacgcacctgaccttagcaggtgcacaatga >gi568815593r:76977177_77177878|GENSCAN_predicted_peptide_4|285_aa MKSAKARINLKFVVKLGEKNGKIIDTLQKVYGDNAPKKLDVYKWISVRRDETMLKMKPIT TCECSAVSLRGTRGGFPSGTKCQRGAWRGIYHLGLPTRRRLRPRFPGKGVSVGFQTRWRG GLGRAAGSAGEQRPRTPLPASLAPARGVPFAGLPFSGPAVATVRKQVGPGFCGKAPLTVF QCLILAPSLHSSFDKLVRNVEIRHHSPALPLLKLSDYSLFTGQVTGRPAVGSFCNWDLLD VAHFRREKANASQKLTPESPPLPHPSWPPNKRIFRSFRSPLEQFL >gi568815593r:76977177_77177878|GENSCAN_predicted_CDS_4|858_bp atgaagtcggctaaagcaagaatcaacctcaaatttgtggtgaagcttggggagaagaat ggtaaaatcatcgatactttacagaaagtttatggggacaatgccccaaagaaattagat gtttacaaatggataagtgtaagaagggatgagacaatgttgaagatgaagcccatcacg acgtgtgaatgcagcgctgtgtctttaaggggcacgcgcggaggttttccgtccgggaca aaatgtcagcgaggcgcctggagggggatctaccatctcggactcccgacccgccgccgg ctccggccgcgtttcccgggtaaaggggtgagtgtgggcttccagactcggtggcgaggc gggctgggccgggcagcggggagcgcaggtgagcagcgaccgcggacgcccctgccggcc tccctggcacccgcgcggggcgtgcccttcgctgggctgcccttctccgggccggctgtc gcgactgtgagaaagcaggtgggcccgggattctgtggcaaggccccgctcaccgtcttc cagtgcctcattcttgctccttccctgcactcctctttcgacaagcttgttcggaatgtg gaaattcgtcaccactctcctgctctccctttattaaagttgtctgattacagtcttttc actgggcaagtgactggacgccctgcagtgggttctttctgtaactgggacttgcttgat gtcgcccactttcggagggagaaagcgaatgcaagccaaaagcttacaccagaatcccca cccctcccccacccctcatggcccccaaacaaaagaattttccgctcctttcggagtccg ctggagcagtttctgtga >gi568815593r:76977177_77177878|GENSCAN_predicted_peptide_5|139_aa MPQIKYLGIQLKKDVTDFFKENYKPLLNEIKEDTNKWKNIQCSWIGRINIMKMAILPKET WVHQFRRSLFTIYQTARLVMIWKMFIQKPAAKRGGQSQRQEENTDPGAEHEDRAGRRRKW KEGSFSLSLILLGLDAGES >gi568815593r:76977177_77177878|GENSCAN_predicted_CDS_5|420_bp atgccccaaataaaatacctaggaatccaacttaaaaaggatgtgacggacttcttcaag gagaactacaaaccactgctcaatgaaataaaagaggacacaaacaaatggaagaatatt caatgctcatggataggaagaatcaatatcatgaaaatggccatactgcccaaggaaacc tgggtccaccagttccggagatccttgtttactatataccagacagcgaggctggtgatg atctggaagatgtttattcagaagccagcagccaaaaggggtggacagagccagaggcag gaagaaaacacagatcctggtgccgaacatgaggacagggcaggccgaaggagaaagtgg aaggaggggagcttctctctcagcctcatccttctgggcctggatgctggagagtcatga >gi568815593r:76977177_77177878|GENSCAN_predicted_peptide_6|57_aa MGNWIYGGKEMDCGSPGVSSRKVRSTSSGIPVSASGPLCTLQKKAPYGNIKSHLCSS >gi568815593r:76977177_77177878|GENSCAN_predicted_CDS_6|174_bp atggggaactggatctatggaggaaaagaaatggactgtggcagtcctggagtgtcttca cgcaaagtaagaagcacaagcagtggaattccagtgtcagcaagtggccctctgtgtacc ctgcagaaaaaggctccctacggaaacataaagtcacacctttgctccagctaa >gi568815593r:76977177_77177878|GENSCAN_predicted_peptide_7|271_aa MQRSGENRIPGKGIANAKSLRQEKAWSDCVGATGNTLFPKASKQNQTQPPSIYAGDVGRP NAEDLNLVPSFMTDSPDGPLHDLSGHRPYDSYSLVLYLPFGATQFSLLFHLTALQIFENF ENSFLSPLQLIFFRPTHVVHSTAPSLTQFPDSSVPIAPLWRHSNFSTAPGDILTAKTWTQ CLRCGPTAPQSSGADTTLLLDTSLVNPCSFEQIYKIMDQRLRFLMSGFFSYEATSRLLYP PCSQETFSVLGPNLVSQILMYANNFKQTRAV >gi568815593r:76977177_77177878|GENSCAN_predicted_CDS_7|816_bp atgcaaagatctggggagaataggattccagggaagggaattgcaaatgcaaaatcctta aggcaggaaaaagcttggtctgactgtgttggtgctacgggaaacactttgtttccaaaa gcaagcaaacaaaatcaaacacaaccaccaagtatatatgcaggggatgtgggcaggcca aatgcagaggaccttaacctggttcccagtttcatgaccgattctcctgacgggcctctg catgatcttagcggacaccgtccctatgactcgtactcattggtcctgtatttaccgttt ggagccacgcaattttcactgctcttccacctgacagcacttcaaatctttgaaaacttt gaaaacagcttcttgtccccgctgcaacttatcttcttcaggccaacacacgttgttcat tcaactgctccttctttgacccagtttccagattcttctgtccccattgctcccctctgg aggcactccaatttctcaacagcccctggagacatcctgactgccaagacctggacccag tgcctgagatgtggtcctactgccccacagagcagtggggctgacactaccctcctcctg gacacttctcttgtgaatccatgctcttttgagcagatttacaaaataatggaccaaagg ctgagatttctcatgtctggattcttctcgtatgaggcaacttcacgtttgctttaccct ccatgtagccaggaaacattttctgttcttggaccgaacttagtttcccaaatactgatg tatgcaaacaacttcaagcagaccagggctgtgtga