GENSCAN 1.0	Date run: 14-Jul-118	Time: 21:18:34

Sequence gi568815595r:3050868_3279687 : 228820 bp : 40.37% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2940   3108  169  2  1  102   52   185 0.990  14.90
 1.02 Intr +  15974  16106  133  0  1   48   -4   175 0.592   3.08
 1.03 Intr +  16740  16938  199  0  1   -5   58   187 0.468   4.83
 1.04 Term +  17212  17274   63  0  0   31   54   107 0.288  -1.49
 1.05 PlyA +  19538  19543    6                               1.05

 2.00 Prom +  28462  28501   40                              -0.25
 2.01 Init +  36978  37120  143  2  2   70  113   109 0.945  11.25
 2.02 Term +  41674  41776  103  1  1  -12   48   130 0.025  -4.23
 2.03 PlyA +  42254  42259    6                              -0.45

 3.09 PlyA -  42276  42271    6                               1.05
 3.08 Term -  42862  42761  102  1  0   97   54    76 0.782   2.40
 3.07 Intr -  44577  44432  146  1  2   75   91    85 0.964   6.58
 3.06 Intr -  47190  47003  188  2  2   86   91   121 0.998  10.61
 3.05 Intr -  47423  47270  154  0  1   56   78    76 0.931   1.61
 3.04 Intr -  50963  50825  139  2  1   62  103   164 0.992  14.42
 3.03 Intr -  51953  51808  146  0  2   34  105    59 0.586   1.28
 3.02 Intr -  54120  54036   85  0  1   82   61   103 0.711   5.47
 3.01 Init -  55173  55123   51  0  0   64   83     1 0.390  -1.44
 3.00 Prom -  58411  58372   40                              -6.25

 4.00 Prom +  60590  60629   40                              -5.35
 4.01 Init +  64304  64429  126  1  0   39   88   126 0.379   7.91
 4.02 Intr +  73942  73992   51  0  0  131   48    34 0.635   1.89
 4.03 Intr +  75461  75550   90  1  0  117  110    36 0.835   7.97
 4.04 Intr +  75875  76123  249  1  0   82  119    54 0.732   4.61
 4.05 Intr +  76458  76680  223  2  1   38   54   152 0.508   3.58
 4.06 Intr +  86393  86586  194  0  2   70   96    98 0.986   7.09
 4.07 Intr +  89643  89781  139  2  1   51   58   160 0.999   8.42
 4.08 Intr +  93717  93843  127  1  1   71  110    26 0.998   1.92
 4.09 Intr +  95563  95756  194  1  2   62   66   189 0.993  12.31
 4.10 Intr +  96583  96836  254  2  2   91   95   186 0.999  15.83
 4.11 Term +  97039  97287  249  0  0    7   41   213 0.977   3.22
 4.12 PlyA +  97375  97380    6                               1.05

 5.10 PlyA -  97642  97637    6                               1.05
 5.09 Term - 101128 101018  111  1  0   35   48    82 0.430  -3.52
 5.08 Intr - 103195 103093  103  0  1   52   87   132 0.744   8.76
 5.07 Intr - 122058 121909  150  2  0   -2   86   155 0.842   4.66
 5.06 Intr - 123394 123192  203  1  2   28   98   169 0.370   9.06
 5.05 Intr - 124402 124296  107  2  2   48   45   136 0.050   4.31
 5.04 Intr - 128726 128549  178  2  1   13   52    86 0.023  -3.93
 5.03 Intr - 128961 128754  208  2  1   51   51   238 0.035  14.56
 5.02 Intr - 130858 130758  101  1  2   75   54    80 0.024   1.29
 5.01 Init - 146817 146716  102  0  0   65  116    64 0.714   7.19
 5.00 Prom - 155779 155740   40                              -2.65

 6.00 Prom + 157383 157422   40                              -8.65
 6.01 Init + 158825 158873   49  1  1   71   58    76 0.231   2.06
 6.02 Intr + 161585 161671   87  0  0   77   86    63 0.231   4.02
 6.03 Intr + 193238 193359  122  0  2  129   75    43 0.111   6.39
 6.04 Intr + 197634 197739  106  2  1   56   81    13 0.004  -3.73
 6.05 Intr + 210254 210390  137  0  2   68   98    59 0.876   4.27
 6.06 Intr + 212758 212829   72  0  0   75   60   113 0.773   5.88
 6.07 Term + 213573 213686  114  2  0   86   53    83 0.896   2.19
 6.08 PlyA + 213923 213928    6                               1.05

 7.00 Prom + 222401 222440   40                              -6.05
 7.01 Init + 223179 223267   89  2  2   43   58   150 0.107   7.66
 7.02 Intr + 224381 224508  128  2  2   82   63    51 0.278   1.40
 7.03 Term + 227160 227284  125  1  2   74   42    47 0.066  -3.73
 7.04 PlyA + 227686 227691    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:3050868_3279687|GENSCAN_predicted_peptide_1|187_aa VLYRWNRQSSTSVIETNKTSVELSLPFDEDYIIEIKPFSDGGDGSSSEQIRIPKISTLPH AARSEFQMPRKEIGLKIPIGTDILEKHVCSDPVAVEALGEKLHGGSDDGYGFPKNDYELH RARREEGVPNPPRRPMGYFVLLLKHIYQGPDNGTSSGKCGKELHRAQTLLEQQFSECVLE TSSMSIT >gi568815595r:3050868_3279687|GENSCAN_predicted_CDS_1|564_bp gtcttgtacagatggaacagacaaagcagcacatctgtcattgaaacaaataaaacatcg gtggagctttctttgcctttcgatgaagattatataatagaaattaagccattcagcgac ggaggagatggcagcagcagtgaacaaattcgaattccaaagatatcaacactcccccat gctgcacgcagtgaattccaaatgccaaggaaagaaattgggctaaaaatccctattgga acggatatcttggaaaagcatgtttgttctgatcccgtggctgtggaggctctaggagaa aagcttcatggaggaagtgatgatggttacgggttcccaaagaatgactatgagttgcac agggcaagaagggaggaaggagttcccaatccaccaaggaggccaatgggctactttgtg ttgctcctgaaacacatataccaagggccagataacggtaccagctcagggaagtgtggc aaggagctccacagagcccagactctcctggagcagcagttctccgagtgtgtcctggag accagcagcatgagcatcacctga >gi568815595r:3050868_3279687|GENSCAN_predicted_peptide_2|81_aa MQNVPEGLGLPITFINHLDENGKSTLIKFTNDPKLSRGVNSRKDGFISHPSVERYLVTKI YEMINGQDPGVPIQAQLMFPD >gi568815595r:3050868_3279687|GENSCAN_predicted_CDS_2|246_bp atgcaaaatgtcccagaggggcttggacttcctattacgtttattaatcatcttgatgaa aatggaaagtctacattgattaaattcactaatgatcctaaactgagcagaggagtaaat tccagaaaggatggatttataagtcacccttcagtggaacgctatcttgtaaccaaaata tacgaaatgatcaacggtcaagatccaggtgttcctatccaggcccagctgatgtttcca gactga >gi568815595r:3050868_3279687|GENSCAN_predicted_peptide_3|336_aa MYNSHMCLVPNLWGEHVDMIIVAHVLLILLGATEILQADLLPDEKISLLPPVNFTIKVTG LAQVLLQWKPNPDQEQRNVNLEYQVKINAPKEDDYETRITESKCVTILHKGFSASVRTIL QNDHSLLASSWASAELHAPPGSPGTSIVNLTCTTNTTEDNYSRLRSYQVSLHCTWLVGTD APEDTQYFLYYRYGSWTEECQEYSKDTLGRNIACWFPRTFILSKGRDWLAVLVNGSSKHS AIRPFDQLFALHAIDQINPPLNVTAEIEGTRLSIQWEKPVSAFPIHCFDYEVKIHNTRNG YLQEKPVQLEMGVWAQSLLCREELNGICKDEEKLVW >gi568815595r:3050868_3279687|GENSCAN_predicted_CDS_3|1011_bp 
atgtataacagccacatgtgtctggtaccaaatctctggggggagcacgtggatatgatc atcgtggcgcatgtattactcatccttttgggggccactgagatactgcaagctgactta cttcctgatgaaaagatttcacttctcccacctgtcaatttcaccattaaagttactggt ttggctcaagttcttttacaatggaaaccaaatcctgatcaagagcaaaggaatgttaat ctagaatatcaagtgaaaataaacgctccaaaagaagatgactatgaaaccagaatcact gaaagcaaatgtgtaaccatcctccacaaaggcttttcagcaagtgtgcggaccatcctg cagaacgaccactcactactggccagcagctgggcttctgctgaacttcatgccccacca gggtctcctggaacctcaattgtgaatttaacttgcaccacaaacactacagaagacaat tattcacgtttaaggtcataccaagtttcccttcactgcacctggcttgttggcacagat gcccctgaggacacgcagtattttctctactataggtatggctcttggactgaagaatgc caagaatacagcaaagacacactggggagaaatatcgcatgctggtttcccaggactttt atcctcagcaaagggcgtgactggcttgcggtgcttgttaacggctccagcaagcactct gctatcaggccctttgatcagctgtttgcccttcacgccattgatcaaataaatcctcca ctgaatgtcacagcagagattgaaggaactcgtctctctatccaatgggagaaaccagtg tctgcttttccaatccattgctttgattatgaagtaaaaatacacaatacaaggaatgga tatttgcaggaaaagccagtacagctagagatgggggtgtgggcacagagcctgctctgc agagaggaattgaacggaatctgtaaggatgaagagaaactagtctggtga >gi568815595r:3050868_3279687|GENSCAN_predicted_peptide_4|631_aa MQMKTTLRYLFTPTPMAEMKRTITTVDEDVETLEPSHTVAGELTFHGHGFSISGFNQPQK ELMNWIKSKFSTLQMGSLRAKEANGYTNQGPPGLRFAPAGLGGAGKPAEVRLTVGRSDGR RKPFGPGPAPALALAPPRLPIRRQPMCVGDVTAFTSPEVRVAAVAAAATAGPSVFLSRGG GARKLSGSFRPTWPAGKRGSSTRFRGMEWFFSAGLHRGHSRAAAASLWAGPPYGENEDVI CRREGRELFVKENHELRIAGGAVRDLLNGVKPQDIDFATTATPTQMKEMFQSAGIRMINN RGEKHGTITARLHEENFEITTLRIDVTTDGRHAEVEFTTDWQKDAERRDLTINSMFLGFD GTLFDYFNGYEDLKNKKVRFVGHAKQRIQEDYLRILRYFRFYGRIVDKPGDHDPETLEAI AENAKGLAGISGERIWVELKKILVGNHVNHLIHLIYDLDVAPYIGLPANASLEEFDKVSK NVDGFSPKPVTLLASLFKVQDDVTKLDLRLKIAKEEKNLGLFIVKNRKDLIKATDSSDPL KPYQDFIIDSREPDATTRVCELLKYQGEHCLLKEMQQWSIPPFPVSGHDIRKVGISSGKE IGALLQQLREQWKKSGYQMEKDELLSYIKKT >gi568815595r:3050868_3279687|GENSCAN_predicted_CDS_4|1896_bp atgcaaatgaaaaccacactgagatacctcttcacacccactccaatggctgaaatgaaa aggacaataacaactgttgatgaagatgtggaaacactggaaccctcacatactgttgct ggggagttgaccttccatggccatgggttctccatcagtggattcaaccagccacagaaa 
gaattaatgaactggatcaaatctaagttctccacgttacagatgggaagcctccgggcc aaggaggccaatggttacacaaaccagggccccccggggctccgcttcgccccagcgggc ctcggcggggctgggaaaccggctgaggttcgcctcacggtggggaggagcgacgggagg cggaagcccttcggtccgggccccgcccccgcactcgccctcgccccgccgcgacttccc attcgccgccaacccatgtgcgttggtgacgtcaccgcgttcaccagcccggaagtgcgc gtggcggcggtggcggctgcggcaacagcggggccgagcgttttcctaagtcgaggcgga ggtgcgaggaaactgagtgggtctttccgccccacgtggcctgccgggaagcgcggctcc tccacccgcttccgtggaatggaatggtttttctctgcaggtttacaccggggtcactcg agagctgctgcggcctccttatgggcaggacccccgtatggggagaacgaggatgtgata tgtagaagggaaggccgcgaattatttgtcaaagagaatcacgaattaagaatagcagga ggagcagtgagggatttattaaatggagtaaagcctcaggatatagattttgccaccact gctacccctactcaaatgaaggagatgtttcagtcggctgggattcggatgataaacaac agaggagaaaagcacggaacaattactgccaggcttcatgaagaaaattttgagattact acactacggattgatgtcaccactgatggaagacatgctgaggtagaatttacaactgac tggcagaaagatgcggaacgcagagatctcactataaattctatgtttttaggttttgat ggcactttatttgactactttaatggttatgaagatttaaaaaataagaaagttagattt gttggacatgctaaacagagaatacaagaggattatcttagaattttaagatacttcagg ttttatgggagaattgtagacaaacctggtgaccatgatcctgagactttggaagcaatt gcagaaaatgcaaaaggcttggctggaatatcaggagaaaggatttgggtggaactgaaa aaaattcttgttggtaaccatgtaaatcatttgattcaccttatctatgatcttgatgtg gctccttatataggtttacctgctaatgcaagtttagaagaatttgacaaagtcagtaaa aatgttgatggtttttcaccaaagccagtgactcttttggcctcattattcaaagtacaa gatgatgtcacaaaattggatttgaggttgaagatcgcaaaagaggagaaaaaccttggc ttatttatagttaaaaataggaaagatttaattaaagcaacagatagttcagacccattg aaaccctatcaagacttcattatagattctagggaacctgatgcaactactcgtgtatgt gaactactgaagtaccaaggagagcactgtctcctaaaggaaatgcagcagtggtccatt cctccatttcctgtaagtggccatgacatcagaaaagtgggcatttcttcaggaaaagaa attggggctctattacaacagttgcgagaacagtggaaaaaaagtggttaccaaatggaa aaagatgaacttctgagttacataaagaagacctaa >gi568815595r:3050868_3279687|GENSCAN_predicted_peptide_5|420_aa MEEGSQRWDIAALKTEKKAMNQRIWAASRHWKGKSSLSIDTARKQEEMLGTTQYDIFFQD KVTKLNKWLGYAPPSAPLGPPGGAVAPVRGSRATRALPRSLRHRPVPASFAGKQTWPAKE ISRTLRTTWATTCRSCLSGVGRGSEPLAVQLGLGGGRGGAKPGEPAQQGRTGKLLHVGPR 
PAMESRRFLRCSVAPSPESEEEDEMEVEDQDSKEAKKPNIINFDTSLPTSHTYLGADMEE FHGRTLHDDDSCQVIPVLPQVMMILIPGQTLPLQLFHPQEVSMVRNLIQKDRTFAVLAYS NVQEREAQFGTTAEIYAYREEQDFGIEIVKVKAIGRQRFKVLELRTQSDGVAACLPIDDV LRIQLLKIGSAIQRLRCELDIMNKSCNSENQLSYDPVELYIWTIFCSNVKPENVYKAMID >gi568815595r:3050868_3279687|GENSCAN_predicted_CDS_5|1263_bp atggaagaagggtcacagagatgggatattgctgctttgaagactgagaagaaggccatg aaccaaagaatctgggcagcctctagacactggaagggcaagagctccttatccattgat actgcacgcaagcaagaggaaatgttaggaacaactcagtatgatatcttctttcaggat aaagttaccaaattaaacaaatggcttgggtacgcgccgccgtctgctcccctggggccg ccagggggcgctgtggccccggtgcgcggcagccgcgcgacacgggccctccctcggagt cttcggcaccgccctgtcccagcctcctttgcgggtaaacagacatggccggcgaaggag atcagcaggacgctgcgcacaacatgggcaaccacctgccgctcctgcctgtcgggagtg gggcgaggctcggagccgctagcggtgcagctgggcctgggaggcgggcgtgggggcgcc aagcctggcgagccagcccagcagggccgcaccggaaagctgcttcacgtcggccctcgg ccagcgatggagtcgcggcgctttcttcgctgctctgttgccccgagcccagagagtgag gaagaagatgaaatggaagttgaagaccaggatagtaaagaagccaaaaaaccaaacatc ataaattttgacaccagtctgccgacatcacatacatacctaggtgctgatatggaagaa tttcatggcaggactttgcacgatgacgacagctgtcaggtgattccagttcttccacaa gtgatgatgatcctgattcccggacagacattacctcttcagctttttcaccctcaagaa gtcagtatggtgcggaatttaattcagaaagatagaacctttgctgttcttgcatacagc aatgtacaggaaagggaagcacagtttggaacaacagcagagatatatgcctatcgagaa gaacaggattttggaattgagatagtgaaagtgaaagcaattggaagacaaaggttcaaa gtccttgagctaagaacacagtcagatggagtagctgcttgtcttcctattgatgatgta ttgagaattcagctccttaaaattggcagtgctatccagcgacttcgctgtgaattagac attatgaataaatcatgtaattctgaaaatcagctgtcttatgacccagtagaactttat atctggaccatattctgtagtaatgtaaaaccggagaatgtttacaaagcaatgatagac taa >gi568815595r:3050868_3279687|GENSCAN_predicted_peptide_6|228_aa MGFRRVGQAGLELLTTAGCLEELRMYGPKKKRIKREVIAGCYVEEGSPDAISGAIPRRIG QQCHGTGRDSHQHLHVWGPKEKLPNQAYAMISLESLARVGIATLKGMHISHFDTYGQVTF QEHLAAQTKLSAGPEATSSLGDDTVICLIMSVGVIGPGFKNKIEKHTVEQAEEEEEEERL ILLSQVEEVEESWKPNAIVKYSQSIKQICKLDLARNKEAGLEEIPGRH >gi568815595r:3050868_3279687|GENSCAN_predicted_CDS_6|687_bp 
atgggatttcgccgtgttggccaggctggtctcgaactcctgaccacagctggttgcttg gaagaactgagaatgtatgggcccaaaaagaaaaggattaagagagaagtaatagctggc tgctatgtggaagaaggttctccagatgcaatcagtggagccatccctagaaggataggt cagcaatgccatggcactgggagggacagtcatcaacaccttcatgtttggggtcccaag gagaagctacccaatcaggcttatgcaatgatttccttagaatcacttgctagagtagga attgctacattaaagggaatgcacatttcacattttgatacctatggtcaggttactttt caagagcacttggcagctcagacaaaattgtctgctggtcctgaggccacctcctccctg ggagatgatacagtgatatgcctgataatgtctgttggtgtcataggcccagggtttaaa aacaaaattgaaaaacacacagttgagcaggccgaggaggaggaggaagaggagaggttg atcttgctgtctcaggtagaagaggtggaggagagttggaagccaaatgcgattgttaaa tattcacaatcgataaaacaaatctgtaaacttgatttagctcggaataaggaagctgga ctggaggagatccctggaaggcactga >gi568815595r:3050868_3279687|GENSCAN_predicted_peptide_7|113_aa MWCYDDATCNGTVEDEDGENEGKQLKRDKVNLSIGVSLRLFKGGVLEVLHGEESRSTLVD QSHRYWEGGVSPNSYEDKRLYTSLTISSRENSLWAPRSLHKNSSVEFRPDSVN >gi568815595r:3050868_3279687|GENSCAN_predicted_CDS_7|342_bp atgtggtgttatgatgatgctacatgcaatgggacagtagaagacgaagacggagaaaat gaaggaaagcaactaaagcgggacaaagtgaatctgagcattggagtatctttaagactc tttaaaggtggagttttagaggttctacatggagaagaatcacgttctactctggtggat caatctcatagatattgggagggtggtgtctccccaaatagctatgaagacaaaaggctg tatacctccctcacaatttcctcaagggaaaattccttgtgggccccaagatctttacac aaaaacagttctgttgaatttcgccctgacagtgtaaattaa