GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:05:06 Sequence gi568815595r:155728213_155953997 : 225785 bp : 40.63% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3034 3210 177 0 0 30 36 176 0.001 5.79 1.02 Intr + 16457 16705 249 1 0 9 16 237 0.098 5.31 1.03 Intr + 16754 17142 389 1 2 38 53 213 0.091 5.46 1.04 Term + 26623 26827 205 1 1 42 47 167 0.225 3.86 1.05 PlyA + 28003 28008 6 1.05 2.13 PlyA - 28658 28653 6 1.05 2.12 Term - 35706 35305 402 0 0 39 47 245 0.991 9.67 2.11 Intr - 39457 39297 161 2 2 69 121 47 0.832 4.89 2.10 Intr - 47636 47489 148 2 1 42 100 94 0.200 4.89 2.09 Intr - 74379 74320 60 0 0 86 119 36 0.697 4.51 2.08 Intr - 78154 77927 228 1 0 51 51 305 0.624 20.24 2.07 Intr - 101691 101476 216 0 0 72 107 179 0.966 15.98 2.06 Intr - 105373 105256 118 0 1 73 115 23 0.995 3.05 2.05 Intr - 105829 105645 185 1 2 78 98 160 0.994 13.66 2.04 Intr - 114341 114220 122 0 2 18 116 64 0.410 1.49 2.03 Intr - 125010 124896 115 0 1 99 115 4 0.747 3.30 2.02 Intr - 125809 125132 678 1 0 72 60 416 0.333 28.08 2.01 Init - 126473 126144 330 2 0 50 -9 239 0.362 7.76 2.00 Prom - 126786 126747 40 -4.15 3.03 PlyA - 128514 128509 6 1.05 3.02 Term - 142969 141981 989 2 2 67 49 390 0.390 24.24 3.01 Init - 161481 161172 310 0 1 72 116 178 0.855 16.52 3.00 Prom - 162914 162875 40 -4.05 4.00 Prom + 163919 163958 40 -12.43 4.01 Init + 165036 165041 6 2 0 53 66 0 0.598 -4.73 4.02 Intr + 165306 165487 182 2 2 60 31 225 0.785 11.74 4.03 Intr + 169715 169829 115 2 1 62 91 100 0.704 7.23 4.04 Intr + 179557 179713 157 0 1 41 63 101 0.435 1.56 4.05 Intr + 182480 182673 194 0 2 54 99 172 0.973 13.19 4.06 Intr + 182902 183060 159 0 0 107 48 209 0.482 18.06 4.07 Intr + 184757 184841 85 1 1 64 94 72 0.610 3.87 4.08 Intr + 186207 186358 152 1 2 87 77 97 0.997 7.46 4.09 Intr + 187807 187980 174 0 0 45 83 192 0.999 13.61 4.10 Intr + 191021 191126 106 1 1 77 98 50 0.990 3.77 4.11 Intr + 193975 194090 116 2 2 83 85 49 0.886 3.35 4.12 Intr + 197047 197154 108 0 0 50 109 67 0.392 4.56 4.13 Intr + 201432 201731 300 2 0 25 86 151 0.443 4.61 4.14 Intr + 203553 203668 116 2 2 106 89 59 0.966 6.13 4.15 Intr + 206704 206834 131 1 2 99 119 107 0.990 14.32 4.16 Intr + 208126 208298 173 2 2 42 76 225 0.999 15.44 4.17 Term + 209379 209480 102 2 0 107 38 81 0.796 2.30 4.18 PlyA + 213836 213841 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 3034 3249 216 0 0 30 49 218 0.856 8.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:155728213_155953997|GENSCAN_predicted_peptide_1|339_aa VPDSSQQLASTCRLRLKAYSSDRSASVDPDSSPIPTNNKLQVYPHAPRQQANPHGPNQQG IPTHSTHSPKQACAGCGCRERNLGHLLTRSSDWRTSSLFPAHPLLSWAPSSPESTDLLSF ECSVAEGLLGVRGTRSHLRSAGGQKSPPRRALLPTSQQKPQTSGEEQGAAATTCFLIQIL TLTQEPPSLSAPQLPRKSGTGVAESGLERLLPPLARAGWVYGTLGLHVEPQGCTWNPWLT LSCGAPRLARARSTGNEVLLLPRKFQKFRNPGHPWASQPVSTGRGVQISKGGSAPMAIFE EAMEDKDDLTENRTEAPRWYDQETHTTRTRSYHYSSPAG >gi568815595r:155728213_155953997|GENSCAN_predicted_CDS_1|1020_bp gtaccagactcatcacagcagctggccagcacctgcagactcaggctcaaggcctattcc agtgacaggtctgcctctgtggatccagactcaagtccaattcccacaaacaacaaactc caagtctaccctcatgcacctaggcagcaggctaaccctcatggacccaatcaacagggg atccccacgcacagcactcacagtcctaaacaggcgtgtgccggctgtggctgccgagaa aggaacttgggccacctgctcactcgctcatcggactggcgtaccagttctctttttcct gcacacccccttctctcctgggccccttcttcccccgaaagcactgacctcctcagcttt gagtgttcagtagctgaaggactcctcggagtccgtggcacaaggagccacctccgaagt gctgggggccagaaaagcccccctagaagagcacttctccccactagccagcagaagccc cagacgagcggggaagagcagggcgccgccgccacaacctgctttctcattcaaatcctc acgctcacacaggagcccccctccctctcggccccgcagcttccgaggaaatcaggaact ggagttgcggagagcggactggagcgcctgctgcctcctctggcccgggcgggatgggtg tacggaaccctagggctgcacgtggaaccccaaggctgcacctggaacccttggcttaca ctctcctgcggggctccccgcttggcaagggccagaagcactgggaatgaagtactttta ctcccccgcaagtttcaaaagtttcgaaacccaggccatccctgggcttctcaacccgta tcaacaggaagaggtgtccagatctcaaaaggtggatctgctccaatggctatttttgaa gaggccatggaggataaggatgatcttacagagaacagaactgaggcacccagatggtat gaccaagaaacgcacacaactaggaccaggagctatcattattcctctccagcaggataa >gi568815595r:155728213_155953997|GENSCAN_predicted_peptide_2|920_aa MTEAHAFQTATVAVTGPEFGRRPAPAARCAGVNPPRRFRQPRLQVPTKHCFRTGPRERVP DWLALEAGKGPVRDWDHFLKREAGSYMLFLESCRESALCRIDSSESWRCWTGLCIVSDMS PTISHKDSSRQRRPGNFSHSLDMKSGPLPPGGWDDSHLDSAGREGDREALLGDTGTGDFL KAPQSFRAELSSILLLLFLYVLQGIPLGLAGSIPLILQSKNVSYTDQAFFSFVFWPFSLK LLWAPLVDAVYVKNFGRRKSWLVPTQYILGLFMIYLSTQVDRLLGNTDDRTPDVIALTVA FFLFEFLAATQDIAVDGWALTMLSRENVGYASTCNSVVYLSYYGVRFIYPFERVCDVGGK LINLLEIIVWEITQENEVSVVKEETQGITDTYKLLFAIIKMPAVLTFCLLILTAKIGFSA ADAVTGLKLVEEGVPKEHLALLAVPMVPLQIILPLIISKYTAGPQPLNTFYKAMPYRLLL GLEYALLVWWTPKVEHQGGFPIYYYIVVLLSYALHQVTVYSMYVSIMAFNAKVSDPLIGG TYMTLLNTVSNLGGNWPSTVALWLVDPLTVKECVGASNQNCRTPDAVEVAGPPRSAGKGT PPPSRGERRPLPPAGGAQGRSQEAGNMAGQPAATGSPSADKDGMEPNVVARISQWADDHL RLVRNISTGMAIAGIMLLLRSIRLTSKFTSSSDIPVEFIRRNVKLRGRLRRITENGLEIE HIPITLPIIASLRKEPRGALLVKLAGVELAETGKAWLQKELKPSQLLWFQLLGKENSALF CYLLVSKGGYFSVNLNEEILRRGLGKTVLVKGLKYDSKIYWTVHRNLLKAELTALKKGEG IWKEDSEKESYLEKFKDSWREIWKKDSFLKTTGSDFSLKKESYYEKLKRTYEIWKDNMNN CSLILKFRELISRINFRRKG >gi568815595r:155728213_155953997|GENSCAN_predicted_CDS_2|2763_bp atgaccgaggcccacgctttccagaccgccacagtcgctgtcacagggcctgagtttggg cggcgccccgccccggccgcacggtgcgccggcgttaatcctccccgaaggttccggcag ccaagattgcaggttcccaccaagcattgtttccgaactgggccacgagagcgtgtccct gattggcttgctctggaggctgggaagggacctgttagagactgggaccacttcctgaag cgcgaggcaggaagttatatgctttttctggagtcctgtagagaaagtgctctctgccgc attgatagcagcgagagctggaggtgttggacggggctctgcatcgtctctgatatgtca cccaccatctcccacaaggacagcagccggcaacggcggccagggaatttcagtcactct ctggatatgaagagcggtcccctgccgccaggcggttgggatgacagtcatttggactca gcgggccgggaaggggacagagaagctcttctgggggataccggcactggcgacttctta aaagccccacagagcttccgggccgaactaagcagcattttgctactactctttctttac gtgcttcagggtattcccctgggcttggcgggaagcatcccactcattttgcaaagcaaa aatgttagctatacagaccaagctttcttcagttttgtcttttggcccttcagtctcaaa ttactctgggccccgttggttgatgcggtctacgttaagaacttcggtcgtcgcaaatct tggcttgtcccgacacagtatatactaggactcttcatgatctatttatccactcaggtg gaccgtttgcttgggaataccgatgacagaacacccgacgtgattgctctcactgtggcg ttctttttgtttgaattcttggccgccactcaggacattgccgtcgatggttgggcgtta actatgttatccagggaaaatgtgggttatgcttctacttgcaattcggtagtgtattta agctattatggtgttaggtttatttatccttttgaaagagtgtgtgacgttggagggaag ctcattaatttactggaaatcattgtatgggagataactcaagaaaacgaagtatcagta gtaaaagaagaaacacaagggatcacagatacttacaagctgctttttgcaattataaaa atgccagcagttctgacattttgccttctgattctaactgcaaagattggtttttcagca gcagatgctgtaacaggactgaaattggtagaagagggagtacccaaagaacatttagcc ttattggcagttccaatggttcctttgcagataatactgcctctgattatcagcaaatac actgcaggtccccagccattaaacacattttacaaagccatgccctacagattattgctt gggttagaatatgccctactggtttggtggactcctaaagtagaacatcaagggggattc cctatatattactatatcgtagtcctgctgagttatgctttacatcaggttacagtgtac agcatgtatgtttctataatggctttcaatgcaaaggttagtgatccacttattggagga acatacatgacccttttaaataccgtgtccaatctgggaggaaactggccttctacagta gctctttggcttgtagatcccctcacagtaaaagagtgtgtaggagcatcaaaccagaat tgtcgaacacctgatgctgttgaggtcgcggggccaccccggtccgcggggaaggggacc ccgccgccttcccgaggtgagcggcgccccctgccgccagccggaggagctcagggccgc tcgcaggaggccgggaacatggcggggcagcccgcggccaccggctcgccgtctgccgac aaggacggaatggagcccaacgtcgtggctcggatctcgcagtgggcagacgaccacctg cgcctagtccggaacatcagcactggaatggccatagctggaataatgttacttttgaga agcattcgactgacatcaaaatttacaagctcttcggatattccagtagaatttataaga agaaatgttaaactacgtggacgattacgccgaataactgagaatggtttagaaattgaa catatacctattactttacctattatagcttcattgagaaaagagccacgtggtgctttg ctggttaagttggctggagtagaactcgctgaaactgggaaggcatggttacaaaaagag ctaaaaccttcccaattactatggttccaacttcttggaaaggagaattcagcactcttt tgctatcttctggtgagtaagggtggatatttcagcgtgaatctgaatgaagaaattttg agaagaggccttggcaaaactgttcttgttaaagggcttaaatatgattctaaaatctac tggacagttcacagaaacttacttaaagctgaattaacagccttaaaaaaaggagaagga atatggaaggaagactctgaaaaagaaagttacttagaaaaattcaaagattcctggaga gaaatatggaaaaaggacagttttttaaaaacaacaggatcagatttcagcttgaaaaaa gaaagttattatgaaaaacttaaaaggacttatgaaatatggaaagacaacatgaacaac tgctccttaatactgaagttcagagaacttataagtcgcataaactttcgtagaaaaggg tga >gi568815595r:155728213_155953997|GENSCAN_predicted_peptide_3|432_aa MEMFMEMFCDFLGSHIITEQLDSKDIYKNPKIQVGGDKLPQPQTLVLQHLRGKQSSLIKG PENRRIPNTQEVRPRRHSTQLENRSALGGGLPTPIVHKWQGTQAARGRRGPATRPRGADA LPDARTSTRRQDQQSIAAPEIKHATGPDQGLYDAWLPGAQGCADGLGEGGGEGPRGVAPR SGSLASAGMRCCRDPHADLGVSVAQSHRGQGRSRGGDGDSTRRGGRVPEGSAPRAEVEKE AWSRSSRQRRGASRQKSAEWSNSSGARAPAPRERRPAARTLPPPPASAPAGHWEEPEPVR LGSRASDTLRSRRQTGRPSRGGARGARPRPQPGPRRQSRPGLPRPALGRRSPRHPPAPPL LPSHRGARPGGGACGAGQEKNRSRIRASLLLPDLPATLDDFGRLAAPKRPPESWAPGGTL SRWRGLISGCWT >gi568815595r:155728213_155953997|GENSCAN_predicted_CDS_3|1299_bp atggaaatgtttatggaaatgttttgtgattttcttggatcccacataattactgaacaa ctagatagcaaagacatttacaagaaccccaaaatacaagttggtggtgacaaactacca cagccacaaaccctagtgctacagcacctacgtgggaagcaatccagccttattaaagga cctgaaaacaggagaattcccaacacccaagaggtacgacccagaaggcacagcacccaa ttagagaaccgcagcgcgctaggaggaggtttgcccacaccaatagtacacaagtggcag ggaacccaagcggcgcgggggaggcgggggccagcgacgcggccgcggggagctgatgcc ctccccgacgcccggacctcgacacgcagacaagatcaacaaagcatcgccgctcccgag ataaaacatgccacaggccccgaccagggactctacgacgcgtggctgcccggggcgcag ggctgcgcggacgggctgggggaagggggtggggaagggcctcgcggggtggctccccgg tccgggagcctcgcctccgccgggatgcggtgctgcagggacccccacgctgaccttgga gtctccgttgcacagagccatcggggccagggccggagccgcggcggtgacggcgacagt acgagacggggcgggagggtgccggaagggtcggcgccgcgggctgaggttgagaaggag gcctggtcgaggagcagccggcagcggagaggagccagccgccagaaaagcgccgaatgg agcaacagcagcggagcgcgggccccagcgccgcgggagagaagaccagcagcgcgcacg ctgcccccgcctcctgcttccgccccggccggccactgggaggagccagagcccgttcgg cttggcagccgagcctcagataccctccgcagccgccgccaaaccggccggccctcgaga ggaggggctcgcggcgcccggccccgcccccagcccggtccccgccggcaaagccggccg ggtctcccgcggcccgcccttgggaggaggagcccgcggcatccaccggctccgcccctc ctgccctcgcaccgaggagcccgccccggaggaggggcctgtggcgccggccaggagaaa aaccgaagccgaatccgagctagtttgctgctgccggatttgcctgctaccttggatgac ttcgggaggctggcggcaccgaagaggcctcctgagagctgggcccctggcggaactctt agccggtggcgtggccttatctcaggttgctggacatga >gi568815595r:155728213_155953997|GENSCAN_predicted_peptide_4|791_aa MHLENAGGDLKDGHHHYEGAVVILDAGAQYGKVIDRRVRELFVQSEIFPLETPAFAIKEQ GFRAIIISGGPNSVYAEDAPWFDPAIFTIGKPVLGICYGMQAIQGRHSYKMTLDMKKISA NLFGYLGKEETAVQRSWVGTIFEEQQESMWLEQSIANESKKLYGAQFHPEVGLTENGKVI LKNFLYDIAGCSGTFTVQNRELECIREIKERVGTSKVLVLLSGGVDSTVCTALLNRALNQ EQVIAVHIDNGFMRKRESQSVEEALKKLGIQNTVDVFSRGAKREVLGLSLLIPGIVEGGV INAAHSFYNGTTTLPISDEDRTPRKRISKTLNMTTSPEEKRKIIGDTFVKIANEVIGEMN LKPEEVFLAQGTLRPDLIESASLVASGKAELIKTHHNDTELIRKLREEGKVIEPLKDFHK DEVRILGRELGLPEELVSRHPFPGPGLAIRVICAEEPYICKDFPETNNILKIVADFSASV KKRVKACTTEEDQEKLMQITSLHSLNAFLLPIKTVGVQLISNFSKVSGYKINVQKSQAFL YTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNI PCSWVGRINIVKMAILPKGDCRSYSYVCGISSKDEPDWESLIFLARLIPRMCHNVNRVVY IFGPPVKEPPTDVTPTFLTTGVLSTLRQADFEAHNILRESGYAGKISQMPVILTPLHFDR DPLQKQPSCQRSVVIRTFITSDFMTGIPATPGNEIPVEVVLKMVTEIKKIPGISRIMYDL TSKPPGTTEWE >gi568815595r:155728213_155953997|GENSCAN_predicted_CDS_4|2376_bp atgcatctggagaatgctggaggagaccttaaggatggccaccaccactatgaaggagct gttgtcattctggatgctggtgctcagtacgggaaagtcatagaccgaagagtgagggaa ctgttcgtgcagtctgaaattttccccttggaaacaccagcatttgctataaaggaacaa ggattccgtgctattatcatctctggaggacctaattctgtgtatgctgaagatgctccc tggtttgatccagcaatattcactattggcaagcctgttcttggaatttgctatggtatg caggctattcagggaaggcattcctataagatgacattggacatgaagaaaataagtgct aacctttttggatatttggggaaagaggaaacagcagtacagagatcctgggtggggacc atatttgaggagcagcaggaatccatgtggttggagcagagcatagcaaatgaatctaaa aagttatatggagcacagttccaccctgaagttggccttacagaaaatggaaaagtaata ctgaagaatttcctttatgatatagctggatgcagtggaaccttcaccgtgcagaacaga gaacttgagtgtattcgagagatcaaagagagagtaggcacgtcaaaagttttggtttta ctcagtggtggagtagactcaacagtttgtacagctttgctaaatcgtgctttgaaccaa gaacaagtcattgctgtgcacattgataatggctttatgagaaaacgagaaagccagtct gttgaagaggccctcaaaaagcttggaattcagaacactgtggatgtattctctagaggg gctaaacgagaggttctgggcttgagcctactaataccaggcatagttgagggtggagtg ataaatgctgctcattctttctacaatggaacaacaaccctaccaatatcagatgaagat agaaccccacggaaaagaattagcaaaacgttaaatatgaccacaagtcctgaagagaaa agaaaaatcattggggatacttttgttaagattgccaatgaagtaattggagaaatgaac ttgaaaccagaggaggttttccttgcccaaggtactttacggcctgatctaattgaaagt gcatcccttgttgcaagtggcaaagctgaactcatcaaaacccatcacaatgacacagag ctcatcagaaagttgagagaggagggaaaagtaatagaacctctgaaagattttcataaa gatgaagtgagaattttgggcagagaacttggacttccagaagagttagtttccaggcat ccatttccaggtcctggcctggcaatcagagtaatatgtgctgaagaaccttatatttgt aaggactttcctgaaaccaacaatattttgaaaatagtagctgatttttctgcaagtgtt aaaaagagagtcaaagcctgcacaacagaagaggatcaggagaagctgatgcaaattacc agtctgcattcactgaatgccttcttgctgccaattaaaactgtaggtgtgcagctgata agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta tacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaagggtgac tgtcgttcctacagttacgtgtgtggaatctccagtaaagatgaacctgactgggaatca cttatttttctggctaggcttatacctcgcatgtgtcacaacgttaacagagttgtttat atatttggcccaccagttaaagaacctcctacagatgttactcccactttcttgacaaca ggggtgctcagtactttacgccaagctgattttgaggcccataacattctcagggagtct gggtatgctgggaaaatcagccagatgccggtgattttgacaccattacattttgatcgg gacccacttcaaaagcagccttcatgccagagatctgtggttattcgaacctttattact agtgacttcatgactggtatacctgcaacacctggcaatgagatccctgtagaggtggta ttaaagatggtcactgagattaagaagattcctggtatttctcgaattatgtatgactta acatcaaagcccccaggaactactgagtgggagtaa