GENSCAN 1.0 Date run: 4-Nov-116 Time: 01:52:30 Sequence gi568815584r:68953860_69252978 : 299119 bp : 43.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.10 PlyA - 303 298 6 1.05 1.09 Term - 6259 6194 66 1 0 105 43 69 0.053 2.04 1.08 Intr - 15119 14997 123 2 0 71 44 63 0.183 0.98 1.07 Intr - 19236 19199 38 1 2 92 101 41 0.350 3.78 1.06 Intr - 21421 21313 109 1 1 44 94 75 0.417 3.56 1.05 Intr - 21720 21692 29 1 2 73 92 13 0.292 -1.97 1.04 Intr - 24423 24318 106 0 1 29 56 166 0.565 7.29 1.03 Intr - 24591 24470 122 1 2 92 -2 192 0.759 10.81 1.02 Intr - 24929 24788 142 2 1 25 1 70 0.626 -8.07 1.01 Init - 25197 25093 105 0 0 96 77 134 0.984 13.32 1.00 Prom - 26515 26476 40 -13.24 2.00 Prom + 27053 27092 40 -7.06 2.01 Init + 29278 29449 172 0 1 67 69 181 0.507 13.70 2.02 Intr + 49353 49448 96 1 0 98 60 120 0.646 10.18 2.03 Intr + 51925 52015 91 2 1 56 84 82 0.033 3.65 2.04 Intr + 70548 70597 50 0 2 72 94 44 0.007 1.72 2.05 Intr + 89508 89708 201 1 0 33 77 81 0.015 0.76 2.06 Intr + 93024 93081 58 1 1 89 55 42 0.090 -1.06 2.07 Term + 93267 93408 142 0 1 115 39 50 0.238 0.20 2.08 PlyA + 96770 96775 6 1.05 3.04 PlyA - 97084 97079 6 1.05 3.03 Term - 101752 99998 1755 1 0 98 44 1720 0.999 155.61 3.02 Intr - 108652 108525 128 2 2 127 100 99 0.994 15.40 3.01 Init - 128081 128021 61 2 1 67 106 3 0.210 1.41 3.00 Prom - 128330 128291 40 -6.96 4.00 Prom + 128616 128655 40 -7.16 4.01 Init + 129539 130001 463 1 1 60 42 338 0.950 22.26 4.02 Intr + 130231 130587 357 2 0 86 26 175 0.953 6.33 4.03 Term + 130630 131045 416 2 2 -12 34 292 0.661 9.22 4.04 PlyA + 134110 134115 6 1.05 5.08 PlyA - 134171 134166 6 1.05 5.07 Term - 137297 137196 102 2 0 73 48 36 0.856 -3.62 5.06 Intr - 138028 137815 214 0 1 107 99 175 0.824 19.22 5.05 Intr - 162636 162507 130 1 1 96 83 46 0.742 4.65 5.04 Intr - 164419 164280 140 0 2 86 80 89 0.616 8.01 5.03 Intr - 165371 165335 37 0 1 115 94 36 0.932 4.22 5.02 Intr - 168501 168358 144 1 0 65 83 41 0.109 1.55 5.01 Init - 199119 198906 214 0 1 84 99 204 0.467 18.08 5.00 Prom - 212480 212441 40 -3.36 6.00 Prom + 230276 230315 40 -4.66 6.01 Init + 230564 230664 101 1 2 45 60 103 0.211 2.93 6.02 Intr + 237664 237732 69 1 0 37 111 64 0.765 1.90 6.03 Intr + 237788 237962 175 2 1 9 85 164 0.952 8.14 6.04 Intr + 238051 238180 130 0 1 37 41 79 0.129 -1.63 6.05 Intr + 255670 255944 275 2 2 49 106 227 0.086 17.86 6.06 Intr + 274957 275213 257 0 2 123 48 120 0.878 7.74 6.07 Intr + 276706 276739 34 1 1 94 80 30 0.653 1.03 6.08 Intr + 280841 281172 332 1 2 116 93 64 0.800 4.13 6.09 Intr + 282187 282293 107 1 2 118 48 79 0.998 6.76 6.10 Intr + 282548 282683 136 0 1 65 109 144 0.998 13.83 6.11 Intr + 283716 284072 357 0 0 64 87 576 0.970 49.47 6.12 Term + 287025 287241 217 0 1 79 44 324 0.999 23.72 6.13 PlyA + 288477 288482 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 77378 77455 78 1 0 28 50 154 0.933 3.36 S.002 Term - 197620 197517 104 2 2 37 44 117 0.892 0.64 S.003 Init + 199598 199727 130 1 1 4 86 261 0.863 15.81 S.004 Sngl - 265433 264852 582 2 0 60 47 177 0.914 6.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:68953860_69252978|GENSCAN_predicted_peptide_1|279_aa MDHYDSQQTNDYMQPEEDWDRDLLLDPAWEKQQRKAPGGSGSAPAPEPGNAGPERGEVAP LAGRSPWGSVVQATLRLLQPSPRGGESAGEIGGRLRICLEKDQLECGWGRENWGAYAGSP AGPWVGSCNPDMGLRYVLPLRPPGKVGSSVRVEGLQNEGVCVYPVKLEELNQYLLSEWML KQRGSVGKCIEVVVNKAHIPILQTGGYLAGKSGYLGMKPLVRKQAQHRPTLLALGLFWEY GAEHPCVPLPCPQQQPALPTQCGDNQDENFHDDPLPLTE >gi568815584r:68953860_69252978|GENSCAN_predicted_CDS_1|840_bp atggaccattatgattctcagcaaaccaacgattacatgcagccagaagaggactgggac cgggacctgctcctggacccggcctgggagaagcagcagagaaaggcgcctggcggaagc gggagcgcccctgccccggagcccgggaacgccgggccggagcgcggggaggtggcgccg ctggcgggcaggtctccttgggggtccgtggtccaggccaccctgcggctgttgcaacca tccccgcgcggaggggagagcgccggagagatcggcggccgcctccggatctgcctggag aaggaccagctggagtgcggctggggacgagaaaactggggggcgtacgcgggctccccc gcgggcccttgggttggaagctgtaaccccgacatgggcttgcgctacgtcctgcccctc cgacccccgggaaaggtgggcagctcggttcgagttgaggggctgcagaacgagggtgta tgtgtgtacccagtcaagctggaggaacttaatcaatacttgttgagcgaatggatgttg aaacagagaggcagcgttggcaagtgcatagaggtggttgtgaataaggcccacatcccg attctgcagacagggggttatctggcgggcaagtctggttatctgggcatgaagccactg gttcggaagcaagctcagcataggcccacattgctggccctgggactgttctgggaatat ggagctgagcatccctgcgtcccgctgccctgcccccagcagcagccagccctgcctact caatgtggagacaaccaggatgaaaactttcatgatgatccacttccacttactgaatag >gi568815584r:68953860_69252978|GENSCAN_predicted_peptide_2|269_aa MKNAQVDMRRGQNPGEQHLERDSKEQEPLKETKKMVIQERGGPGVYDVEEVKGVRFQTAA VKAAMNFNLPIWMVWGGPRLLGKSAAFGRGMRATDLPGAQGISIDLNEAAGLTQASFREG CDVLDSGKCSQPTETAGQYHGSGAKVNRSLDLKKDRIMMTITAALSTQFLLGIVLGALCM FSLIHRRALQGKPRRNFVDEEIQDLQKVESWHQIPGINEAVRRLRLTRVQYILSRKERLL SQQTPSKKVFYMVCTLVSTSNESKKERMY >gi568815584r:68953860_69252978|GENSCAN_predicted_CDS_2|810_bp atgaagaatgcccaggtggacatgagaagagggcagaaccctggagaacaacacttagaa agggacagcaaagaacaagaacccctgaaggagacaaagaaaatggtcatccaggaacga ggagggcctggagtatatgacgttgaggaggtcaaaggagtaaggtttcaaactgctgcg gtcaaagctgccatgaacttcaacttgcccatctggatggtctggggtggtcctcgtctt ttgggcaagtctgcagcttttggcagagggatgagagccacggacttaccgggtgctcag ggaataagcattgacctgaatgaagctgctgggttgacacaggcatccttccgggagggg tgtgacgtgcttgattcaggcaaatgtagccagcccacagagacagcaggacagtatcat ggcagtggagccaaagtcaacaggtcattggacctgaaaaaagacaggattatgatgaca ataacagctgctctttctacccagttcctactgggcattgtgctaggtgctttgtgtatg ttttcactaatccacaggagagcattgcaaggtaagcctcgtcgcaactttgtggatgaa gaaattcaagacttgcagaaagtggagagttggcaccaaattccaggaatcaatgaggct gtcaggaggttaagacttactagggtacagtacattctttctcggaaggagaggttactt tctcagcagaccccatccaaaaaagtcttctacatggtgtgcacactggtatcaacaagc aatgaaagcaaaaaagagagaatgtactaa >gi568815584r:68953860_69252978|GENSCAN_predicted_peptide_3|647_aa MPFTGESLINYYGRSLWRELGGIGRVVNGAFMVLKGHRSIVNQVRFNPHTYMICSSGVEK IIKIWSPYKQPGCTGDLDGRIEDDSRCLYTHEEYISLVLNSGSGLSHDYANQSVQEDPRM MAFFDSLVRREIEGWSSDSDSDLSESTILQLHAGVSERSGYTDSESSASLPRSPPPTVDE SADNAFHLGPLRVTTTNTVASTPPTPTCEDAASRQQRLSALRRYQDKRLLALSNESDSEE NVCEVELDTDLFPRPRSPSPEDESSSSSSSSSSEDEEELNERRASTWQRNAMRRRQKTTR EDKPSAPIKPTNTYIGEDNYDYPQIKVDDLSSSPTSSPERSTSTLEIQPSRASPTSDIES VERKIYKAYKWLRYSYISYSNNKDGETSLVTGEADEGRAGTSHKDNPAPSSSKEACLNIA MAQRNQDLPPEGCSKDTFKEETPRTPSNGPGHEHSSHAWAEVPEGTSQDTGNSGSVEHPF ETKKLNGKALSSRAEEPPSPPVPKASGSTLNSGSGNCPRTQSDDSEERSLETICANHNNG RLHPRPPHPHNNGQNLGELEVVAYSSPGHSDTDRDNSSLTGTLLHKDCCGSEMACETPNA GTREDPTDTPATDSSRAVHGHSGLKRQRIELEDTDSENSSSEKKLKT >gi568815584r:68953860_69252978|GENSCAN_predicted_CDS_3|1944_bp atgccatttactggagaatcattaatcaattactatggaaggagcctttggagagaactg ggtggcattggtagggtggtcaacggagccttcatggtgctgaaagggcatcgatctatt gttaaccaagtccgatttaatccccacacctacatgatctgctcttctggtgtagaaaag attatcaagatctggagcccatacaagcagccaggatgtactggagacctcgacggtcgg attgaggacgattcccgctgcctctatacccatgaagagtacatcagccttgtgctgaac agtgggagtggcctgtcgcatgactacgccaaccagtcggtccaggaagacccccggatg atggccttctttgactcactggtacgccgagagatcgagggctggagctctgactcagac agtgacctcagtgagagtactatcctccaactgcacgctggggtcagcgagcgctcaggc tacactgactcagagtcttcggcctcattgcctcgctccccgcctcccacagtagatgag tctgccgacaacgccttccacctggggcccctgcgggtcaccaccacaaacacagtagcc tcaactccaccaacacccacgtgtgaggatgcagcctctcgccagcagcgtctgtctgct ctgcggcgctaccaagacaaacgcctcctggccctttccaatgagtccgattctgaggag aatgtctgtgaggtggaactagacacagatctctttccccggccacggtcacccagcccc gaagatgaatccagcagttccagcagctctagcagctctgaggatgaggaggagctgaat gaacgccgagcctctacctggcagcggaatgccatgcggcgccgacagaagacaacccga gaagacaagcccagtgccccaatcaagcccaccaacacttacattggagaagacaactat gattacccccagatcaaagtggatgacctctcctcctccccaacctcgtcccctgagcgg agcacttccacgctagagattcaaccaagccgggcatcaccaacttctgacatagaatca gttgagcgaaaaatttataaagcttacaagtggctccgctactcttatatctcctactca aataacaaagatggagagacctccttggtgaccggggaggcagatgaagggagagcagga accagccacaaagacaacccagccccttcttccagtaaggaagcctgtctaaacatagca atggcccagaggaaccaggacctgccacctgaaggctgcagcaaggacacttttaaagaa gagactcctagaactcccagcaatggcccaggccatgagcacagcagccatgcttgggca gaggtgccagagggtacctctcaggacactggcaatagcggctctgtagagcaccctttt gaaaccaagaagctcaatggaaaggccctgagcagtcgggctgaggagccgccttctcct cctgtccccaaggcatctggctccactctcaacagcgggtctggcaactgtcccaggacc cagtctgatgacagtgaggagaggagcctcgaaaccatctgtgccaaccacaacaatgga cgcttacaccctcgtccccctcaccctcacaataacgggcagaacttgggggagctggag gtggtggcctactcttccccaggacactcagacactgaccgtgataactcgtccctgaca gggacactcctacacaaagattgttgcgggtctgaaatggcctgtgagacccccaatgct ggaacaagagaggaccccactgacaccccagccacagatagtagcagggctgttcatggc cacagtggcctcaaaaggcaacgaattgaattggaagatacagattcagagaattcctcc tcagagaagaaattaaaaacatga >gi568815584r:68953860_69252978|GENSCAN_predicted_peptide_4|411_aa MSHLLMKLLRKKIKKWNLKLRQWNLKLQGASNLTLSETQNTDVSEETTGGGKVKKSKHSM NVGLSDAQNGDVSQEAVENIKVKKSPQKSTVLTNGEAAMQSPNSESKKKRKMVNDAESDT KKAKTENKGESEEESAKSPKETENNVEKPDDEDDTELIVKLNFMPRNGTGVLILSPTREL AMQTFGVLKELMSHHVHTYGLIMGGSNRSAEAQKLANGINITVVTPGCLLDHMQNIPGFM YKNLQCLVIDEADRILDVEFEEELKQIIKLLPTLEGPARISLKKEPLYVGVDDDKANATV DGLEQGYVVCPSEKRFLLLFTFLKKNQKKKLMAFFSSCMSVKYHYELLNYIDLPILAIHG KQKQNKHTTTFFQFCNADSGTLLCTDVAARELDITEVNWIVQYDPPDDPKE >gi568815584r:68953860_69252978|GENSCAN_predicted_CDS_4|1236_bp atgtctcacctgctgatgaaactcctacgcaagaagatcaagaagtggaacctcaaactg cggcagtggaacctaaagttgcagggggcctcaaatctgaccctgtccgaaactcaaaat acagatgtgtctgaagaaacaacgggaggtggaaaggttaaaaaatcaaaacattctatg aatgtgggcttatcagatgctcaaaatggagatgtgtctcaagaagcagtggaaaatata aaagttaaaaaatctccccagaaatccaccgtattaaccaatggagaagcagcaatgcag tctcctaattcagaatcaaaaaagaagagaaaaatggtgaatgatgctgagtctgataca aaaaaagcaaaaactgaaaacaaaggggaatctgaagaagaaagtgccaagtctcctaaa gaaacagaaaataatgttgagaagccagatgatgaagatgacactgaactcattgttaag ttaaatttcatgcccaggaatggaacaggggtccttattctctcacctactagagaacta gccatgcaaacttttggtgttcttaaggagctaatgagtcaccacgtgcatacctatggg ttgataatgggtggcagtaatagatctgctgaagcacagaaacttgctaatgggatcaac atcactgtggtcacaccaggctgtctgctggaccatatgcagaatatcccagggtttatg tataaaaatctgcagtgtctggttatcgatgaagctgatcgtatcttggatgttgagttt gaagaggaattaaaacaaattattaaacttttgccaacacttgaaggcccggcaaggatt tctctgaaaaaggagccattgtatgttggtgttgatgacgataaagctaatgcaacagtg gatggtcttgagcagggatatgttgtttgtccttctgaaaagagattccttctactcttt acattccttaagaagaaccaaaaaaagaagcttatggccttcttttcatcttgtatgtcc gtgaaataccactatgagttgctgaactacattgatttgcccatcttggccattcatgga aagcaaaagcaaaataagcatacgaccacattcttccagttctgcaatgcagattcagga acactattgtgtacagatgtggcagcaagagaactggacattactgaagtcaactggatt gttcagtatgaccctccggatgaccctaaggaataa >gi568815584r:68953860_69252978|GENSCAN_predicted_peptide_5|326_aa MKRRAGLGGSMRSVVGFLSQRGLHGDPLLTQDFQRRRLRGCRNLYKKDLLGHFGCVNAIE FSNNGGQWLVSGGDDRRVLLWHMEQAIHSRVKPIQLKGEHHSNIFCLAFNSGNTKVFSGG NDEQVILHDVESSETLDVFAHEDAVYGLSVSPVNDNIFASSSDDGRVLIWDIRESPHGEP FCLANYPSAFHSVMFNPVEPRLLATANSKEGVGLWDIRKPQSSLLRYGGNLSLQSAMSVR FNSNGTQLLALRRRLPPVLYDIHSRLPVFQFDNQGYFNSCTMKSCCFAGDRDQHHNWKWR AQAVSFQPFLSLTRNGDADVDLKLAV >gi568815584r:68953860_69252978|GENSCAN_predicted_CDS_5|981_bp atgaagaggagagctggcctggggggcagcatgaggtcagtggtgggcttcttgtcccag cggggcttgcatggggaccccctgctcactcaggactttcagaggagacgcctgcggggc tgcagaaacctctacaagaaggacctcctcggccacttcggctgtgtcaatgccattgaa ttctccaacaatggaggccagtggctggtctcaggaggagatgaccgccgggttctgcta tggcacatggaacaagccatccactccagggtcaagcccatacagctgaaaggagagcac cattccaacattttttgcctggctttcaacagtgggaacactaaagtgttctctggaggc aatgatgagcaagttatcctccatgatgttgaaagcagtgagacattggacgtgtttgct catgaagatgcagtatatggcttgtctgtgagcccagtgaatgacaacatttttgccagt tcctcagatgatggccgggttctcatttgggacattcgggaatccccccatggagagccc ttctgcctggcaaactatccatcagcctttcatagtgtcatgtttaaccctgtggagccc aggttgttggccacagccaattcaaaggaaggagtgggactctgggacattcgaaaacct cagagttctctcctgcgctatggtggaaacctgtccctccaaagtgccatgagtgtacga ttcaacagcaacgggacccagctcctggccctgaggcgacgcctgccccctgtgctctat gacatccattcccgcctgcctgtgtttcagtttgacaatcagggttacttcaactcatgc accatgaaaagctgctgttttgcaggagatcgtgaccagcatcacaactggaaatggaga gcccaagctgtgagtttccagccttttctgagtctcaccaggaatggagatgctgatgtt gacttaaagctggcagtttaa >gi568815584r:68953860_69252978|GENSCAN_predicted_peptide_6|729_aa MYSNSSKESAGSKGLENDIDFAEPYLWKIEDEVRRPHRFQVFNVSPLQVCGPVRDSRGCF QCRLRGVVERGDITVYGGDIAGPPLSRSELALPLSPPLGVTPSPPSHTAGRFSTQTLPDL RGCVSQSAKKEAPPAAPGSSSGAAGSTRQQLLFSLLDSWGFVLWKGIQRRRRSKTSPVTQ QPQQKVLGSRELPPPEDDQLHSSAPRSSWKERILKAKVVTVSQEAEWDQIEPLLRSELED FPVLGIDCEWVNLEGKASPLSLLQMASPSGLCVLVRLPKLICGGKTLPRTLLDILADGTI LKVGVGCSEDASKLLQDYGLVVRGCLDLRYLAMRQSNWDAETLTEDQVIYAARDAQISVA LFLHLLGYPFSRNSPGEKNDDHSSWRKVLEKCQGVVDIPFRSKGMSRLGEEVNGEATESQ QKPRNKKSKMDGMVPGNHQGRDPRKHKRKPLGVGYSARKSPLYDNCFLHAPDGQPLCTCD RRKAQWYLDKGIGELVSEEPFVVKLRFEPAGRPESPGDYYLMVKENLCVVCGKRDSYIRK NVIPHEYRKHFPIEMKDHNSHDVLLLCTSCHAISNYYDNHLKQQLAKEFQAPIGSEEGLR LLEDPERRQVRSGARALLNAESLPTQRKEELLQALREFYNTDVVTEEMLQEAASLETRIS NENYVPHGLKVVQCHSQGGLRSLMQLESRWRQHFLDSMQPKHLPQQWSVDHNHQKLLRKF GEDLPIQLS >gi568815584r:68953860_69252978|GENSCAN_predicted_CDS_6|2190_bp atgtattcaaattcctccaaagaatccgctggatcaaaaggattagagaatgacatagac tttgctgaaccttacctgtggaagatagaagatgaagttaggcggccgcacaggttccag gtctttaacgtgagcccgctgcaggtgtgcggcccagtccgagacagcagggggtgcttc cagtgcagactaagaggggtcgtggagcggggggatattaccgtgtacgggggtgacatt gcggggcccccgctgtcccggagcgagttggccctgcccctctccccgcccctcggcgtg accccctcgcccccgtcgcacacggcggggcggttcagcacccagactctaccagacctt cgaggctgcgtgtcccagtcagctaaaaaggaggccccgcccgcggcacctggtagctcc tcgggcgctgcgggttcgacgcggcaacagctgctgttttctctcctggattcgtggggg tttgtcctctggaaaggcatccagcgccgccgaaggagtaaaacgagtcctgtgacccaa cagccacagcagaaagtgctgggcagtagagagctgccccctccagaagatgatcagctg cactccagtgcccccagatcctcgtggaaggaacggatccttaaagcaaaggtggtgacg gtgtctcaggaggcagagtgggatcaaatcgagcccttgcttagaagtgaattagaagat tttccagtacttggaattgactgtgagtgggtaaatttggaaggcaaagccagccctctg tcacttctacaaatggcctccccaagtggcctgtgtgtcttggttcgcctgcccaagcta atctgtggaggaaaaacactaccaagaacgttattggatattttggcagatggcaccatt ttgaaagttggagtgggatgctcagaagatgccagcaagcttctgcaggattatggcctc gttgttagggggtgcctggacctccgatacctagccatgcggcagagcaactgggatgct gagactctcacagaggaccaggtaatttatgctgccagggatgcccagatttcagtggct ctctttcttcatcttcttggataccctttctctaggaattcacctggagaaaaaaacgat gaccacagtagctggagaaaagtcttggaaaaatgccagggtgtggtcgacatcccattt cgaagcaaaggaatgagcagattgggagaagaggttaatggggaagcaacagaatctcag cagaagccaagaaataagaagtctaagatggatgggatggtgccaggcaaccaccaaggg agagaccccagaaaacataaaagaaagcctctgggggtgggctattctgccagaaaatca cctctttatgataactgctttctccatgctcctgatggacagcccctctgcacttgtgat agaagaaaagctcagtggtacctggacaaaggcattggtgagctggtgagtgaagagccc tttgtggtgaagctacggtttgaacctgcaggaaggcccgaatctcctggagactattac ttgatggttaaagagaacctgtgtgtagtgtgtggcaagagagactcctacattcggaag aacgtgattccacatgagtaccggaagcacttccccatcgagatgaaggaccacaactcc cacgatgtgctgctgctctgcacctcctgccatgccatttccaactactatgacaaccat ctgaagcagcagctggccaaggagttccaggcccccatcggctctgaggagggcttgcgc ctgctggaagatcctgagcgccggcaggtgcgttctggggccagggccctgctcaacgcg gagagcctgcctactcagcgaaaggaggagctgctgcaagcactcagagagttttataac acagacgtggtcacagaggagatgcttcaagaggctgccagcctggagaccagaatctcc aatgaaaactatgttcctcacgggctgaaggtggtgcagtgtcacagccagggtggcctg cgctccctcatgcagctggagagccgctggcgtcagcacttcctggactccatgcagccc aagcacctgccccagcagtggtcagtggaccacaaccatcagaagctgctccggaaattc ggggaagatcttcccatccagctgtcttga