GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:03:24 Sequence gi568815597r:37945430_38146743 : 201314 bp : 49.87% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.21 Intr - 421 327 95 1 2 26 77 149 0.281 6.36 1.20 Intr - 3519 3457 63 0 0 123 30 34 0.267 0.01 1.19 Intr - 4266 4171 96 0 0 117 83 45 0.692 7.11 1.18 Intr - 6418 6351 68 2 2 98 115 63 0.964 8.62 1.17 Intr - 9398 9308 91 2 1 100 36 25 0.330 -1.93 1.16 Intr - 14746 14691 56 2 2 88 90 32 0.812 2.10 1.15 Intr - 22705 22615 91 1 1 119 95 19 0.983 5.27 1.14 Intr - 23057 23034 24 2 0 83 111 19 0.783 1.92 1.13 Intr - 24035 23925 111 2 0 91 59 166 0.835 14.58 1.12 Intr - 24306 24142 165 0 0 49 106 218 0.958 19.86 1.11 Intr - 31524 31455 70 2 1 74 98 42 0.987 2.98 1.10 Intr - 33398 33291 108 1 0 102 101 67 0.988 8.80 1.09 Intr - 33626 33559 68 2 2 90 90 -33 0.801 -5.20 1.08 Intr - 35295 35157 139 2 1 93 105 44 0.823 7.07 1.07 Intr - 36382 36300 83 1 2 102 80 67 0.992 5.74 1.06 Intr - 38831 38740 92 0 2 69 94 88 0.999 7.21 1.05 Intr - 39350 39278 73 2 1 87 106 82 0.999 8.88 1.04 Intr - 42249 42144 106 2 1 47 87 112 0.997 7.22 1.03 Intr - 42407 42355 53 2 2 80 106 34 0.972 2.11 1.02 Intr - 44166 44119 48 0 0 91 76 26 0.589 0.58 1.01 Init - 44536 44441 96 1 0 95 113 194 0.999 22.91 1.00 Prom - 45753 45714 40 -5.96 2.06 PlyA - 46486 46481 6 1.05 2.05 Term - 50036 49945 92 0 2 106 54 57 0.953 1.98 2.04 Intr - 52441 52255 187 1 1 64 47 194 0.382 12.16 2.03 Intr - 52703 52534 170 0 2 113 81 152 0.999 16.67 2.02 Intr - 53719 53545 175 1 1 116 84 204 0.982 22.31 2.01 Init - 53983 53828 156 1 0 108 86 268 0.895 28.51 2.00 Prom - 65262 65223 40 -5.36 3.00 Prom + 66541 66580 40 -7.16 3.01 Init + 67374 67436 63 2 0 103 91 70 0.801 9.96 3.02 Intr + 70930 70991 62 0 2 101 70 92 0.997 6.23 3.03 Intr + 72239 72341 103 2 1 69 94 118 0.997 10.68 3.04 Intr + 73035 73148 114 2 0 30 86 68 0.726 1.44 3.05 Intr + 73630 73723 94 0 1 74 86 63 0.970 4.24 3.06 Intr + 73824 73954 131 1 2 30 86 66 0.917 1.01 3.07 Intr + 77270 77380 111 1 0 82 77 137 0.999 12.58 3.08 Term + 78116 78199 84 1 0 125 45 68 0.996 3.85 3.09 PlyA + 79371 79376 6 1.05 4.04 PlyA - 79451 79446 6 1.05 4.03 Term - 79999 79906 94 0 1 123 47 57 0.514 2.40 4.02 Intr - 81836 81685 152 2 2 70 41 52 0.158 -2.34 4.01 Init - 85099 85043 57 1 0 59 102 59 0.279 5.71 4.00 Prom - 94505 94466 40 -3.06 5.02 PlyA - 96442 96437 6 -0.45 5.01 Sngl - 101314 99959 1356 1 0 103 46 2115 0.937 202.95 5.00 Prom - 104453 104414 40 -6.26 6.03 PlyA - 107747 107742 6 1.05 6.02 Term - 114721 114458 264 1 0 44 36 104 0.200 -3.69 6.01 Init - 117720 117448 273 0 0 89 81 147 0.668 9.31 6.00 Prom - 136658 136619 40 -2.66 7.04 PlyA - 136716 136711 6 -0.45 7.03 Term - 138084 138014 71 1 2 95 43 53 0.427 -0.40 7.02 Intr - 138553 138489 65 0 2 83 106 15 0.418 1.16 7.01 Init - 143383 143235 149 1 2 66 108 87 0.549 6.21 7.00 Prom - 143952 143913 40 -5.36 8.00 Prom + 149676 149715 40 -7.36 8.01 Init + 150413 150486 74 1 2 9 87 127 0.636 3.26 8.02 Term + 151999 152215 217 1 1 120 37 153 0.938 10.02 8.03 PlyA + 157679 157684 6 1.05 9.07 PlyA - 157906 157901 6 1.05 9.06 Term - 167821 167678 144 1 0 86 45 76 0.152 1.01 9.05 Intr - 170164 170051 114 1 0 111 81 11 0.179 3.34 9.04 Intr - 178029 177963 67 2 1 81 28 95 0.025 1.71 9.03 Intr - 182502 182375 128 0 2 61 56 105 0.262 4.08 9.02 Intr - 190984 190616 369 1 0 96 22 116 0.140 1.10 9.01 Intr - 193569 193494 76 2 1 115 64 56 0.371 5.42 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:37945430_38146743|GENSCAN_predicted_peptide_1|599_aa METILEQQRRYHEEKERLMDVMAKEMLTKKSTLRDQINSDHRTRAMQDRYMEVSGNLRDL YDDKDGLRKEELNAISGPNEFAEFYNRLKQIKEFHRKHPNEICVPMSVEFEELLKARENP SEEAQNLVEFTDEEGYGRYLDLHDCYLKYINLKASEKLDYITYLSIFDQLFDIPKERKNA EYKRYLEMLLEYLQDYTDRVKPLQDQNELFGKIQAEFEKKWENGTFPGWPELASLGLDRL KSALLALGLKCGGTLEERAQRLFSTKGKSLESLDTSLFAKNPKSKGTKRDTERNKDIAFL EAQIYEYVEILGEQRHLTHENVQRKQARTGEEREEEEEEQISESESEDEENEIIYNPKNL PLGWDGKPIPYWLYKLHGLNINYNCEICGNYTYRGPKAFQRHFAVFGVAGALEWRHAHGM RCLGIPNTAHFANVTQIEDAVSLWAKLKLQKASERWQPDTEVLLSASAQSSQGGNEPVTL STLIQKSPPITVLLLALLVAQNLASSQLTPAPVQKKICATFSCQGGDCFKGPKAVAMCPE GLDFCELGPAESGSSSQEDPVGSTSLQAVQGVLCEGDSRQSRLLGLVRYRLEHGGQEHA >gi568815597r:37945430_38146743|GENSCAN_predicted_CDS_1|1797_bp atggagacaatactggagcagcagcggcgctatcatgaggagaaggaacggctcatggac gtcatggctaaagagatgctcaccaagaagtccacgctccgggaccagatcaattctgat caccgcactcgggccatgcaagataggtatatggaggtcagtgggaacctgagggatttg tatgatgataaggatggattacgaaaggaggagctcaatgccatttcaggacccaatgag tttgctgaattctataatagactcaagcaaataaaggaattccaccggaagcacccaaat gagatctgtgtgccaatgtcagtggaatttgaggaactcctgaaggctcgagagaatcca agtgaagaggcacaaaacttggtggagttcacagatgaagagggatatggtcgttatctc gatctccatgactgttacctcaagtacattaacctgaaggcatctgagaagctggattat atcacatacctgtccatctttgaccaattatttgacattcctaaagaaaggaagaatgca gagtataagagatacctagagatgctgcttgagtaccttcaggattacacagatagagtg aagcctctccaagatcagaatgaactttttgggaagattcaggctgagtttgagaagaaa tgggagaatgggacctttcctggatggccggagttggcttctctgggtttggacagattg aaatctgctctcttagctttaggcttgaaatgtggcgggaccctagaagagcgagcccag agactattcagtaccaaaggaaagtccctggagtcacttgatacctctttgtttgccaaa aatcccaagtcaaagggcaccaagcgagacactgaaaggaacaaagacattgcttttcta gaagcccagatctatgaatatgtagagattctcggggaacagcgacatctcactcatgaa aatgtacagcgcaagcaagccaggacaggagaagagcgagaagaagaggaagaagagcag atcagtgagagtgagagtgaagatgaagagaacgagatcatttacaaccccaaaaacctg ccacttggctgggatggcaaacctattccctactggctgtataagcttcatggcctaaat atcaactacaactgtgagatttgtggaaactacacctaccgagggcccaaagccttccag cgacactttgctgtgtttggagttgctggggctttggaatggcgtcatgctcatggcatg aggtgtttgggcatcccaaacactgctcactttgctaatgtgacacagattgaagatgct gtctccttgtgggccaaactgaaattgcagaaggcttcagaacgatggcagcctgacact gaggtcctactgtcagcctcagctcagagttctcagggtgggaatgaacctgtgaccctc tccaccctcatccagaagagtcctcccattacagtcctcctgctggccttactggtagcc cagaacctggcaagcagtcagttgacccctgctcctgtccagaagaaaatctgtgctact ttctcctgccaaggtggtgactgtttcaaagggccgaaagctgtggccatgtgccctgag ggcttggacttctgtgagctgggcccagcagaatctgggagcagcagtcaagaagaccct gtaggatctacaagccttcaggcggtgcaaggtgtgctgtgtgagggggacagccggcag agccgcctcctgggactcgtgcgctaccgcctggagcacggcggccaggaacacgcn >gi568815597r:37945430_38146743|GENSCAN_predicted_peptide_2|259_aa MSESFDCAKCNESLYGRKYIQTDSGPYCVPCYDNTFANTCAECQQLIGHDSRELFYEDRH FHEGCFRCCRCQRSLADEPFTCQDSELLCNDCYCSAFSSQCSACGETVMPGSRKLEYGGQ TWHEHCFLCSGCEQPLGSRSFVPDKGAHYCVPCYENKFAPRCARCSKTLTQGGVTYRDQP WHRECLVCTGCQTPLAGQQFTSRDEDPYCVACFGELFAPKCSSCKRPIVAPGTYYVFSKY LLRKSSPSIENNWRKGLEA >gi568815597r:37945430_38146743|GENSCAN_predicted_CDS_2|780_bp atgagcgagtcatttgactgtgcaaaatgcaacgagtccctgtatggacgcaagtacatc cagacagacagcggcccctactgtgtgccctgctatgacaatacctttgccaacacctgt gctgagtgccagcagcttatcgggcatgactcgagggagctgttctatgaagaccgccat ttccacgagggctgcttccgctgctgccgctgccagcgctcactagccgatgaacccttc acctgccaggacagtgagctgctctgcaatgactgctactgcagtgcgttttcctcgcag tgctccgcttgtggggagactgtcatgcctgggtcccggaagctggaatatggaggccag acatggcatgagcactgcttcctgtgcagtggctgtgaacagccactgggctcccgttct tttgtgcccgacaagggtgctcactactgcgtgccctgctatgagaacaagtttgctcct cgctgcgcccgctgcagcaagacgctgacacagggtggagtgacataccgtgatcagccg tggcatcgagaatgtctggtctgtaccggatgccagacgcccctggcagggcagcagttc acctcccgggatgaagatccctactgtgtggcctgttttggagaactctttgcacctaag tgcagcagctgcaagcgccccatcgtagctcccggcacttactatgtgttcagtaaatac ttgttgaggaagagcagtccaagcatagaaaacaactggcgcaaaggcctggaggcctga >gi568815597r:37945430_38146743|GENSCAN_predicted_peptide_3|253_aa MAAAFRKAAKSRQREHRERSQPGFRKHLGLLEKKKDYKLRADDYRKKQEYLKALRKKALE KNPDEFYYKMTRVKLQDGVHIIKETKEEVTPEQLKLMRTQDVKYIEMKRVAEAKKIERLK SELHLLDFQGKQQNKHVFFFDTKKEVEQFDVATHLQTAPELVDRVFNRPRIETLQKEKVK GVTNQTGLKRIAKERQKQYNCLTQRIEREKKLFVIAQKIQTRKDLMDKTQKVKVKKETVN SPAIYKFQSRRKR >gi568815597r:37945430_38146743|GENSCAN_predicted_CDS_3|762_bp atggcggcggcttttcggaaggcggctaagtcccggcagcgggaacacagagagcgaagc cagcctggctttcgaaaacatctgggcctgctggagaaaaagaaagattacaaacttcgt gcagatgactaccgtaaaaaacaagaatacctcaaagctcttcggaagaaggctcttgaa aaaaatccagatgaattctactacaaaatgactcgggttaaactccaggatggagtacat attattaaggagactaaggaagaagtaaccccagaacaactaaagctgatgagaactcag gacgtcaaatatatagaaatgaagagggttgcagaagctaagaaaatcgaaagactaaaa tcagagctccatctgctggatttccaggggaagcaacagaacaagcatgtgttctttttt gacaccaaaaaggaagttgaacagtttgatgtcgcaactcacctgcaaacagccccggag ctagtcgacagagtctttaataggcccaggatagagaccttgcagaaagaaaaagtgaaa ggagttaccaatcagactggacttaagcgaatagctaaagaaaggcaaaagcagtataac tgcctgacacagcggattgaacgagagaagaaattgttcgttattgctcagaaaattcaa acacgcaaagatcttatggataaaactcagaaagtgaaggtgaagaaagaaacggtgaac tccccagctatttataaatttcagagtcgtcgaaaacgttga >gi568815597r:37945430_38146743|GENSCAN_predicted_peptide_4|100_aa MIKPTEKIDKGAKQLQLCKVGRASLHLLAAFPAGGRAALTAQCPTVSSMGERTVSDKQML KAARRETPPRAPDLNSTGYTLRKQASSLGLYRKLTVMGTY >gi568815597r:37945430_38146743|GENSCAN_predicted_CDS_4|303_bp atgattaaacccacagagaagattgataagggagctaaacagctgcaactctgcaaggta ggcagagcgagtctgcatctgttggctgccttccctgctggaggacgtgcagctctaact gctcagtgccccaccgtcagcagcatgggggagaggacagtgagcgacaaacagatgctt aaagctgcacggagggagacccctcccagggcccctgatttaaattccactgggtacacc ctgaggaaacaggcttctagtctaggcttgtacaggaagctcacagtcatgggcacttat taa >gi568815597r:37945430_38146743|GENSCAN_predicted_peptide_5|451_aa MATTAQYLPRGPGGGAGGTGPLMHPDAAAAAAAAAAAERLHAGAAYREVQKLMHHEWLGA GAGHPVGLAHPQWLPTGGGGGGDWAGGPHLEHGKAGGGGTGRADDGGGGGGFHARLVHQG AAHAGAAWAQGSTAHHLGPAMSPSPGASGGHQPQPLGLYAQAAYPGGGGGGLAGMLAAGG GGAGPGLHHALHEDGHEAQLEPSPPPHLGAHGHAHGHAHAGGLHAAAAHLHPGAGGGGSS VGEHSDEDAPSSDDLEQFAKQFKQRRIKLGFTQADVGLALGTLYGNVFSQTTICRFEALQ LSFKNMCKLKPLLNKWLEETDSSSGSPTNLDKIAAQGRKRKKRTSIEVGVKGALESHFLK CPKPSAHEITGLADSLQLEKEVVRVWFCNRRQKEKRMTPAAGAGHPPMDDVYAPGELGPG GGGASPPSAPPPPPPAALHHHHHHTLPGSVQ >gi568815597r:37945430_38146743|GENSCAN_predicted_CDS_5|1356_bp atggccaccaccgcgcagtacctgccgcggggccccggtggcggagccgggggcaccggg ccgcttatgcacccggacgccgcggcggcggcggcggcggcggcggccgcggagcgattg catgcaggggccgcgtaccgcgaagtgcagaagctgatgcaccacgagtggctgggcgcg ggcgcgggccaccccgtgggcctagcgcacccccagtggctacccacgggaggaggcggc ggcggcgattgggccggcggcccgcacctagaacacggcaaggcaggcggcggcggcacc ggccgagccgacgacggcggcggcggcggaggtttccacgcgcgcctggtgcaccagggg gcggcccacgcgggcgcggcatgggcgcagggcagcacagcgcaccacttgggcccggcc atgtcgccctcgcccggggccagcgggggccaccagccccagccgctcgggctgtacgcg caggcggcctacccggggggcggcggcggcggcctggccgggatgctggcggcgggcggt ggcggcgcggggccgggcctgcaccacgcgctgcacgaggatggccacgaggcgcagctg gagccgtcgccgccgccgcatctgggcgcccacggacacgcacacggacatgcacacgcg ggcggcctgcacgcggcggcggcgcacctgcacccgggcgcgggcggcggcggctcatcg gtgggcgagcactcggacgaggatgcgcccagctcggacgacctggagcagttcgccaag cagttcaagcagcggcgcatcaagctgggctttacgcaggccgacgtggggctggcgctg ggcacgctctacggtaacgtgttctcgcagaccaccatctgccgcttcgaggccctgcag ctgagcttcaagaacatgtgcaagctcaagccgctgctcaacaagtggctggaggagacc gactcgtccagcggcagccccaccaacctggacaagatcgcggcgcagggccgcaagcgc aagaagcgcacgtccatcgaggtgggggtcaaaggcgcgctcgagagccactttctcaag tgccccaagccctcggcgcacgagatcaccggcttggcagacagcctgcagctggagaag gaggtggtgcgcgtctggttctgcaaccggcggcagaaggagaagcgcatgacccctgcg gccggcgcgggccacccgcccatggacgatgtatacgcgcctggggagctagggcctggc gggggcggcgcatcgccaccctccgcgcccccaccgcccccgccggcggcgctgcaccac caccaccaccacacactgcccggctcagtgcagtga >gi568815597r:37945430_38146743|GENSCAN_predicted_peptide_6|178_aa MGRLPGNKALPLEQCKALALLTWRPHWHTPQLGLKVIGLVQQEWQQHARFPTNQGAQARQ LLPTPQPTQSPSSWLYRDGGTSLHKELSTNLTSPTHTSCPNHARITGSRLLTTPSMAASH YLWLPATTHGCHFQATREPPHAHTEPAATLMPAMPLQGPEAWQKNHTGTLTLIKDTHK >gi568815597r:37945430_38146743|GENSCAN_predicted_CDS_6|537_bp atgggcaggctgccaggcaacaaggccctgcccctggagcagtgcaaagccctggccctg ctcacctggcgaccacattggcacactccacagctgggactgaaggtcataggcctggtg cagcaggagtggcagcagcatgcaagattccccactaaccagggtgcccaagccaggcag ctgctgccaaccccacagccaacacagagcccctcctcttggctctacagagatggaggg acttcacttcacaaggagctctctacaaacctgacatccccgactcacaccagctgccca aatcacgctagaataacaggctccaggctgctcaccacgccctctatggctgccagccac tacctatggctgccagccactacccatggctgtcactttcaggccacccgagaaccccca catgctcacacggagcctgcagccacattgatgccagccatgcccttgcagggcccagaa gcctggcagaaaaatcacacaggcactttgacattaatcaaagacactcataaataa >gi568815597r:37945430_38146743|GENSCAN_predicted_peptide_7|94_aa MPTVPALGLPSLPQASQSVLCLKILGAFWELWHSLELLAQKRAGESVEGNTYPQAFQRSM GWPATDLPVAVVSMHINASLHCQCSSTSHIFVGV >gi568815597r:37945430_38146743|GENSCAN_predicted_CDS_7|285_bp atgcccactgtacctgcacttgggttgccaagccttccccaggccagccagtcagtcctc tgcctcaagattcttggtgctttctgggaactctggcacagcctggagctgctggcccag aagagggctggggagagtgtggaaggcaacacatatccacaggcatttcaaaggagcatg ggctggccagctactgatctccctgtggccgtggtctccatgcacatcaatgcttcgtta cattgtcagtgtagcagtacaagtcacatctttgtcggggtctag >gi568815597r:37945430_38146743|GENSCAN_predicted_peptide_8|96_aa MLLQMGLCLWACGFLVATQLPVWPNTRRPEPDASALPPPSRSTWDGRVLVAVAVLSSPTP GAATAGPRGLETTNGSLLMGCLPPPVCQPVLLHTGR >gi568815597r:37945430_38146743|GENSCAN_predicted_CDS_8|291_bp atgttgctgcagatgggactctgcctctgggcctgtggcttcctggttgccactcagctg cccgtgtggccaaacacaaggaggccagagccggatgccagcgcccttcctcccccatcc cgcagcacctgggatgggcgtgtgctcgtcgctgtcgccgtgctctcgtctcccactcct ggagcagccacagcaggtcccaggggcctggagacaacaaatggctccctgctgatgggc tgcttgccgcctcctgtctgccagccagtcctgctgcacaccggcagataa >gi568815597r:37945430_38146743|GENSCAN_predicted_peptide_9|299_aa XATTYAPPLGSCPEGKAVGSRAVDNQERRPEFHAGSLQSQWRRLYETQPHWLQPLRMHLR EQPGRHPPAWPVLSAGQSVSESRKADWGSIHTDSSRKASLWFLITPSREAFLVMREKIFN NLSLQCADLLERERERERAEKVEEGFQRRNQENTTRPPGVVIQSKQSEEGKGGVEQNGKE EEGGCHMVEKSVGFGLDNFKGSWVMSKAFKLPGRTLILHTPTIHLIDVHTRIHVWAPQEH PYLHTISPVPPKTAHFKHSKSAWLPYTAIQARPQVLREEYHYGAAKEPLFQDSWNHRNL >gi568815597r:37945430_38146743|GENSCAN_predicted_CDS_9|900_bp nnggccacaacttatgctcctcctctggggtcctgccctgaaggcaaagctgtgggcagc agggctgtggacaatcaggagaggcgccctgaattccatgccgggtcattacagagccag tggcggagactctatgagacccagccccactggctgcagccccttcggatgcatctgcga gagcagcctggccgtcacccgccggcctggcccgtgctgtcagccggccagagcgtgtct gaatcgaggaaagctgactggggctcgattcacactgactcgagccgcaaggcgagttta tggttcttgattacacccagcagggaggccttcttagtgatgagagagaaaatattcaac aacctgtctttgcaatgtgctgatctcttggagagagagagagagagagagagagcagag aaagtggaggaggggtttcaaaggaggaaccaagagaacacaaccaggccaccaggtgtt gtcatccagtcaaagcagagtgaagaaggaaaaggaggggtggagcagaatggcaaggag gaggagggtggttgtcacatggtagagaaaagtgtgggatttggactggacaatttcaaa ggctcttgggtgatgtcgaaggccttcaagctgccagggaggactctcattttacacaca cctacaattcacttaatagacgtacacactcgcatacacgtctgggccccccaggaacac ccgtatttacacacgatatccccagtccccccaaagactgctcatttcaagcattccaag tccgcctggctgccgtacacagcaatacaagctaggccacaggttctgagggaggagtac cactatggggcagccaaggagcccctcttccaggactcatggaaccacagaaatctatga