GENSCAN 1.0 Date run: 6-Nov-116 Time: 10:34:41 Sequence gi568815579f:55661801_55771804 : 110004 bp : 48.28% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 702 818 117 2 0 127 83 141 0.967 18.16 1.02 Intr + 1806 1944 139 2 1 106 105 196 0.999 23.14 1.03 Intr + 3444 3804 361 1 1 32 31 506 0.955 33.58 1.04 Intr + 5816 5916 101 2 2 49 48 97 0.974 1.55 1.05 Intr + 6707 6786 80 0 2 84 93 148 0.999 14.17 1.06 Intr + 6870 6992 123 2 0 91 109 358 0.534 38.98 1.07 Intr + 7283 7369 87 1 0 73 93 146 0.996 13.77 1.08 Intr + 7644 7892 249 2 0 110 77 542 0.978 52.93 1.09 Intr + 8465 8857 393 1 0 75 109 81 0.513 3.75 1.10 Intr + 12134 12264 131 1 2 136 40 221 0.706 21.59 1.11 Intr + 12861 13100 240 0 0 82 58 71 0.314 0.16 1.12 Intr + 16707 17055 349 0 1 82 82 487 0.455 42.86 1.13 Intr + 18957 19150 194 2 2 90 52 105 0.519 5.39 1.14 Intr + 23484 23845 362 0 2 43 100 802 0.557 71.76 1.15 Intr + 27070 27194 125 2 2 82 84 172 0.998 16.50 1.16 Intr + 27497 27571 75 1 0 120 77 73 0.998 9.11 1.17 Intr + 28067 28150 84 1 0 72 83 151 0.594 13.02 1.18 Intr + 29954 30119 166 1 1 124 27 156 0.827 12.63 1.19 Intr + 30886 30996 111 2 0 108 84 34 0.970 5.35 1.20 Intr + 31151 31237 87 0 0 38 99 142 0.978 10.24 1.21 Intr + 32926 33183 258 2 0 89 101 241 0.667 22.93 1.22 Intr + 35115 35181 67 1 1 87 90 2 0.273 -1.74 1.23 Intr + 36104 36163 60 2 0 79 92 68 0.172 4.15 1.24 Intr + 42438 42566 129 0 0 52 115 58 0.117 4.71 1.25 Term + 42745 42775 31 1 1 82 43 19 0.106 -5.87 1.26 PlyA + 43891 43896 6 1.05 2.10 PlyA - 44409 44404 6 1.05 2.09 Term - 47244 47112 133 2 1 59 45 140 0.291 4.36 2.08 Intr - 50170 50000 171 0 0 76 84 252 0.992 22.66 2.07 Intr - 50790 50620 171 2 0 91 100 18 0.833 2.36 2.06 Intr - 55098 54928 171 2 0 42 95 180 0.995 13.16 2.05 Intr - 62344 62180 165 0 0 100 95 116 0.995 12.58 2.04 Intr - 68192 68031 162 1 0 46 103 105 0.983 6.89 2.03 Intr - 70560 70199 362 0 2 38 70 91 0.344 -3.68 2.02 Intr - 71750 71062 689 0 2 73 22 341 0.202 17.26 2.01 Init - 76574 76295 280 2 1 75 100 188 0.950 15.88 2.00 Prom - 78367 78328 40 -8.06 3.03 PlyA - 81091 81086 6 1.05 3.02 Term - 85244 84857 388 1 1 -27 45 979 0.826 76.51 3.01 Init - 86831 86710 122 2 2 42 36 60 0.443 -4.14 3.00 Prom - 86963 86924 40 -5.76 4.00 Prom + 88934 88973 40 -5.66 4.01 Init + 92324 92504 181 1 1 57 54 180 0.808 10.85 4.02 Intr + 97173 97375 203 1 2 85 105 57 0.766 6.10 4.03 Intr + 99992 100286 295 1 1 83 116 274 0.964 26.28 4.04 Term + 100798 101375 578 2 2 79 36 369 0.974 25.43 4.05 PlyA + 101599 101604 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:55661801_55771804|GENSCAN_predicted_peptide_1|1372_aa EAMMDFFNAQMRLGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQ SLKIRRPHDYQPLPGMSENPSVYVPGAVLSALHARQFYDTGIGAVLSALHGRQFYDTGIG AVLSALHARQFYDTGIGAVLSALHGRQFYDTGIGAVLSALHGRQFYDTGIGAVLSALHGR QFYDTGIGAVLSALHARQFYDTGIGSQPTVPSEIRAANAADAVLRLPGQPRLLIVSCHPW VVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYV DINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMSSQVQMGG HPTEVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRPVDGVEVPGCGKR RDRCNHLTSLPHEMVVVRACDLLFDPLAGCMYGIPIVPGSDLDAEGSVVNRGPCCPCTLL SRAPCCPVHPAVRAPCCPCTLLSRAPCCPCTLLSVQLLEDLHAVLSRGRGRIPGCTVRSG WSRSRGGSEQIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDFCFIFAKT NLLIKHFVVFYFMERLCVQYVRPSSSVLGSGGGSLGKLRAPGWDAPAAGAGSLEAFPACA VPFSGSAQAQPRGTGPLPLADAVTCQHLPQPSSGSRPISPRIGALCPLLLQPGTMSTSSL RRQMKNIVHNYSEAEIKVREATSNDPWGPSSSLMSEIADLTYNVVAFSEIMSMIWKRLND HGKNWRHVYKSPSGSTGDEDNPTGPEALSHCEAATPPLALSGALCADLFWHEGESSCFSP SLRHMVSSCHFSPRWWVVGLAPSRPPQSPRRRQSLARLVPGVLTGHPRARSQAMTLMEYL IKTGSERVSQQCKENMYAVQTLKDFQYVDRDGKDQGVNVREKAKQLVALLRDEDRLREER AHALKTKEKLAQTATASSAAVGSGPPPEAEQAWPQSSGEEELQLQLALAMSKEEADQPPS CGPEDDAQLQLALSLSREEHDKEERIRRGDDLRLQMAIEESKRETGGKEESSLMDLADVF TAPAPAPTTDPWGGPAPMAAAVPTAAPTSDPWGGPPVPPAADPWGGGVPVSGPSASDPWT PAPAFSDPWGGSPAKPSTNGTTAAGGFDTEPDEFSDFDRLRTALPTSGSSAGELELLAGE VPARSPGAFDMSGVRGSLAEAVGSPPPAATPTPTPPTRKTPESFLGPNAALVDLDSLVSR PGPTPPGAKASNPFLPGDQGVWRVVGGLMEELLTTPCRKSGMASLLNGREEDFIQDHRDR HLQSLSLAELRPPESSMLKPYPPPCTDPVPQNGAVFGDRAFTRTVRKSIDVL >gi568815579f:55661801_55771804|GENSCAN_predicted_CDS_1|4119_bp gaggccatgatggatttcttcaacgcccagatgcgcctgggggggctgacccaggcccct ggcaacccagtgttggctgtgcagattaaccaggacaagaattttgcctttttggagttc cgctcagtggacgagactacccaggctatggcctttgatggcatcatcttccagggccag tcactaaagatccgcaggcctcacgactaccagccgcttcctggcatgtcagagaacccc tccgtctatgtgcctggcgctgtcctaagtgccctccatgcacggcagttctacgacaca gggattggcgctgtcctaagtgccctccatggacggcagttctacgacacagggattggc gctgtcctaagtgccctccatgcacggcagttctacgacacagggattggcgctgtccta agtgccctccatggacggcagttctacgacacagggattggcgctgtcctaagtgccctc catggacggcagttctacgacacagggattggcgctgtcctaagtgccctccatggacgg cagttctacgacacagggattggcgctgtcctaagtgccctccatgcacggcagttctac gacacagggattggcagtcagcccacagttcccagtgagatcagagcagctaatgctgcg gatgctgttctgagacttcccgggcagccgcggctgctcatcgtgtcgtgccacccctgg gttgtgtccactgtggtccccgactctgcccacaagctgttcatcgggggcttacccaac tacctgaacgatgaccaggtcaaagagctgctgacatcctttgggcccctcaaggccttc aacctggtcaaggacagtgccacggggctctccaagggctacgccttctgtgagtacgtg gacatcaacgtcacggatcaggccattgcggggctgaacggcatgcagctgggggataag aagctgctggtccagagggcgagtgtgggagccaagaatgccacgctgagcaccatcaat cagacgcctgtgaccctgcaagtgccgggcttgatgagctcccaggtgcagatgggcggc cacccgactgaggtcctgtgcctcatgaacatggtgctgcctgaggagctgctggacgac gaggagtatgaggagatcgtggaggacgtgcgggacgagtgcagcaagtacgggcttgtc aagtccatcgagatcccccggcctgtggacggcgtcgaggtgcccggctgcggaaagcgc cgtgacaggtgtaatcacctcacctctctgcctcatgagatggtggtggtgagggcgtgt gacctgctctttgatccccttgctgggtgtatgtatggaattcccatcgtaccaggctct gatctagacgctgagggttcagttgtgaatagaggtccctgctgtccgtgcaccctgctg tcccgtgcaccctgctgtcccgtgcaccctgctgtccgtgcaccctgctgtccgtgcacc ctgctgtcccgtgcaccctgctgtccctgcaccctgctgtccgtgcagctgttagaggac ctgcatgccgtattgagtcggggcaggggccgcattcctgggtgtactgttcgttcaggg tggtcaaggtcaagaggtggcagtgagcagatctttgtggagttcacctctgtgtttgac tgccagaaagccatgcagggcctgacgggccgcaagttcgccaacagagtggttgtcaca aaatactgtgaccccgactcttatcaccgccgggacttctgtttcatatttgctaagacg aatttgctcattaaacattttgttgtattttactttatggagcggctgtgtgtccagtat gtccgaccctcttcctcggttctgggctcgggtgggggttcccttggcaaactgcgggcc cctggctgggacgcccctgctgccggcgccggcagcctcgaggccttccctgcttgcgcg gtgcccttctcggggtcggcacaggcacagccccgggggacaggtcctcttcccctcgca gatgcggtgacctgccagcacctgccgcagccttcgtccgggagtcgccccatctctcca cgcatcggggccctgtgccccttgctgctgcagccgggcaccatgtcgacctcgtccttg aggcgccagatgaagaacatcgtccacaactactcagaggcggagatcaaggttcgagag gccacgagcaatgacccctggggcccatccagctccctcatgtcagagattgccgacctc acctacaacgttgtcgccttctcggagatcatgagcatgatctggaagcggctcaatgac catggcaagaactggcgtcacgtttacaagtccccttctggaagtacgggtgatgaggac aaccccacaggcccagaggccctgagccactgtgaagctgctacaccccctttggccctc tccggagccctctgtgctgacctgttctggcatgagggcgagtcctcttgcttctcacca tcccttcgccacatggtctcatcttgtcacttcagccccaggtggtgggttgtgggcctt gcgcccagtcggccgccccagagcccgcgtcgcaggcagtctctggcccgccttgtgccc ggcgtgctgaccgggcatccccgtgcccgctcgcaggccatgacgctgatggagtacctc atcaagaccggctcggagcgcgtgtcgcagcagtgcaaggagaacatgtacgccgtgcag acgctgaaggacttccagtacgtggaccgcgacggcaaggaccagggcgtgaacgtgcgt gagaaagctaagcagctggtggccctgctgcgcgacgaggaccggctgcgggaagagcgg gcgcacgcgctcaagaccaaggaaaagctggcacagaccgccacggcctcatcagcagct gtgggctcaggcccccctcccgaggcggagcaggcgtggccgcagagcagcggggaggag gagctgcagctccagctggccctggccatgagcaaggaggaggccgaccagcccccgtcc tgcggccccgaggacgacgcccagctccagctggcccttagtttgagccgagaagagcat gataaggaggagcggatccgtcgcggggatgacctgcggctgcagatggcaatcgaggag agcaagagggagactgggggcaaggaggagtcgtccctcatggaccttgctgacgtcttc acggccccagctcctgccccgaccacagacccctgggggggcccagcacccatggctgct gccgtccccacggctgcccccacctcggacccctggggcggcccccctgtccctccagct gctgatccctggggaggtggggtcccggtcagtgggccctcagcctccgatccctggaca ccggccccggccttctcagatccctggggagggtcacctgccaagcccagcaccaatggc acaacagcagccgggggattcgacacggagcccgacgagttctctgactttgaccgactc cgcacggcactgccgacctccgggagcagcgcaggagagctggagctgctggcaggagag gtgccggcccgaagccctggggcgtttgacatgagtggggtcaggggatctctggctgag gctgtgggcagccccccacctgcagccacaccaactcccacgccccccacccggaagacg ccggagtcattcctggggcccaatgcagccctcgtcgacctggactcgctggtgagccgg ccgggccccacgccgcctggagccaaggcctccaaccccttcctgccaggcgaccagggt gtctggagggttgtcgggggcttgatggaggaattgctgaccacaccatgcaggaagagt gggatggcgagcttgttaaatggccgtgaggaagactttattcaggaccatcgagacaga cacctgcagtcactgtcactagctgaactgcgccctcctgaaagctctatgttgaaaccc taccccccaccctgcaccgaccccgtaccccagaatggggctgtatttggagacagggcc tttacaagaactgtgagaaaatccattgacgttctatag >gi568815579f:55661801_55771804|GENSCAN_predicted_peptide_2|767_aa MAESFFSDFGLLWYLKELRKEEFWKFKELLKQPLEKFELKPIPWAELKKASKEDVAKLLD KHYPGKQAWEVTLNLFLQINRKDLWTKAQEEMRNKLNPYRKHMKETFQLIWEKETCLHVP EHFYKETMKNEYKELNDAYTAAARRHTVVLEGPDGIGKTTLLRKVMLDWAEGNLWKDRFT FVFFLNVCEMNGIAETSLLELLSRDWPESSEKIEDIFSQPERILFIMDGFEQLKFNLQLK ADLSDDWRQRQPMPIILSSLLQKKMLPESSLLIALGKLAMQKHYFMLRHPKLIKLLGFSE SEKKSYFSYFFGEKSKALKVFNFVGIFMFGISTEEIVSMLETSFGFPLSKDLKQEITQCL ESLSQCEADREAIAFQELFIGLFETQEKEFVTKVMNFFEEVFIYIGNIEHLVIASFCLKH CQHLTTLRMCVENIFPDDSGCISDYNEKLVYWRELCSMFITNKNFQILDMENTSLDDPSL AILCKALAQPVCKLRKLIFTSVYFGHDSELFKAVLHNPHLKLLSLYGTSLSQSDIRHLCE TLKHPMCKIEELILGKCDISSEVCEDIASVLACNSKLKHLSLVENPLRDEGMTLLCEALK HSHCALERLMLMGCFLTSDSCKDIAAVLICNGKLKTLKLGHNEIGDTGVRQLCAALQHPH CKLECLGLQTCPITRACCDDIAAALIACKTLRSLNLDWIALDADAVVVLCEALSHPDCAL QMLGLHKSGFDEETQKILMSVEEKIPHLTISHGPWIDEEYKIRGVLL >gi568815579f:55661801_55771804|GENSCAN_predicted_CDS_2|2304_bp atggcagaatcttttttttcggattttggcttgttgtggtatctgaaggagctcagaaag gaagagttttggaaatttaaggagctcctcaaacaacctttggagaaatttgaactcaag ccaatcccctgggctgagctgaagaaggcctccaaagaagatgtagcaaagctgctggac aaacattacccaggaaagcaggcatgggaggtaacactgaacctgtttctacagatcaat aggaaagatctctggacaaaggctcaggaagagatgagaaataagctaaacccatacaga aagcatatgaaggaaacatttcaactcatatgggagaaggaaacctgtcttcacgtccct gagcatttctacaaagaaaccatgaaaaatgagtataaagaattgaatgacgcatatact gctgcggctagacgacacactgtggtcctggaaggtcctgatggaattggaaaaacaacc cttttaagaaaagtgatgttggactgggcagagggaaacttatggaaggacaggttcaca tttgtgtttttcctcaatgtctgtgaaatgaacggtatcgcagagaccagcttactggag ctcctctctagggactggccggagtcttcagagaagatcgaagacattttttcccagcca gagagaattctgttcatcatggatggctttgagcaactgaagtttaacttacaacttaag gctgacttgagcgatgattggaggcagcggcagccaatgccaattatcctgagcagtttg ttgcaaaaaaagatgcttccagaatcctctctccttattgcattaggaaaactggctatg caaaaacactattttatgttgcggcatccaaaactcataaagctcttaggattcagtgaa tctgaaaagaagtcgtatttctcctacttctttggtgagaagagcaaagccctgaaagtc ttcaattttgtggggatattcatgtttggaatttcaacagaagaaatcgtcagcatgctg gagacctcctttggttttccactgtcaaaagacctaaagcaggaaataacccaatgcctt gaaagtttaagtcaatgtgaagctgatagggaagccatagctttccaggaactattcatt ggtttgtttgaaactcaggaaaaagaatttgtaaccaaagtgatgaatttctttgaagaa gttttcatttatattggtaacatagaacatttggtaatagcttcattctgcctgaagcat tgtcaacatttaacgacacttcgcatgtgtgtggagaatatctttccagatgactcagga tgcatctcagattacaatgagaagctcgtctactggcgggagctttgctcaatgttcatt accaacaagaacttccagattttagacatggaaaataccagcctcgatgatccctccctg gcgattctttgcaaagcgctggctcagcctgtttgtaaactccgaaaactcatatttact tctgtgtactttggacatgattcagaattatttaaggcagttcttcacaaccctcatctg aaacttctgagcctgtacggcactagcctctcccagtctgacatcagacacctgtgtgag acgctgaaacatccaatgtgcaagatagaagagctgatactgggaaagtgtgacatctcc agtgaagtttgtgaagacatcgcctccgtcctggcctgcaacagcaagctgaaacacctc tccttggtagaaaatcccttgagggacgaaggaatgacgttgctgtgtgaagccctgaag cactcacactgtgccctggagaggctgatgttgatgggctgtttccttacttccgattcc tgtaaggacattgctgctgttcttatttgcaatgggaaactgaagaccctgaaacttggg cataatgaaataggagacactggtgtcagacagttatgtgcagctttgcagcatcctcac tgtaaattagagtgtctcgggctgcaaacgtgtccgatcacccgtgcctgctgcgacgac atcgccgcagcactcatcgcctgcaaaacactgaggagcctgaacctcgactggattgcc ttggatgctgatgcagtggtggtgctgtgtgaggcattgagccacccggactgtgccctg cagatgctggggctgcacaaatctggctttgatgaagaaactcagaagatcctgatgtct gtggaagaaaaaattccccatctgaccatttcacatggaccttggattgacgaggaatac aagatcaggggtgtgctcctctga >gi568815579f:55661801_55771804|GENSCAN_predicted_peptide_3|169_aa MKGELSTKLKKPPGRVNESEMEFRVFPLKSMLLRALGMVGGKRTGNSQKRNQVTAQPPFP PPIITIITIITIITIIIIFIFTITIIIIFTITIIIITITIIITTIITIFIFTITVIIITI IFIFTITIITTIIIFIFTITVIIIITIITIIINIITIIILFISQKRKNK >gi568815579f:55661801_55771804|GENSCAN_predicted_CDS_3|510_bp atgaaaggtgaactttcaacaaaactaaagaaacctccaggacgtgtaaacgaaagtgaa atggaattccgagtgttccctctcaagtccatgctactacgagctttgggcatggttgga gggaaaagaacaggcaattcacaaaagagaaaccaagtgacagcccagccaccctttccg ccccccatcatcaccatcatcaccatcatcaccatcatcaccatcatcattatcttcatc ttcaccatcaccatcatcatcatcttcaccatcaccatcatcatcatcaccatcactatc atcatcaccaccatcatcacaatcttcatcttcaccatcaccgtcatcatcatcaccatc atcttcatcttcaccatcaccatcatcaccaccataataatcttcatcttcaccatcacc gtcatcatcatcatcaccatcatcaccatcatcatcaatatcatcacgatcatcatcctc ttcatttcacagaagaggaaaaacaagtga >gi568815579f:55661801_55771804|GENSCAN_predicted_peptide_4|418_aa MLLVATILDSPGLDPFYHFNSMSNNERYSSFQLACEIQKSAAMTLHVCTRIAWYKGYHIV GKNLSNSNNLNDGRMKSESDWIKKEGKGVAKVGGDTLWYKSPWQAALTPDLSCPQKQLEA RGETPEGETFAMAEHFKQIIRCPVCLKDLEEAVQLKCGYACCLQCLNSLQKEPDGEGLLC RFCSVVSQKDDIKPKYKLRALVSIIKELEPKLKSVLTMNPRMRKFQVDMTFDVDTANNYL IISEDLRSFRSGDLSQNRKEQAERFDTALCVLGTPRFTSGRHYWEVDVGTSQVWDVGVCK ESVNRQGKIVLSSEHGFLTVGCREGKVFAASTVPMTPLWVSPQLHRVGIFLDVGMRSIAF YNVSDGCHIYTFIEIPVCEPWRPFFAHKRGSQDDQSILSICSVINPSAASAPVSSEGK >gi568815579f:55661801_55771804|GENSCAN_predicted_CDS_4|1257_bp atgctgttggtggccaccatattggacagcccgggtctagaccctttttaccacttcaac tctatgtcgaataatgaacgatattcttcattccagctagcatgtgaaattcagaagtca gcagccatgactcttcacgtctgtacccgtattgcctggtacaagggatatcacatagta ggtaaaaacctgtccaatagtaacaatctcaatgatggcagaatgaaatcagaatctgat tggattaaaaaggagggaaagggtgtggctaaggtgggtggagacaccctctggtataaa agcccctggcaggcagctctcactccagatctgagctgtccgcagaagcagctggaggcc aggggagaaactccagaaggagagacatttgccatggctgagcacttcaaacagatcatt agatgtcctgtctgtctaaaagatcttgaagaagccgtgcaactgaaatgtggatatgcc tgctgcctccagtgcctcaattcactccagaaggagcccgatggggaaggtttactgtgc cgtttctgctctgtggtctctcagaaggatgacatcaagcccaagtacaagctgagggcg ctggtttccatcatcaaggaactagagcccaagctgaaatctgttctaacaatgaaccca aggatgaggaagtttcaagtggatatgacgttcgatgtggacacagccaacaactatctc atcatttctgaagacctgaggagtttccgaagtggggatttgagccagaataggaaggag caagctgagaggttcgacactgccctgtgcgtcctgggcacccctcgcttcacttccggc cgccattactgggaggtggacgtgggcaccagccaagtgtgggatgtgggcgtgtgcaag gaatctgtgaaccgacaggggaagattgtgctttcttcagaacacggcttcttgactgtg ggttgcagagaaggaaaggtctttgctgccagcactgtgcctatgactcctctctgggtg agtccccagttgcacagagtggggattttcctggatgtaggtatgaggtccattgccttt tacaatgttagtgatgggtgccatatctacacattcatcgagattcctgtttgcgagccc tggcgtccattttttgctcataaacgtggaagtcaagatgatcagagcatcctgagtatc tgttctgtgatcaatccatccgctgccagtgccccagtttcttctgagggaaagtaa