GENSCAN 1.0 Date run: 6-Nov-116 Time: 07:09:04

Sequence gi568815597f:56545388_56807710 : 262323 bp : 38.55% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    405    551  147  2  0   51   93    60 0.124   2.31
 1.02 Term +   5368   5592  225  0  0   53   55   170 0.510   6.00
 1.03 PlyA +   7025   7030    6                               1.05

 2.08 PlyA -   7685   7680    6                               1.05
 2.07 Term -  27971  27850  122  0  2   89   38   141 0.719   6.86
 2.06 Intr -  32953  32885   69  2  0  115   54    26 0.357   0.14
 2.05 Intr -  33724  33491  234  2  0   54   80   239 0.668  16.44
 2.04 Intr -  34179  33996  184  0  1  -13   32   259 0.606   8.64
 2.03 Intr -  34333  34247   87  1  0   52   80    79 0.644   2.75
 2.02 Intr -  45716  45629   88  1  1   30   21   128 0.016  -0.65
 2.01 Init -  51424  51261  164  1  2   71   76    79 0.738   4.25
 2.00 Prom -  51478  51439   40                              -7.75

 3.00 Prom +  58118  58157   40                              -4.35
 3.01 Sngl +  65205  65582  378  2  0   49   47   204 0.776   8.31
 3.02 PlyA +  65898  65903    6                               1.05

 4.00 Prom +  67654  67693   40                              -8.05
 4.01 Init +  67707  68071  365  2  2   91   66   175 0.921  12.17
 4.02 Intr +  74039  74177  139  2  1   22   67   104 0.124   1.25
 4.03 Intr +  77880  78050  171  2  0    6   99   124 0.085   4.52
 4.04 Intr +  82399  82546  148  0  1   77   44    60 0.552  -0.61
 4.05 Intr +  87167  87275  109  0  1   89  100    46 0.181   4.32
 4.06 Intr +  89764  89822   59  1  2   75   76    42 0.242  -0.69
 4.07 Intr +  99919 100030  112  2  1  105   39   145 0.318   9.82
 4.08 Intr + 100736 100806   71  2  2   48   62    55 0.010  -3.19
 4.09 Intr + 128994 129135  142  1  1   87   93   135 0.792  12.39
 4.10 Intr + 133940 134076  137  2  2  104   84    22 0.529   2.69
 4.11 Term + 134468 134481   14  0  2  116   49    -6 0.195  -4.31
 4.12 PlyA + 135423 135428    6                               1.05

 5.00 Prom + 140857 140896   40                              -5.45
 5.01 Init + 141134 141217   84  1  0  100   77    58 0.690   6.77
 5.02 Intr + 146971 147115  145  0  1   77   88   132 0.966  11.03
 5.03 Intr + 148378 148465   88  2  1   54   90    33 0.704  -1.79
 5.04 Intr + 150548 150772  225  2  0   74   55   148 0.964   6.28
 5.05 Intr + 158584 159088  505  1  1   66   97   474 0.951  38.15
 5.06 Intr + 160705 160831  127  0  1   71   91    68 0.998   4.73
 5.07 Term + 162088 162326  239  2  2   97   39   171 0.992   8.45
 5.08 PlyA + 163496 163501    6                               1.05

 6.12 PlyA - 163631 163626    6                               1.05
 6.11 Term - 165015 164957   59  1  2   84   39    79 0.385  -0.53
 6.10 Intr - 174942 174804  139  0  1   37  -12   124 0.000  -3.58
 6.09 Intr - 178282 178183  100  0  1   75   93    40 0.770   2.39
 6.08 Intr - 181196 181110   87  1  0   67   81   118 0.765   7.17
 6.07 Intr - 191760 191700   61  1  1   87  106    74 0.705   5.97
 6.06 Intr - 193266 193238   29  2  2   81   97     8 0.445  -2.16
 6.05 Intr - 195408 195310   99  2  0   72   93    40 0.331   1.21
 6.04 Intr - 198879 198765  115  1  1  115   78    75 0.800   7.79
 6.03 Intr - 205816 205657  160  1  1   74   93    89 0.885   6.64
 6.02 Intr - 208548 208452   97  2  1   73   77    45 0.804   0.99
 6.01 Init - 213938 213832  107  2  2   52   98    77 0.776   4.84
 6.00 Prom - 217706 217667   40                              -3.45

 7.04 PlyA - 218074 218069    6                               1.05
 7.03 Term - 220316 220210  107  0  2   13   43   114 0.226  -3.01
 7.02 Intr - 222551 222442  110  1  2   91   97   107 0.969  10.91
 7.01 Init - 235198 235134   65  1  2   94   53    56 0.385   3.37
 7.00 Prom - 237515 237476   40                              -4.55

 8.06 PlyA - 238343 238338    6                               1.05
 8.05 Term - 240665 240410  256  1  1   86   48   185 0.993   8.37
 8.04 Intr - 241821 241788   34  1  1  131  107    58 0.999   8.36
 8.03 Intr - 243747 243586  162  1  0   91  116   115 0.995  13.63
 8.02 Intr - 247349 246669  681  0  0   91   92   347 0.476  26.03
 8.01 Init - 249008 248888  121  2  1   55   99    88 0.575   7.00

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 118095 118005   91  0  1   57   86   125 0.839   9.90
S.002 Intr - 174942 174786  157  0  1   37  101   146 0.985   9.46

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:56545388_56807710|GENSCAN_predicted_peptide_1|123_aa
NKQHVDWQDTRHQGQPKIEIVKRQDWSQMQCQKRRGEHYGRPCKNKITLGEMPNNIIPLH
HGIAMTAAWYVCLRNGRFSWKYQLPRQLPLCGSGLEKKWKNPQLESNWKDKRGEQGPRNC
SKA

>gi568815597f:56545388_56807710|GENSCAN_predicted_CDS_1|372_bp
aataaacaacacgtggactggcaggacaccagacaccagggacagccaaagatcgagata
gtcaaaaggcaagactggagccagatgcagtgccaaaagcgaagaggtgagcattatggg
aggccctgcaagaacaaaatcaccctgggcgaaatgccaaataacattattccccttcac
catggtattgccatgacagctgcctggtatgtttgcctgagaaatggaaggttttcatgg
aaatatcagcttccacgacagctccccctttgtggaagtggccttgaaaagaagtggaag
aatccccaactggaatctaattggaaggacaagaggggagagcaaggcccccggaattgt
tcaaaggcctga

>gi568815597f:56545388_56807710|GENSCAN_predicted_peptide_2|315_aa
MGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTAKETMIRVYRKKKRKKEKKKEKRKKKKK
KEYTVSDKFYGEKESKGIRGDADTGVGENVALIILRLRQLKEGAVLSESHPSQCRKQTQA
ITAAAAAAAAPTAGGAGGAGGGGGGGGKVRVGAGAPELLGSAQLPFIKEPAAEEAASARR
RPGRLGVWLLLRDVFAGREARAAASAMQNYKYDKAIVPESKNGGSPALNNNPRRSGSKRV
LLICLDLFCLFMGAFPGLGAGHGFSNHLSAYCVPGVKIIKFARILPNGKLPLDADLKVSC
VFQCKDLVLFLMPRG

>gi568815597f:56545388_56807710|GENSCAN_predicted_CDS_2|948_bp
atgggcaaagacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactatgatcagagtg
tacaggaaaaaaaaaagaaagaaagaaaagaaaaaagaaaaaagaaaaaagaagaagaag
aaggaatatacagtgtcagataaattctatggagaaaaagaaagcaagggcatcagggga
gatgcagacacgggggtgggggaaaatgtggccttaattatcctacgtcttaggcagctt
aaggaaggggctgtgctttcggaatcccatccgagccaatgcagaaaacaaacccaggcg
atcacagcagcagccgccgcggcagcagcaccaacagcaggaggagcaggaggagccgga
ggaggaggaggaggaggaggcaaagttagagttggggctggcgctccggagttgctgggc
tcagcgcagctcccattcattaaggaaccagctgcggaggaagcagcctcggccaggagg
cgacccgggcgcctgggtgtgtggctgctgttgcgggacgtcttcgcggggcgggaggct
cgcgccgcagccagcgccatgcaaaactacaagtacgacaaagcgatcgtcccggagagc
aagaacggcggcagcccggcgctcaacaacaacccgaggaggagcggcagcaagcgggtg
ctgctcatctgcctcgacctcttctgcctcttcatgggtgctttccccgggcttggtgct
gggcacggcttcagcaatcacttgagtgcctactgtgtgccaggtgtaaagataatcaag
tttgcaagaattcttccgaatggaaaattacccctggatgcagacctgaaggtgtcctgt
gtctttcagtgcaaagacttggtgttgtttctaatgccaagaggctaa

>gi568815597f:56545388_56807710|GENSCAN_predicted_peptide_3|125_aa
MGDFNTLLSILDKSMRQKVNKDIQDLNSALHQADLIDIYRTLHLKSTEYTFFSAPHCTYS
KIDHIVGSKALLSKCKTAEIKTNCLSDHSANKLELRIKKLTQHHTTTRKLNNVLVNDYWQ
NKDVL

>gi568815597f:56545388_56807710|GENSCAN_predicted_CDS_3|378_bp
atgggagactttaacaccctactgtcaatattagacaaatcaatgagacagaaggttaac
aaggatatccaggacttgaactcagctctgcaccaagcagacctaatagacatctacaga
actctccacctcaaatcaacagaatatacattcttctcagcaccacattgcacttattcc
aaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaaacagcagaaatc
aaaacaaactgtctgtcagaccacagtgcaaacaaattagaactcaggattaagaaactc
actcaacaccacacaactacacggaaactgaacaacgtgctcgtgaatgactactggcag
aacaaagatgttctttga

>gi568815597f:56545388_56807710|GENSCAN_predicted_peptide_4|488_aa
MGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQLTEWEKIFAICPSDKGL
ISRIYKEVKQIYKKKSNNPIKKWAKDMNRHFSQEDIYAANRQMKKCSSSLVIREMQMKTT
MSWSLSPLHGDSGVGLDEVVSEFLIGRLGKYSLLPEVGGQVAVEMASKPHGKNGFLTGTA
LGLGALHLMLQLQLWLKGAKVHLRPLLQRVQALSLDGFHVVLGLQAFALATALLGCMCFC
SPGLPEESASFGRDQKLKRPSQFMQAHLKLLQGCGEAFWGTRMMIQRGKPNWGEKKAAIN
LGESGRCSLRRRAQIVAAIIPMCHGEDLVGGPAALWVGGGGGGYAERQAVERGRARRRWL
RSRSTTGGFLHVPGLSVVGGDCLGDAPGARGFGEHQLTGHKVAVKILNRQKIRSLDVVGK
IKREIQNLKLFRHPHIIKLFLSTFIKLFIGCSHSKVTESPEAKLFSLFQIYFSVFGGCGG
RVEGVYSI

>gi568815597f:56545388_56807710|GENSCAN_predicted_CDS_4|1467_bp
atgggcaaggacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccataagagtg
aacaggcaacttacagaatgggagaaaatttttgcaatctgcccatctgacaaagggcta
atatccagaatctacaaagaagttaaacaaatttacaagaaaaagtcaaacaaccccatc
aaaaagtgggcaaaggatatgaacagacacttctcacaagaagacatttatgcagccaac
agacagatgaaaaaatgctcatcatcactggtcatcagagaaatgcaaatgaaaaccaca
atgagctggagcctgagtccgctgcatggagactctggtgtgggtcttgacgaggtggtc
agtgaattcctgatcgggagacttggtaaatacagtctccttccagaggtcgggggtcag
gtagctgtagaaatggcatcaaagccccatgggaaaaatggttttctcaccggaacagct
ttaggacttggtgccctgcatctcatgctccagctccagctctggctaaaaggggccaag
gtacatctcaggccattgcttcagagggtgcaagccctaagtcttgatggcttccatgtg
gtgttgggcctgcaggcctttgcgttggctacagctttgctgggctgtatgtgtttttgc
agtcctggactgcctgaggaaagcgcttcttttggtagagatcagaagctcaagagacca
agccaattcatgcaggcacatctaaaacttctgcaaggatgtggagaagctttctggggc
actaggatgatgatacaaagaggaaaaccaaactggggggaaaaaaaagctgccatcaat
ttgggggaaagtggaagatgttccctgagaaggagagctcaaattgtagctgccataatt
cctatgtgtcatggggaggacctggtgggagggcccgctgcactgtgggtaggcggcggc
ggcggcggctacgcggagcggcaggcggtggagcgaggccgcgcgcgccgaagatggctg
agaagcagaagcacgacgggcggattccttcacgttccaggactctcggttgttgggggt
gactgtttgggggatgctccaggggccagaggttttggagaacatcaattaacaggccat
aaagtggcagttaaaatcttaaatagacagaagattcgcagtttagatgttgttggaaaa
ataaaacgagaaattcaaaatctaaaactctttcgtcatcctcatattatcaaactattt
ctgagcacatttatcaaattgtttatcggatgttcacattctaaagtcacagagtcacct
gaagctaaacttttctctctcttccaaatttacttctctgtttttggagggtgtggtggg
agagtggagggagtctactcaatatga

>gi568815597f:56545388_56807710|GENSCAN_predicted_peptide_5|470_aa
MVLAFTSSKGLRKLTIMTEGNWQPGCHMVEEMEARRLFQQILSAVDYCHRHMVVHRDLKP
ENVLLDAHMNAKIADFGLSNMMSDGEFLRTSCGSPNYAAPEVISGRLYAGPEVDIWSCGV
ILYALLCGTLPFDDEHVPTLFKKIRGGVFYIPEYLNRSVATLLMHMLQVDPLKRATIKDI
REHEWFKQDLPSYLFPEDPSYDANVIDDEAVKEVCEKFECTESEVMNSLYSGDPQDQLAV
AYHLIIDNRRIMNQASEFYLASSPPSGSFMDDSAMHIPPGLKPHPERMPPLIADSPKARC
PLDALNTTKPKSLAVKKAKWHLGIRSQSKPYDIMAEVYRAMKQLDFEWKVVNAYHLRVRR
KNPVTGNYVKMSLQLYLVDNRSYLLDFKSIDDEVVEQRSGSSTPQRSCSAAGLHRPRSSF
DSTTAESHSLSGSLTGSLTGSTLSSVSPRLGSHTMDFFEMCASLITTLAR

>gi568815597f:56545388_56807710|GENSCAN_predicted_CDS_5|1413_bp
atggtgctggcttttacttctagtaagggtctcaggaagcttacaatcatgacagaaggc
aactggcagccagggtgtcacatggttgaagagatggaagccaggcggctctttcagcag
attctgtctgctgtggattactgtcataggcatatggttgttcatcgagacctgaaacca
gagaatgtcctgttggatgcacacatgaatgccaagatagccgatttcggattatctaat
atgatgtcagatggtgaatttctgagaactagttgcggatctccaaattatgcagcacct
gaagtcatctcaggcagattgtatgcaggtcctgaagttgatatctggagctgtggtgtt
atcttgtatgctcttctttgtggcaccctcccatttgatgatgagcatgtacctacgtta
tttaagaagatccgagggggtgtcttttatatcccagaatatctcaatcgttctgtcgcc
actctcctgatgcatatgctgcaggttgacccactgaaacgagcaactatcaaagacata
agagagcatgaatggtttaaacaagatttgcccagttacttatttcctgaagacccttcc
tatgatgctaacgtcattgatgatgaggctgtgaaagaagtgtgtgaaaaatttgaatgt
acagaatcagaagtaatgaacagtttatatagtggtgaccctcaagaccagcttgcagtg
gcttatcatcttatcattgacaatcggagaataatgaaccaagccagtgagttctacctc
gcctctagtcctccatctggttcttttatggatgatagtgccatgcatattcccccaggc
ctgaaacctcatccagaaaggatgccacctcttatagcagacagccccaaagcaagatgt
ccattggatgcactgaatacgactaagcccaaatctttagctgtgaaaaaagccaagtgg
catcttggaatccgaagtcagagcaaaccgtatgacattatggctgaagtttaccgagct
atgaagcagctggattttgaatggaaggtagtgaatgcataccatcttcgtgtaagaaga
aaaaatccagtgactggcaattacgtgaaaatgagcttacaactttacctggttgataac
aggagctatcttttggactttaaaagcattgatgatgaagtagtggagcagagatctggt
tcctcaacacctcagcgttcctgttctgctgctggcttacacagaccaagatcaagtttt
gattccacaactgcagagagccattcactttctggctctctcactggctctttgaccgga
agcacattgtcttcagtttcacctcgcctgggcagtcacaccatggatttttttgaaatg
tgtgccagtctgattactactttagcccgttga

>gi568815597f:56545388_56807710|GENSCAN_predicted_peptide_6|350_aa
MHFSEQIPIVKRCMTVIKIGIMSLNKIHRTIGKVKSAKHEDKKMKEKQPCELKPKNTEKE
PYSNHVFKVDACEGTPEKIQMTNVHTGRRNMLAGKQEAMIDIIQTNPCPEGPKLARHSQG
HCGHLEVLESTKETPDLGVSKTSSISEEIYDDVEYSRKEVLDGKEALKRLQQFFKKEKDR
FKIKKTKSKENLSAFSILLPDLELKSQEVIIYDDVDLSEKESKDEDKLKMWKPKFLTPKE
KKEKNGAEESERNFFKTKKQNLEKNRMKREEKLFRERFKVRSLLKYDKEIIVINTAVACS
NNSRNGIFDLPISPGEELEVIDTTEQNLVICQRPVKVFNADIHCHLRGEE

>gi568815597f:56545388_56807710|GENSCAN_predicted_CDS_6|1053_bp
atgcatttctcagaacagatccccattgttaagcgatgcatgactgtaattaagattggc
atcatgtcattaaataaaatccacagaactattggaaaagtcaaaagtgctaaacatgaa
gataaaaaaatgaaggaaaaacaaccatgtgaattgaaacctaaaaacacagaaaaggaa
ccatattcaaaccatgttttcaaggtagatgcctgtgaagggacacctgaaaaaattcag
atgaccaacgtccacacaggtagaaggaacatgttggctggaaagcaagaggccatgatt
gacatcatccagacaaatccctgccctgagggcccaaagctggccaggcactcccaaggc
cactgtgggcatctggaggttttggagtcaactaaagaaactccagacctaggggtctct
aagacaagttccatctcggaggagatatatgatgatgtcgagtactccaggaaagaggtt
ttagatggaaaagaagcactcaaaagactgcagcaattcttcaagaaagaaaaggataga
tttaaaataaagaaaaccaagtcgaaagaaaacttaagtgcattttccattttgctgcct
gatttagaacttaagtctcaggaagttattatttatgatgatgtagacctgagtgaaaaa
gagtcaaaagatgaagataaactgaaaatgtggaagcccaagtttctgacaccaaaggaa
aaaaaagagaaaaacggtgctgaagaatcagaaagaaatttcttcaaaaccaagaagcaa
aacttagaaaagaacagaatgaaaagagaagaaaaactatttagagaaaggtttaaggta
cgttctcttctgaagtacgacaaagagattattgtcatcaatacagcagtggcctgttcc
aataattcaagaaatggaatatttgatttgccaataagtcctggagaagaattggaagtc
attgataccaccgaacaaaatctagtgatatgtcaaagacctgtcaaagtctttaatgct
gacatccactgtcatcttcgaggggaagagtag

>gi568815597f:56545388_56807710|GENSCAN_predicted_peptide_7|93_aa
MESQVKSSRVMQLDQQHQGLTWLFNAEFEEPHNYEATISYLRHSGNSINLCTAKEIADLL
EVKKLATQKHQWAQQQLQLKPALSSQTTRKRAT

>gi568815597f:56545388_56807710|GENSCAN_predicted_CDS_7|282_bp
atggagagtcaggtgaagtcatccagggtgatgcaactggatcagcagcaccagggcttg
acctggcttttcaatgcagaatttgaagaaccacataattacgaggcaacaatttcatat
ctgagacactctggcaactccattaacctgtgcactgcaaaagaaattgctgatcttttg
gaggtaaaaaagttagcaactcagaaacaccagtgggcacaacaacagcttcagctgaag
cctgctctttctagccaaacgaccaggaaaagggcaacctag

>gi568815597f:56545388_56807710|GENSCAN_predicted_peptide_8|417_aa
MAPLDIVLLPAKAKDDRTHLQNLMQFSGRKKDQQEEHGAGGPIKFPAGVSPKGDIGGTQS
TQILANGKPLSSNHKQRTPYCSSSESQPLQPQKIKLAQKSEIPKCSNSPGPLGKSTVCSA
TSSQKASLLLEVTQSNVEIITKEKVMVANSFRNKLWNWEKVSSQKSEMSSALLLANYGSK
AIHLEGQKGMGLTPEEPRKKLETKGAQTLPSQKHVVAPKILHNVSEDPSFVISQHIRKSW
ENPPPERSPASSPCQPIYECELASQAPEKQPDVRHHHLPKTKPLPSIDSLGPPPPKPSRP
PIVNLQAFQRQPAAVPKTQGEVTVEEGSLSPESKLAQGHSPTVAANKQDKPNHTSTFQAF
TVLQLLTSYWSKQVIWPSPEAEWEGAESFMAKSRDAGRSEEWGIDPVNLSPKRKHLK

>gi568815597f:56545388_56807710|GENSCAN_predicted_CDS_8|1254_bp
atggctccattagacattgtcctacttcctgcgaaagcaaaagatgacagaacacattta
caaaatctaatgcagtttagtgggaggaagaaggaccagcaggaagagcatggagcagga
ggacctattaaattcccagcaggtgtttctccaaagggtgacattggaggcacacagtca
actcaaattttggccaatgggaaacccctctcatccaaccacaagcagcgcacaccatac
tgttccagtagtgagtcccagcctcttcaacctcagaaaataaagttggctcagaagagt
gaaattccaaaatgttctaactccccagggcctctgggaaagtctactgtatgttctgca
acaagttcacagaaggcttctctgctgttagaggtgactcaatcaaatgttgagataatc
actaaggaaaaagtaatggtggccaatagcttcagaaacaaactctggaactgggagaag
gtttcatctcagaaaagtgaaatgtcttcagcccttctccttgccaactatggaagtaag
gccatccatctggaagggcaaaaaggcatggggcttactccagaggaacccaggaaaaag
ctggaaacaaaaggagcccagactcttccttcccagaagcacgtggtggcccccaaaata
ttacataacgtctctgaagatccctcttttgtaatttctcaacatatcagaaaaagctgg
gaaaacccacctcctgagaggagcccggcaagcagcccctgccagcccatctatgagtgt
gagcttgccagtcaggccccagaaaaacagccagatgtcaggcatcaccaccttcccaaa
acaaagccattgccctccatcgactccctgggtcctcctcccccaaagccttcaagacct
cccatcgtgaacctccaggcctttcagaggcagccagctgctgttcccaagactcagggg
gaagtgactgtggaagagggctccctgtctccagagagcaagctagcccaggggcactct
cctacagtcgctgctaacaagcaagacaagcccaatcacaccagtacttttcaagccttt
actgtgttgcagttgctgacatcttattggtccaagcaagtaatatggccaagcccagaa
gcagagtgggagggtgctgaaagtttcatggcaaaaagcagggatgcaggaaggagtgaa
gaatggggtattgatccagtcaacctatccccgaagaggaaacatttgaaatga