GENSCAN 1.0    Date run: 6-Nov-116    Time: 07:51:50

Sequence gi568815597r:205516549_205723882 : 207334 bp : 47.65% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4357   4501  145  0  1   71   56   106 0.791   5.98
 1.02 Intr +   5913   6097  185  1  2  104   57    -4 0.763  -2.39
 1.03 Intr +   6599   6749  151  1  1   90   42   232 0.878  18.54
 1.04 Intr +   6935   7077  143  0  2   40   94   142 0.755  10.07
 1.05 Intr +   7684   7809  126  0  0   81  105   136 0.900  15.48
 1.06 Intr +   8591   8647   57  1  0   78   94    58 0.956   4.48
 1.07 Intr +   9517   9631  115  0  1   70   54   224 0.999  17.22
 1.08 Intr +   9819   9913   95  1  2   73   85   180 0.586  15.88
 1.09 Intr +  10227  10289   63  2  0   70  100   125 0.532  10.81
 1.10 Intr +  11246  11369  124  1  1  117   73   299 0.986  31.46
 1.11 Intr +  11500  11620  121  2  1   59   89   111 0.885   7.85
 1.12 Intr +  12451  12548   98  1  2  141   69   133 0.995  16.35
 1.13 Intr +  12776  12881  106  0  1  118   68   156 0.996  15.87
 1.14 Intr +  12973  13015   43  1  1  119  105    40 0.988   7.04
 1.15 Intr +  13711  13801   91  0  1   83   79    90 0.907   7.17
 1.16 Intr +  14080  14157   78  2  0   93   91   127 0.914  13.02
 1.17 Intr +  16845  16998  154  1  1   40   72    70 0.232  -0.27
 1.18 Intr +  20187  20283   97  0  1   90   61    30 0.124   0.51
 1.19 Term +  30131  30307  177  1  0   98   35   145 0.977   7.89
 1.20 PlyA +  32838  32843    6                                1.05

 2.00 Prom +  45591  45630   40                               -2.96
 2.01 Init +  52522  52751  230  0  2   93   86   478 0.853  45.94
 2.02 Intr +  63203  63401  199  2  1  122  115   394 0.983  44.85
 2.03 Intr +  64113  64397  285  2  0  138   91   269 0.648  29.74
 2.04 Intr +  67407  67575  169  2  1   95   63   172 0.989  14.92
 2.05 Intr +  68328  68443  116  1  2  124  101   128 0.997  17.87
 2.06 Intr +  69486  69659  174  2  0  101   91   197 0.994  21.44
 2.07 Intr +  75592  75739  148  0  1   78  100   285 0.999  28.51
 2.08 Intr +  81232  81302   71  2  2   69   88    54 0.433   2.40
 2.09 Intr +  82583  82703  121  1  1  109  110     0 0.845   4.37
 2.10 Intr +  83746  83778   33  2  0  101   94    14 0.617   1.49
 2.11 Intr +  83858  83909   52  0  1   96   83    12 0.261  -0.53
 2.12 Term +  86707  86809  103  1  1   96   42    27 0.160  -3.35
 2.13 PlyA +  89436  89441    6                                1.05

 3.07 PlyA -  91086  91081    6                                1.05
 3.06 Term -  96794  96789    6  2  0  123   50     0 0.311  -2.43
 3.05 Intr - 104290 103562  729  1  0   80   33   413 0.449  26.63
 3.04 Intr - 107343 107128  216  0  0   91   62   182 0.985  14.60
 3.03 Intr - 109616 109438  179  0  2   14   -1   182 0.504   1.54
 3.02 Intr - 114647 114544  104  1  2   99   53    68 0.854   4.22
 3.01 Init - 115267 115161  107  1  2   69   47   196 0.998  11.29
 3.00 Prom - 120536 120497   40                               -2.06

 4.05 PlyA - 121099 121094    6                                1.05
 4.04 Term - 143123 142686  438  2  0   97   43   321 0.984  23.78
 4.03 Intr - 145578 145313  266  1  2  135   78   355 0.996  36.43
 4.02 Intr - 147070 146285  786  2  0  133  116   850 0.953  83.75
 4.01 Init - 148108 147937  172  1  1   76   92   277 0.999  24.50
 4.00 Prom - 157178 157139   40                               -1.96

 5.06 PlyA - 158102 158097    6                                1.05
 5.05 Term - 162246 162113  134  1  2  110   40   103 0.808   5.95
 5.04 Intr - 170930 170838   93  0  0   80   81    67 0.459   5.14
 5.03 Intr - 173810 173766   45  0  0  122   88    -2 0.317   1.58
 5.02 Intr - 176797 176765   33  2  0  122   92    13 0.162   3.29
 5.01 Init - 187413 187401   13  0  1   81   61     1 0.020  -2.86
 5.00 Prom - 192031 191992   40                               -2.36

 6.04 PlyA - 193395 193390    6                                1.05
 6.03 Term - 201931 201732  200  2  2   91   42   163 0.997   9.56
 6.02 Intr - 203128 202979  150  2  0   70   91   169 0.996  15.53
 6.01 Intr - 204105 203953  153  1  0   50   36   209 0.972  11.94

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 154036 154128   93  0  0  109   72    61 0.912   6.76

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:205516549_205723882|GENSCAN_predicted_peptide_1|722_aa
MVAALWKLPVCKSPADSRPETLHRATEERLKMTDRKEFRDGESGKSKAAGEKVQLASSDA
LFPSSSFSPLASWGMPEQRPSGVSIRGEMRFGPNHVPRVLAKLTFSLRALDPAAQSLMIM
NKMKNFKRRFSLSVPRTETIEESLAEFTEQFNQLHNRRNENLQLGPLGRDPPQECSTFSP
TDSGEEPGQLSPGVQFQRRQNQRRFSMEDVSKRLSLPMDIRLPQEFLQKLQMESPDLPKP
LSRMSRRASLSDIGFGKLETYVKLDKLGEGTYATVFKGRSKLTENLVALKEIRLEHEEGA
PCTAIREVSLLKNLKHANIVTLHDLIHTDRSLTLVFEYLDSDLKQYLDHCGNLMSMHNVK
IFMFQLLRGLAYCHHRKILHRDLKPQNLLINERGELKLADFGLARAKSVPTKTYSNEVVT
LWYRPPDVLLGSTEYSTPIDMWGVGCIHYEMATGRPLFPGSTVKEELHLIFRLLGTPTEE
TWPGVTAFSEFRTYSFPCYLPQPLINHAPRLDTDGIHLLSSLLLYESKSRMSAEAALSHS
YFRSLGERVHQLEDTASIFSLKEIQLQKDPGYRGLAFQQPGSARIAADLEYTSCCCLDRL
PTFDLDTTHPRNTSYTAPKVLRVLLLSSSVSRTSHHLPTKSIIGHQGPAPALAVEARWLA
SLLMGCVPSTSKFLPGCPNLEIMLKMEPDVQDFQLGFILLVNLLGYKDKDESNDSQIRKY
LY

>gi568815597r:205516549_205723882|GENSCAN_predicted_CDS_1|2169_bp
atggtggctgccctctggaagcttcctgtctgcaagagtcctgcagacagcaggcctgag
acattgcacagagccacggaggagcggctgaagatgactgacaggaaggagtttagggat
ggggagtcaggaaagtccaaggctgctggagagaaggtgcagctggccagctctgatgca
ctcttcccttcctcctctttttcccctcttgcctcctggggcatgccagagcagagacct
tcaggtgtcagtatcagaggtgaaatgaggtttggacccaatcatgtcccaagggtgttg
gccaagctgacattcagtttacgggccttggacccggctgcccagtccctcatgatcatg
aacaagatgaagaactttaagcgccgtttctccctgtcagtgccccgcactgagaccatt
gaagaatccttggctgaattcacggagcaattcaaccagctccacaaccggcggaatgag
aacttgcagctcggtcctcttggcagagaccccccgcaggagtgcagcaccttctcccca
acagacagcggggaggagccggggcagctctcccctggcgtgcagttccagcggcggcag
aaccagcgccgcttctccatggaggacgtcagcaagaggctctctctgcccatggatatc
cgcctgccccaggaattcctacagaagctacagatggagagcccagatctgcccaagccg
ctcagccgcatgtcccgccgggcctccctgtcagacattggctttgggaaactggaaaca
tacgtgaaactggacaaactgggagagggcacctatgccacagtcttcaaagggcgcagc
aaactgacggagaaccttgtggccctgaaagagatccggctggagcacgaggagggagcg
ccctgcactgccatccgagaggtgtctctgctgaagaacctgaagcacgccaatattgtg
accctgcatgacctcatccacacagatcggtccctcaccctggtgtttgagtacctggac
agtgacctgaagcagtatctggaccactgtgggaacctcatgagcatgcacaacgtcaag
attttcatgttccagctgctccggggcctcgcctactgtcaccaccgcaagatcctgcac
cgggacctgaagccccagaacctgctcatcaacgagaggggggagctgaagctggccgac
tttggactggccagggccaagtcagtgcccacaaagacttactccaatgaggtggtgacc
ctgtggtacaggccccccgatgtgctgctgggatccacagagtactccacccccattgat
atgtggggcgtgggctgcatccactacgagatggccacagggaggcccctcttcccgggc
tccacagtcaaggaggagctgcacctcatctttcgcctcctcgggacccccacagaagag
acgtggcccggcgtgaccgccttctctgagttccgcacctacagcttcccctgctacctc
ccgcagccgctcatcaaccacgcgcccaggttggatacggatggcatccacctcctgagc
agcctgctcctgtatgaatccaagagtcgcatgtcagcagaggctgccctgagtcactcc
tacttccggtctctgggagagcgtgtgcaccagcttgaagacactgcctccatcttctcc
ctgaaggagatccagctccagaaggacccaggctaccgaggcttggccttccagcagcca
ggctctgcaagaattgctgcagacctcgaatacacctcctgctgctgcctagaccgcctc
cccacctttgatctggacacaacccacccccgcaacacatcatacacagctcccaaagtg
ctcagagtcctgctgttgtccagttcggtgtccagaacctcccaccatctgccaactaaa
agcatcattgggcaccagggccctgctcccgccctagctgtggaggcccggtggctggca
tcattgctgatgggctgtgtgccctccacatccaagtttctccctggctgccctaatctg
gagataatgctgaaaatggagccagatgtccaagacttccagctgggatttatcctcttg
gtaaatctgctagggtataaggataaggatgaaagcaacgacagccagattaggaagtat
ctctattag

>gi568815597r:205516549_205723882|GENSCAN_predicted_peptide_2|566_aa
MGCDGRVSGLLRRNLQPTLTYWSVFFSFGLCIAFLGPTLLDLRCQTHSSLPQISWVFFSQ
QLCLLLGSALGGVFKRTLAQSLWALFTSSLAISLVFAVIPFCRDVKVLASVMALAGLAMG
CIDTVANMQLVRMYQKDSAVFLQVLHFFVGFGALLSPLIADPFLSEANCLPANSTANTTS
RGHLFHVSRVLGQHHVDAKPWSNQTFPGLTPKDGAGTRVSYAFWIMALINVSVPGVGQLP
VPMAVLMLLSKERLLTCCPQRRPLLLSADELALETQPPEKEDASSLPPKFQSHLGHEDLF
SCCQRKNLRGAPYSFFAIHITGALVLFMTDGLTGAYSAFVYSYAVEKPLSVGHKVAGYLP
SLFWGFITLGRLLSIPISSRMKPATMVFINVVGVVVTFLVLLIFSYNVVFLFVGTASLGL
FLSSTFPSMLAYTEDSLQYKGCATTVLVTGAGVGEMVLQMLVGSIFQAQGSYSFLVCGVI
FGCLAFTFYILLLFFHRMHPGLPSEPQNGRETNSLVPTQDRSIGMENSECYQRNLDISES
CGITLFPDFPLLLVYGNLTHQAHRIH

>gi568815597r:205516549_205723882|GENSCAN_predicted_CDS_2|1701_bp
atgggctgcgacggccgcgtgtcggggctgctccgccgcaacctgcagcccacgctcacc
tactggagcgtcttcttcagcttcggcctgtgcatcgccttcctggggcccacgctgctg
gacctgcgctgtcagacgcacagctcgctgccccagatctcctgggtcttcttctcgcag
cagctctgcctcctgctgggcagcgccctcgggggcgtcttcaaaaggaccctggcccag
tcactatgggccctgttcacctcctctctggccatctccctggtgtttgccgtcatcccc
ttctgccgcgacgtgaaggtgctggcctcagtcatggcgctggcgggcttggccatgggc
tgcatcgacaccgtggccaacatgcagctggtaaggatgtaccagaaggactcggccgtc
ttcctccaggtgctccatttcttcgtgggctttggtgctctgctgagcccccttattgct
gaccctttcctgtctgaggccaactgcttgcctgccaatagcacggccaacaccacctcc
cgaggccacctgttccatgtctccagggtgctgggccagcaccacgtagatgccaagcct
tggtccaaccagacgttcccagggctgactccaaaggacggggcagggacccgagtgtcc
tatgccttctggatcatggccctcatcaatgtgagtgtccctggggtgggccagcttcca
gtgcccatggctgtgctgatgctgctgtccaaggagcggctgctgacctgctgtccccag
aggaggcccctgcttctgtctgctgatgagcttgccttggagacacagcctcctgagaag
gaagatgcctcctcactgcccccaaagtttcagtcacacctagggcatgaggacctgttc
agctgctgccaaaggaagaacctcagaggagccccttattccttctttgccatccacatc
acgggcgccctggtactgttcatgacggatgggttgacgggtgcctattccgccttcgtg
tacagctatgctgtggagaagcccctgtctgtgggacacaaggtggctggctacctcccc
agcctcttctggggcttcatcacactgggccggctcctctccattcccatatcctcaaga
atgaagccggccaccatggttttcatcaacgtggttggcgtggtggtgacgttcctggtg
ctgcttattttctcctacaacgtcgtcttcctgttcgtggggacggcaagcctgggcctg
tttctcagcagcaccttccccagcatgctggcctacacggaggactcgctgcagtacaaa
ggctgtgcaaccacagtgctggtgacaggggcaggagttggcgagatggtgctgcagatg
ctggttggttcgatattccaggctcagggcagctatagtttcctggtctgtggcgtgatc
tttggttgtctggcttttaccttctatatcttgctcctgtttttccacaggatgcaccct
ggactcccatcagagccacagaatggcagagaaacaaacagcctggttcctacccaagac
agatcaattggaatggaaaactctgagtgctaccagaggaatttggacatttcagaatcc
tgtggaatcaccctgttcccagattttcctctgctacttgtatatggaaatctaacccat
caggcacatcggattcactaa

>gi568815597r:205516549_205723882|GENSCAN_predicted_peptide_3|446_aa
MRAAAGTRRARLGRRRLLRLRRGGRSGCRAVLEFPASLVIRCRVSFLFLFRSSGPPPLFT
FRLVHSCIEAGVAASQTVDTLVTVGNVEKEVFMVFLVELTHCGTGGQNDIVDKEKQGILS
SEMNSLLDQELIAMDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGI
RKNKPNMNYDKLSRALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCES
LNFSEVSSSSKDVENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIK
TENPAEKLAEKKSPQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALET
LVSPKLPSLEAPTSASNVMTAFATTPPISSIPPLQEPPRTPSPPLSSHPDIDTDIDSVAS
QPMELPENLSLEPKDQDSVLLEKDKK

>gi568815597r:205516549_205723882|GENSCAN_predicted_CDS_3|1341_bp
atgcgggcggctgcgggcacccggcgggctcggcttggccgccgccgccttctacggctc
cgccgcgggggtcgcagcggctgccgcgccgtcctcgagtttccagcctcgctagtcatc
cggtgtcgagtttcatttctttttctctttcgctccagcggccccccgcccctctttact
ttccgccttgtgcacagctgcatcgaggctggggtagcagcatcacagactgtagacact
ttggtcactgtaggcaacgtagagaaagaagtcttcatggtgttcctggtagagctgacc
cattgtggcactggtgggcagaatgacattgttgacaaagaaaaacaaggcatcctcagc
tcggagatgaattcgcttctggatcaagaactcattgctatggacagtgctatcaccctg
tggcagttccttcttcagctcctgcagaagcctcagaacaagcacatgatctgttggacc
tctaatgatgggcagtttaagcttttgcaggcagaagaggtggctcgtctctgggggatt
cgcaagaacaagcctaacatgaattatgacaaactcagccgagccctcagatactattat
gtaaagaatatcatcaaaaaagtgaatggtcagaagtttgtgtacaagtttgtctcttat
ccagagattttgaacatggatccaatgacagtgggcaggattgagggtgactgtgaaagt
ttaaacttcagtgaagtcagcagcagttccaaagatgtggagaatggagggaaagataaa
ccacctcagcctggtgccaagacctctagccgcaatgactacatacactctggcttatat
tcttcatttactctcaactctttgaactcctccaatgtaaagcttttcaaattgataaag
actgagaatccagccgagaaactggcagagaaaaaatctcctcaggagcccacaccatct
gtcatcaaatttgtcacgacaccttccaaaaagccaccggttgaacctgttgctgccacc
atttcaattggcccaagtatttctccatcttcagaagaaactatccaagctttggagaca
ttggtttccccaaaactgccttccctggaagccccaacctctgcctctaacgtaatgact
gcttttgccaccacaccacccatttcgtccataccccctttgcaggaacctcccagaaca
ccttcaccaccactgagttctcacccagacatcgacacagacattgattcagtggcttct
cagccaatggaacttccagagaatttgtcactggagcctaaagaccaggattcagtcttg
ctagaaaaggacaaaaaatga

>gi568815597r:205516549_205723882|GENSCAN_predicted_peptide_4|553_aa
MVQRLWVSRLLRHRKAQLLLVNLLTFGLEVCLAAGITYVPPLLLEVGVEEKFMTMVLGIG
PVLGLVCVPLLGSASDHWRGRYGRRRPFIWALSLGILLSLFLIPRAGWLAGLLCPDPRPL
ELALLILGVGLLDFCGQVCFTPLEALLSDLFRDPDHCRQAYSVYAFMISLGGCLGYLLPA
IDWDTSALAPYLGTQEECLFGLLTLIFLTCVAATLLVAEEAALGPTEPAEGLSAPSLSPH
CCPCRARLAFRNLGALLPRLHQLCCRMPRTLRRLFVAELCSWMALMTFTLFYTDFVGEGL
YQGVPRAEPGTEARRHYDEGVRMGSLGLFLQCAISLVFSLVMDRLVQRFGTRAVYLASVA
AFPVAAGATCLSHSVAVVTASAALTGFTFSALQILPYTLASLYHREKQVFLPKYRGDTGG
ASSEDSLMTSFLPGPKPGAPFPNGHVGAGGSGLLPPPPALCGASACDVSVRVVVGEPTEA
RVVPGRGICLDLAILDSAFLLSQVAPSLFMGSIVQLSQSVTAYMVSAAGLGLVAIYFATQ
VVFDKSDLAKYSA

>gi568815597r:205516549_205723882|GENSCAN_predicted_CDS_4|1662_bp
atggtccagaggctgtgggtgagccgcctgctgcggcaccggaaagcccagctcttgctg
gtcaacctgctaacctttggcctggaggtgtgtttggccgcaggcatcacctatgtgccg
cctctgctgctggaagtgggggtagaggagaagttcatgaccatggtgctgggcattggt
ccagtgctgggcctggtctgtgtcccgctcctaggctcagccagtgaccactggcgtgga
cgctatggccgccgccggcccttcatctgggcactgtccttgggcatcctgctgagcctc
tttctcatcccaagggccggctggctagcagggctgctgtgcccggatcccaggcccctg
gagctggcactgctcatcctgggcgtggggctgctggacttctgtggccaggtgtgcttc
actccactggaggccctgctctctgacctcttccgggacccggaccactgtcgccaggcc
tactctgtctatgccttcatgatcagtcttgggggctgcctgggctacctcctgcctgcc
attgactgggacaccagtgccctggccccctacctgggcacccaggaggagtgcctcttt
ggcctgctcaccctcatcttcctcacctgcgtagcagccacactgctggtggctgaggag
gcagcgctgggccccaccgagccagcagaagggctgtcggccccctccttgtcgccccac
tgctgtccatgccgggcccgcttggctttccggaacctgggcgccctgcttccccggctg
caccagctgtgctgccgcatgccccgcaccctgcgccggctcttcgtggctgagctgtgc
agctggatggcactcatgaccttcacgctgttttacacggatttcgtgggcgaggggctg
taccagggcgtgcccagagctgagccgggcaccgaggcccggagacactatgatgaaggc
gttcggatgggcagcctggggctgttcctgcagtgcgccatctccctggtcttctctctg
gtcatggaccggctggtgcagcgattcggcactcgagcagtctatttggccagtgtggca
gctttccctgtggctgccggtgccacatgcctgtcccacagtgtggccgtggtgacagct
tcagccgccctcaccgggttcaccttctcagccctgcagatcctgccctacacactggcc
tccctctaccaccgggagaagcaggtgttcctgcccaaataccgaggggacactggaggt
gctagcagtgaggacagcctgatgaccagcttcctgccaggccctaagcctggagctccc
ttccctaatggacacgtgggtgctggaggcagtggcctgctcccacctccacccgcgctc
tgcggggcctctgcctgtgatgtctccgtacgtgtggtggtgggtgagcccaccgaggcc
agggtggttccgggccggggcatctgcctggacctcgccatcctggatagtgccttcctg
ctgtcccaggtggccccatccctgtttatgggctccattgtccagctcagccagtctgtc
actgcctatatggtgtctgccgcaggcctgggtctggtcgccatttactttgctacacag
gtagtatttgacaagagcgacttggccaaatactcagcgtag

>gi568815597r:205516549_205723882|GENSCAN_predicted_peptide_5|105_aa
MEMIGKKSPVLPDFSGLDGLCKMVLCLGARGATLKPEVTSCVVKGSFVSNSKALPSCDDS
QGLSWSQDSVVLKVLRKCSLSQQNEFEDKNTAVDECTAVDRQAPI

>gi568815597r:205516549_205723882|GENSCAN_predicted_CDS_5|318_bp
atggaaatgatagggaagaaaagtcctgttttaccagacttctcaggcctggatggattg
tgcaaaatggtactatgccttggagcaagaggggcaacactcaaacctgaggtcaccagc
tgtgtagtgaagggctcatttgtcagcaactccaaggctctgccatcttgtgatgacagt
caaggcctgtcgtggagccaggactctgttgttctgaaagtgctcagaaaatgcagcttg
agtcagcagaatgagtttgaagataaaaatacagctgtggatgaatgtacagctgtggac
agacaggccccaatttag

>gi568815597r:205516549_205723882|GENSCAN_predicted_peptide_6|167_aa
XDSEDEKEDHKNVRQQRQAASKAASKQREMLMEDVGSEEEQEEEDEAPFQEKDSGSDEDF
LMEDDDDSDYGSSKKKNKKMVKKSKPERKEKKMPKPRLKATVTPSPVKGKGKVGRPTASK
ASKEKTPSPKEEDEEPESPPEKKTSTSPPPEKSGDEGSEDEAPSGED

>gi568815597r:205516549_205723882|GENSCAN_predicted_CDS_6|504_bp
naggatagtgaagatgaaaaagaagatcataaaaatgtgcgccaacaacggcaggcggca
tctaaagcagcttctaaacagagagagatgctcatggaagatgtgggcagtgaggaagaa
caagaagaggaggatgaggcaccattccaggagaaagattccggcagcgatgaagatttc
ctaatggaagatgatgacgatagtgactatggcagttcgaaaaagaaaaacaaaaagatg
gttaagaagtccaaacctgaaagaaaagaaaagaaaatgcccaaacccagactaaaggct
acagtgacgccaagtccagtgaaaggcaaagggaaagtgggtcgccccacagcttcaaag
gcatcaaaggaaaagactccttctcccaaagaagaagatgaggaaccggaaagcccgcca
gaaaagaaaacatctacaagccccccacccgagaaatctggggatgaagggtctgaagat
gaagccccttctggggaggattaa