GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:31:01 Sequence gi568815583f:63996014_64237615 : 241602 bp : 44.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 1288 1283 6 1.05 1.03 Term - 2017 1905 113 2 2 97 33 115 0.966 5.62 1.02 Intr - 4453 4359 95 0 2 85 85 -22 0.374 -3.19 1.01 Init - 7486 7431 56 1 2 113 67 84 0.966 9.56 1.00 Prom - 12137 12098 40 -4.36 2.00 Prom + 12159 12198 40 -4.66 2.01 Init + 19735 19821 87 0 0 94 80 35 0.578 3.94 2.02 Intr + 21319 21535 217 0 1 95 56 37 0.611 -0.72 2.03 Intr + 22231 22423 193 2 1 50 100 83 0.356 4.25 2.04 Intr + 29080 29195 116 1 2 42 115 27 0.180 0.89 2.05 Term + 34586 34677 92 0 2 113 49 53 0.198 1.78 2.06 PlyA + 36647 36652 6 1.05 3.06 PlyA - 37423 37418 6 1.05 3.05 Term - 42405 42168 238 2 1 58 40 127 0.883 0.54 3.04 Intr - 44254 44157 98 1 2 106 75 105 0.259 9.81 3.03 Intr - 51134 51069 66 2 0 143 81 -12 0.001 2.70 3.02 Intr - 64977 64709 269 1 2 25 40 227 0.018 8.85 3.01 Init - 65138 65063 76 2 1 88 56 103 0.995 6.42 3.00 Prom - 69411 69372 40 -4.86 4.06 PlyA - 73156 73151 6 1.05 4.05 Term - 77015 76918 98 0 2 89 44 113 0.835 5.13 4.04 Intr - 79524 79479 46 0 1 89 98 40 0.708 3.08 4.03 Intr - 85138 85089 50 2 2 63 116 -6 0.443 -1.90 4.02 Intr - 92838 92674 165 1 0 30 111 86 0.704 5.03 4.01 Init - 97755 97632 124 0 1 79 46 170 0.850 10.29 4.00 Prom - 98883 98844 40 -7.86 5.00 Prom + 98945 98984 40 -7.96 5.01 Init + 100001 100159 159 1 0 81 72 127 0.566 10.49 5.02 Intr + 116560 116689 130 0 1 82 -34 131 0.320 0.57 5.03 Intr + 122104 122231 128 2 2 62 78 67 0.674 3.50 5.04 Intr + 122775 122841 67 2 1 56 97 85 0.883 4.68 5.05 Intr + 127490 127533 44 0 2 110 109 6 0.959 2.86 5.06 Intr + 130066 130207 142 0 1 70 87 89 0.997 6.93 5.07 Intr + 131161 131239 79 2 1 78 115 56 0.999 5.91 5.08 Intr + 131718 131793 76 0 1 88 88 28 0.999 2.32 5.09 Intr + 133903 134016 114 0 0 87 68 153 0.881 13.84 5.10 Intr + 134215 134308 94 0 1 67 87 102 0.995 7.54 5.11 Intr + 135674 135879 206 0 2 112 70 221 0.999 21.62 5.12 Intr + 138651 138794 144 2 0 80 105 271 0.989 28.48 5.13 Intr + 140317 140397 81 0 0 108 78 85 0.987 9.33 5.14 Intr + 140848 140919 72 0 0 34 91 70 0.342 1.50 5.15 Intr + 155903 156052 150 1 0 25 62 136 0.651 5.06 5.16 Intr + 156230 156313 84 1 0 68 76 162 0.998 13.02 5.17 Intr + 156625 156729 105 0 0 -3 85 100 0.665 1.01 5.18 Intr + 157232 157326 95 1 2 107 74 109 0.997 10.16 5.19 Intr + 157922 157989 68 2 2 73 94 43 0.998 1.95 5.20 Term + 158374 158495 122 2 2 92 43 127 0.999 7.24 5.21 PlyA + 158727 158732 6 1.05 6.15 PlyA - 159838 159833 6 1.05 6.14 Term - 160132 160010 123 1 0 120 49 336 0.998 31.28 6.13 Intr - 160896 160712 185 1 2 136 101 319 0.999 37.51 6.12 Intr - 164184 164091 94 0 1 118 76 108 0.978 12.14 6.11 Intr - 166141 166028 114 1 0 70 107 87 0.998 9.44 6.10 Intr - 167004 166839 166 2 1 75 63 311 0.944 27.26 6.09 Intr - 167389 167249 141 0 0 71 -6 129 0.231 1.27 6.08 Intr - 175578 175417 162 2 0 84 69 79 0.258 4.69 6.07 Intr - 184441 184335 107 1 2 133 59 130 0.910 13.61 6.06 Intr - 207176 207069 108 2 0 101 91 71 0.992 9.18 6.05 Intr - 208576 208428 149 2 2 91 87 231 0.902 23.25 6.04 Intr - 208936 208852 85 1 1 120 75 34 0.757 4.69 6.03 Intr - 211581 211496 86 1 2 103 63 -23 0.240 -3.76 6.02 Intr - 218111 217877 235 2 1 103 93 179 0.978 17.16 6.01 Intr - 220700 220549 152 0 2 124 119 130 0.979 19.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 48921 49008 88 2 1 72 82 53 0.958 3.91 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:63996014_64237615|GENSCAN_predicted_peptide_1|87_aa MGQQRSRRILVMLEALDALAYTLGKTRGFVEVSFAVDGDISGVWVWGQSPGGKTEAQSTK DKILFVGVYHVLDAIVGTCVDQCMKSS >gi568815583f:63996014_64237615|GENSCAN_predicted_CDS_1|264_bp atgggtcagcaaagaagtaggcgaatcctggtgatgctggaagcattggatgccctagcc tatactttagggaaaaccagaggctttgtagaggtctcttttgctgtagatggtgacatc tcaggagtctgggtgtggggacaaagtccaggtgggaaaactgaagcccagagcaccaag gataaaatcctatttgttggtgtttaccatgtgctggatgccattgtgggcacttgtgtg gatcagtgcatgaaatcatcatag >gi568815583f:63996014_64237615|GENSCAN_predicted_peptide_2|234_aa MKMEAEIGVMCLQAKNTKDHQQPPDARRQGSCKDQIDNSQGRHRKRAKHHTKPHLHLKGC PTLHRGGSAGPRTVTVRSQGACISLCLESRIPGHGQVHTLTGRQISSGGGRNQHLASPKG CEDRTCEPKEATAASDSEISSTNPAKKLNSLKTLLLVVLINQVRRRGSQGHRQKENSRQL GTSKRALDQDSEVIGLNPSSDTYQAPFTDKLFDPLPLLALNHPEPWAFTRIDDL >gi568815583f:63996014_64237615|GENSCAN_predicted_CDS_2|705_bp atgaagatggaggcagagattggagtgatgtgtctacaagccaagaacacaaaggatcac cagcagccaccagacgctaggagacagggcagttgtaaagaccaaatagacaacagccaa ggaaggcataggaagagagcaaagcaccacacgaagccccacctgcacctcaaaggctgt ccaaccctccaccgaggtggctctgctgggcctaggacagtcacagtcaggtctcaaggt gcttgcatctccctgtgcctagagagccgcattccaggacacgggcaggtccacacctta acaggtagacagatctccagtggaggtggaagaaatcagcacttggccagtcccaagggc tgtgaggacaggacctgtgagcccaaggaagcaactgctgcctcagactcagaaattagc tctactaacccagcgaaaaagctaaacagccttaagactcttttactggttgttttaatc aaccaggtacgaagaaggggatcccagggtcacagacagaaagaaaattctaggcaactt ggtacttcaaagagagcactggaccaggattcagaagtcataggtttaaatcccagctca gacacttaccaagccccttttactgacaaactctttgatcccctacccctactcgccctc aaccaccctgagccctgggcctttactagaatagatgacctctga >gi568815583f:63996014_64237615|GENSCAN_predicted_peptide_3|248_aa MAGCRSRALPRGEAAKARREIKLSAGPASGSECGARRAHTHPELALARKYGPQVPPAAPV PTRSPGSHPGLSLHTSSQAKGAGSGLSQPRKGLPQCSSGLKGSSSAAKVGTRAEEGFFGR GQREEPIPKSSIREGVMDCMFQASMRSPNMEPFKQQKVEDFYDIGEELGRFFSCPWWHQQ KDAILGNKPQPALPMMLDSVPSLGHVLHAGVCAEYCGYKDEQDETVSALLELHPGSYIHA NKGILGLD >gi568815583f:63996014_64237615|GENSCAN_predicted_CDS_3|747_bp atggcgggctgcaggtcccgagccctgccccgtggggaggcagctaaggcccggcgagaa atcaagctcagcgctgggccagccagcggctctgagtgcggggcccgccgagcccacacc cacccggaactcgcgctggcccgcaagtacggcccgcaagtaccgcccgcagccccggtt cccacccgcagccccggttcccacccgggcctctccctccacacctcctcgcaagctaag ggagccggctctggcctcagccagcccaggaaggggctcccacagtgcagcagcgggctg aagggctcctcaagtgccgccaaagtgggaacccgggcagaggagggattttttggcaga gggcagagagaggaacccattcccaagtcctctattagagaaggcgtgatggattgtatg ttccaggcctcaatgaggagtccaaacatggagccattcaagcagcagaaggtggaggac ttttatgacatcggagaggagctggggaggttcttctcctgcccttggtggcaccagcag aaagatgccattctcggcaacaaaccacagccagccctgccaatgatgcttgattcagtg ccctcacttggccatgtcctgcatgcaggggtctgtgctgagtactgtgggtacaaagac gagcaagatgagacagtctctgccctcctggagctccaccctggcagttacatacatgct aacaagggcattcttggattggactag >gi568815583f:63996014_64237615|GENSCAN_predicted_peptide_4|160_aa MQRVSGLLSWTLSRVLWLSGLSEPGAARQPRIMEEKALEVYDLIRTIRDPEKPNTLEELE VVSESCVEVQEINEEEYLVIIRFTPTVPHCSLATLIGLCLRVKLQRCLPFKHKLEIYISE GTHSTEEDINKQINDKERVAAAMENPNLREIVEQCVLEPD >gi568815583f:63996014_64237615|GENSCAN_predicted_CDS_4|483_bp atgcagcgggtgtccgggctgctctcctggacgctgagcagagtcctgtggctctccggc ctctctgagccgggagctgcccggcagccccggatcatggaagagaaagcgctagaagtt tatgatttgattagaactatccgggacccagaaaagcccaatactttagaagaactggaa gtggtctcggaaagttgtgtggaagttcaggagataaatgaagaagaatatctggttatt atcaggttcacgccaacagtacctcattgctctttggcgactcttattgggctgtgctta agagtaaaacttcagcgatgtttaccatttaaacataagttggaaatctacatttctgaa ggaacccactcaacagaagaagacatcaataagcagataaatgacaaagagcgagtggca gctgcaatggaaaaccccaacttacgggaaattgtggaacagtgtgtccttgaacctgac tga >gi568815583f:63996014_64237615|GENSCAN_predicted_peptide_5|719_aa MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAVVSKHQSPK ITTSLLPINNGSKENGIHEEQDQEPQDLFAGKFGLKNATVELSLDSTQNNQKKVLAKTLI SLPPQEATNSSKPQPTYEELEEEEQEDQFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLP LFRSKQFAVKRRFSDFLGLYEKLSEKHSQNGFIVPPPPEKSLIGMTKVKVGKEDSSSAEF LEKRRAALERYLQRIVNHPTMLQDPDVREFLEKEELPRAVGTQTLSGAGLLKMFNKATDA VSKMTIKMNESDIWFEEKLQEVECEEQRLRKLHAVVETLVNHRKELALNTAQFAKSLAML GSSEDNTALSRALSQLAEVEEKIEQLHQEQANNDFFLLAELLSDYIRLLAIVRAAFDQRM KTWQRWQDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESRVTQYERDFERISTVV RKEVIRFEKEKSKDFKNHVIKYLETLLYSQQQWGPGARAWVNERCGLDRRGKPCRGFGGR LDLPASRTMVRALEGPQGSEGQVFRVEVLCSGRRHTVPRRYSEFHALHKRIKKLYKVPDF PSKRLPNWRTRGLEQRRQGLEAYIQGILYLNQEVPKELLEFLRLRHFPTDPKASNWGSQQ HQRPVLSFHVDPYVCNPSPESLPNVVVNGVLQGLYSFSISPDKAQPKAACHPAPLPPMP >gi568815583f:63996014_64237615|GENSCAN_predicted_CDS_5|2160_bp atggcgtcgggtggtggtggctgtagcgcttcggagagactgcctccgcccttccccggc ctggagccggagtccgagggggcggccgggggatcagaacccgaggctggggacagcgac accgagggggaggacattttcaccggcgccgcggtggtcagtaaacatcagtctccaaag ataactacatcccttcttcccatcaacaatggctccaaagaaaatgggatccatgaagaa caagaccaagagccacaggatctctttgcaggcaagtttggactcaaaaatgccacagtg gagctatccttggacagcacacaaaataatcagaagaaggtgctagccaaaacactcatt tctcttcctcctcaggaagccacaaattcttcgaagccccagccaacctatgaggagcta gaggaagaagaacaggaggatcaatttgatttgacagtcggtataactgatcctgagaag ataggggatggtatgaatgcatatgtagcctacaaagttacaacacagacaagcttacca ttgttcagaagcaaacagtttgcagtaaaaagaagatttagtgactttctgggtctttat gagaagctttccgagaagcactctcagaatggcttcattgtccctccgcccccggagaag agcctcatagggatgacaaaagtgaaagttgggaaggaagattcttcttctgcagaattt cttgaaaaacggagggccgctttagaaaggtaccttcagaggattgtaaatcatcctacc atgttacaggaccctgacgtcagagagttcttggaaaaagaagagctgccacgtgccgtg ggtacccagacattgagtggtgctggtctcctcaagatgttcaacaaagccacagatgcc gtcagcaaaatgaccatcaagatgaatgaatcagacatttggtttgaggagaagctccag gaggtagagtgtgaggagcagcgcttacggaaactgcatgctgttgtagaaactctagtc aaccataggaaagagctagcgctgaacacagcccagtttgcaaagagtctagccatgctt gggagctctgaggacaacacggcattgtcacgggcactctcccagctggctgaggtggaa gaaaaaattgagcagctccaccaggaacaggccaacaatgacttcttcctccttgctgag ctcctgagtgactacattcgcctcctggccatagtccgcgctgccttcgaccagcgcatg aagacatggcagcgctggcaggatgcccaagccacactgcagaagaagcgggaggccgag gctcggctgctgtgggccaacaagcctgataagctgcagcaggccaaggacgagatcctc gagtgggagtctcgggtgactcaatatgaaagggacttcgagaggatttcaacagtggtc cgaaaagaagtgatacggtttgagaaagagaaatccaaggacttcaagaaccacgtgatc aagtaccttgagacactcctttactcacagcagcagtggggacccggggcccgggcctgg gtgaacgagcgctgcgggctggaccgtcgggggaaaccttgtcgagggtttgggggccgc ttggatttgccagcctcgcggaccatggttcgagctcttgagggaccccagggctccgag gggcaggtgttccgagtggaggtgctgtgcagcgggcgcagacacacggtgccaaggcgc tacagcgagttccacgcgctgcacaagcggatcaagaagctgtacaaagtgcccgacttc ccctcgaaacgcctgcccaactggaggaccagagggttggaacagcgccggcagggcttg gaggcttacatccagggcatcctgtacctgaaccaggaggtgcccaaggagttactggaa ttcctgagacttcggcacttccccacagaccccaaggctagcaactggggctcccagcag caccagcggcctgtcctgagcttccatgtggatccctatgtttgcaacccctccccagag tcgctgcccaacgtggtggtgaatggtgtgctccagggcctctacagcttcagcatcagc ccagataaagcccagccaaaggcggcctgtcaccctgctcctctgccaccgatgccctga >gi568815583f:63996014_64237615|GENSCAN_predicted_peptide_6|635_aa XEGLPQVYYFGPCGKYNAMVLELLGPSLEDLFDLCDRTFTLKTVLMIAIQLLSRMEYVHS KNLIYRDVKPENFLIGRQGNKKEHVIHIIDFGLAKEYIDPETKKHIPYREHKSLTGTARY MSINTHLGKEQSRRDDLEALGHMFMYFLRGSLPWQGLKADTLKERYQKIGDTKRNTPIEA LCENFPEEMATYLRYVRRLDFFEKPDYEYLRTLFTDLFEKKGYTFDYAYDWVGRPIPTPV GSVHVDSGASAITRESHTHRDRPSQQQPLRNQVVSSTNGELNVDDPTGAHSNAPITAHAE VEVVEEANLLLVFTPPSTLSSIKPSELKALSAEGDQEAGGKAKKVGGKWQTWAGEEPSPY PRPTYLLTSSASYRRPVEQRSGRGGPRSMPGLVVFSEPGPVLCEAAFRRCPGCFRLRLWM LRLSERNMKVLLAAALIAGSVFFLLLPGPSAADEKKKGPKVTVKVYFDLRIGDEDVGRVI FGLFGKTVPKTVDNFVALATGEKGFGYKNSKFHRVIKDFMIQGGDFTRGDGTGGKSIYGE RFPDENFKLKHYGPGWVSMANAGKDTNGSQFFITTVKTAWLDGKHVVFGKVLEGMEVVRK VESTKTDSRDKPLKDVIIADCGKIEVEKPFAIAKE >gi568815583f:63996014_64237615|GENSCAN_predicted_CDS_6|1908_bp ngtgaaggtctcccacaggtgtattactttggaccatgtgggaaatataatgccatggtg ctggagctccttggccctagcttggaggacttgtttgacctctgtgaccgaacatttact ttgaagacggtgttaatgatagccatccagctgctttctcgaatggaatacgtgcactca aagaacctcatttaccgagatgtcaagccagagaacttcctgattggtcgacaaggcaat aagaaagagcatgttatacacattatagactttggactggccaaggaatacattgacccc gaaaccaaaaaacacataccttatagggaacacaaaagtttaactggaactgcaagatat atgtctatcaacacgcatcttggcaaagagcaaagccggagagatgatttggaagcccta ggccatatgttcatgtatttccttcgaggcagcctcccctggcaaggactcaaggctgac acattaaaagagagatatcaaaaaattggtgacaccaaaaggaatactcccattgaagct ctctgtgagaactttccagaggagatggcaacctaccttcgatatgtcaggcgactggac ttctttgaaaaacctgattatgagtatttacggaccctcttcacagacctctttgaaaag aaaggctacacctttgactatgcctatgattgggttgggagacctattcctactccagta gggtcagttcacgtagattctggtgcatctgcaataactcgagaaagccacacacatagg gatcggccatcacaacagcagcctcttcgaaatcaggtggttagctcaaccaatggagag ctgaatgttgatgatcccacgggagcccactccaatgcaccaatcacagctcatgccgag gtggaggtagtggaggaagctaatttactcctggtttttaccccaccctcaaccctctcc agcataaaacctagtgagctaaaggctttgtctgcagaaggagatcaagaggctgggggt aaggccaagaaggtaggaggaaaatggcagacctgggctggagaagaaccttctccgtat cccaggcccacgtatttgctaacctcaagcgccagctaccgcaggccagtggaacagcgc tccgggcggggcgggccgcggagcatgccggggcttgtggtcttctcggagcccgggccc gtgctctgcgaggccgccttccgccgctgtccgggctgctttcgcctccgcctgtggatg ctgcgcctctccgaacgcaacatgaaggtgctccttgccgccgccctcatcgcggggtcc gtcttcttcctgctgctgccgggaccttctgcggccgatgagaagaagaaggggcccaaa gtcaccgtcaaggtgtattttgacctacgaattggagatgaagatgtaggccgggtgatc tttggtctcttcggaaagactgttccaaaaacagtggataattttgtggccttagctaca ggagagaaaggatttggctacaaaaacagcaaattccatcgtgtaatcaaggacttcatg atccagggcggagacttcaccaggggagatggcacaggaggaaagagcatctacggtgag cgcttccccgatgagaacttcaaactgaagcactacgggcctggctgggtgagcatggcc aacgcaggcaaagacaccaacggctcccagttcttcatcacgacagtcaagacagcctgg ctagatggcaagcatgtggtgtttggcaaagttctagagggcatggaggtggtgcggaag gtggagagcaccaagacagacagccgggataaacccctgaaggatgtgatcatcgcagac tgcggcaagatcgaggtggagaagccctttgccatcgccaaggagtag