GENSCAN 1.0 Date run: 2-Nov-116 Time: 21:18:43

Sequence gi568815597f:198132581_198419519 : 286939 bp : 36.24% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  24412  24516  105  0  0   59   59   148 0.988   7.49
 1.02 Term +  24937  25101  165  0  0   36   45   279 0.932  15.33
 1.03 PlyA +  25703  25708    6                               1.05

 2.05 PlyA -  26194  26189    6                               1.05
 2.04 Term -  26387  26316   72  2  0  118   41    36 0.266  -1.17
 2.03 Intr -  26707  26597  111  1  0  128   19    53 0.219   2.06
 2.02 Intr -  32255  32126  130  1  1   28   94    81 0.168   2.48
 2.01 Init -  33374  33205  170  2  2   48   34   194 0.269   9.05
 2.00 Prom -  49439  49400   40                              -1.45

 3.05 PlyA -  50102  50097    6                               1.05
 3.04 Term -  65448  65209  240  0  0   72   41   164 0.010   5.14
 3.03 Intr -  80252  79944  309  2  0  116   47    86 0.103   2.88
 3.02 Intr -  87786  87694   93  0  0   49   62    87 0.260   1.44
 3.01 Init -  96072  95872  201  0  0   60   86   111 0.505   7.02
 3.00 Prom -  99199  99160   40                              -4.65

 4.02 PlyA - 100313 100308    6                               1.05
 4.01 Sngl - 118234 117215 1020  1  0   58   48   352 0.803  24.99
 4.00 Prom - 118391 118352   40                              -9.85

 5.00 Prom + 120055 120094   40                              -9.45
 5.01 Init + 120478 120600  123  0  0   64  110    74 0.707   7.42
 5.02 Intr + 129995 130057   63  1  0   81   86    91 0.942   6.20
 5.03 Intr + 131545 131655  111  0  0   76  115   100 0.997  11.16
 5.04 Intr + 160365 160459   95  2  2   95   92    64 0.152   5.34
 5.05 Intr + 164525 164660  136  2  1   18  100    35 0.060  -2.65
 5.06 Term + 166291 166452  162  0  0   13   33   171 0.231   1.05
 5.07 PlyA + 166739 166744    6                               1.05

 6.02 PlyA - 166986 166981    6                               1.05
 6.01 Sngl - 180474 179071 1404  0  0   44   40   494 0.424  35.94
 6.00 Prom - 180567 180528   40                              -6.15

 7.07 PlyA - 180736 180731    6                               1.05
 7.06 Term - 182065 181218  848  2  2   18   48   727 0.002  53.75
 7.05 Intr - 196429 196277  153  2  0   18   62   133 0.260   2.82
 7.04 Intr - 198169 198029  141  2  0   29  100    83 0.148   3.00
 7.03 Intr - 205103 204906  198  0  0   10   33   153 0.161   0.40
 7.02 Intr - 208651 208526  126  2  0   55   74   128 0.492   7.83
 7.01 Init - 223060 222862  199  1  1   95  100    73 0.583   8.32
 7.00 Prom - 225994 225955   40                              -4.15

 8.09 PlyA - 226284 226279    6                               1.05
 8.08 Term - 247091 246917  175  1  1   77   42    80 0.016  -1.45
 8.07 Intr - 252610 252460  151  2  1   79   37    86 0.129   0.90
 8.06 Intr - 263960 263809  152  1  2    3  102    95 0.158   1.29
 8.05 Intr - 272216 272081  136  0  1   44   65    98 0.405   1.81
 8.04 Intr - 272671 272508  164  0  2   68   89   123 0.535   9.10
 8.03 Intr - 281643 281419  225  2  0   20   69   173 0.509   4.78
 8.02 Intr - 283958 283753  206  2  2   53   93   114 0.681   5.48
 8.01 Init - 285438 285364   75  0  0   76   58    63 0.377   3.24

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  15357  15423   67  2  1   89   80    63 0.866   6.89
S.002 Sngl - 181973 181218  756  2  0   88   48   688 0.997  60.69
S.003 Term + 186832 186942  111  0  0  124   41   148 0.982  11.28

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:198132581_198419519|GENSCAN_predicted_peptide_1|89_aa
MFTLLTAAAAGGGSGVAGGWAAARLALRTRLAVRRRPEAAKGAAVTKKLRGHGGGGNRSL
VSGPEELKRSPAFRRMLIGDVGYLSLADW
>gi568815597f:198132581_198419519|GENSCAN_predicted_CDS_1|270_bp
atgtttacactcctgacagcggcggcagcaggaggaggatcgggagtcgcgggaggatgg
gccgccgctaggctcgcactccggacgcgcctcgcagtgcgcaggaggcccgaggccgcc
aagggcgccgcggtaactaagaaactccgtggccacggcggcggcggcaacaggtctttg
gtttcgggacccgaggaacttaagcgaagtcccgccttccggcggatgctcattggcgac
gtcggttatttatctctcgcggattggtga
>gi568815597f:198132581_198419519|GENSCAN_predicted_peptide_2|160_aa
MQVEAAIAEVEATASYPEDLAKIIDKGGYTKQQISNMDKTAFYLKKMPSRTFIAREREIF
HERKSQLMWQALWLSTFKKTTQPPAFSDRHRNHLAAINIEAGPQPGAPSADIPRKRTNAA
FSEPRAGVGSTRPPEREAFLSPHTVTAPLTPPRGKAFSKL
>gi568815597f:198132581_198419519|GENSCAN_predicted_CDS_2|483_bp
atgcaagttgaagcagcaattgctgaagtagaagctacagcaagttatccagaagatcta
gctaagatcattgataaaggtggctacactaaacaacagatttccaatatggataaaaca
gccttctatttgaaaaagatgccatctaggacttttatagctagagagagagaaattttt
catgaaagaaagagtcaattgatgtggcaagctttatggttgtctacttttaagaaaacg
acacagccaccagccttcagtgaccgtcaccgtaatcatttagcagccatcaacattgag
gctggccctcagcctggcgccccttccgcagacatccctagaaaaagaactaacgcggcc
ttctccgagcccagggctggagtaggaagtacccgccctcccgaacgcgaggccttcctc
tcaccccatacagttactgcccctttgactcctccgagaggcaaagctttttcaaagctc
taa
>gi568815597f:198132581_198419519|GENSCAN_predicted_peptide_3|280_aa
MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLFNEIKEDTKKWKNIPCSWVGRINIV
KMAILPKELGLFGEMIGSRTQAEKEQAESETFCCAKDKMLKWTVLVGLQPGGGTCKRGSA
VVVTADFLLALCWGGTLVFQAMGWAMELPNVPVHCVTLQGQIEEQNQVWAGSGKSMLWLP
TCRHKQWPQWGSEGSSLAAGGVGPLILCHSQGIQDANPRVPTLLHTHHLPLDASCESLGS
WHVRTRSDRGQEDADENEESSQKDAQAPRAWQAFLLLFHQ
>gi568815597f:198132581_198419519|GENSCAN_predicted_CDS_3|843_bp
atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt
acaagggacgtgaaggacctcttcaaggagaactacaaaccactgttcaatgaaataaaa
gaggatacaaagaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg
aaaatggccatactgcccaaggaattagggctttttggagagatgattggttctaggact
caggctgagaaagaacaagctgagtctgaaacattttgttgtgccaaagacaagatgttg
aaatggactgtgctggttggcctccagccgggaggtggcacttgcaaaagaggatcagcc
gtagtggtgacagcagattttctgcttgccttatgttggggtggtactctggtatttcag
gcaatgggctgggccatggagctcccaaatgtccctgtccattgtgttaccctacaaggg
cagatagaggagcaaaaccaggtgtgggctgggtcaggcaagtccatgctctggctcccc
acgtgcaggcacaagcagtggccccagtggggatcagagggcagttccctggctgctggc
ggggtgggccccctcatcctgtgccacagccagggtatccaggatgccaatcctcgggtc
cctaccctcctccatacccaccacctgcccctggatgcctcctgtgaatcccttggctcc
tggcatgttcggactaggagtgatagaggacaagaagatgcagatgaaaatgaagaaagc
tcacaaaaagatgcacaagcaccacgagcatggcaagcattcctcctcctcttccatcag
tga
>gi568815597f:198132581_198419519|GENSCAN_predicted_peptide_4|339_aa
MIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS
KRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIY
RFNAVPIKLPMTFFTELEKTALKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATV
TKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKHSLFNKRCWENWLAI
CRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKSLEENLGVTIQDIGMGKDFMSKTPKAMA
TKAKIDKWDLIKLKSTAKETTISEQATYKMGENFRNPLI
>gi568815597f:198132581_198419519|GENSCAN_predicted_CDS_4|1020_bp
atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagc
aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac
accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca
aagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggag
aactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacattcca
tgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttat
agattcaatgccgttcccatcaagctaccaatgactttcttcacagaattggaaaaaact
gctttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaattctaagccaa
aagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctacagta
accaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaacagag
ccctcagaaataatgccacatatctacaactatctgatctttgacaaacctgagaaaaac
aagcaatggggaaagcattccctatttaataaacggtgctgggaaaactggctagccata
tgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaagatgg
attaaagacttaaacgttagacctaaaaccataaaaagcctagaagaaaacctaggcgtt
accattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggca
acaaaagccaaaattgacaaatgggatctaattaaactaaagagcacagcaaaagaaact
accatcagtgaacaggcaacctacaaaatgggagaaaattttcgcaacccactcatctga
>gi568815597f:198132581_198419519|GENSCAN_predicted_peptide_5|229_aa
MGYNTLANFRIEKKIGRGQFSEVYRAACLLDGVPVALKKVQIFDLMDAKARADCIKEIDL
LKQLNHPNVIKYYASFIEDNELNIVLELADAGDLSRMIKLVRLITCLQREYMKMDTTSNL
TSGLLAVYYMSFIGGILQMAALQSPFYGDKMNLYSLCKKIEQCDYPPLPSDHYSEEDFND
DTTSGSLKSACLQGYFTAAKCEVGYADSQPRGKYQSLAQIEGLRVGKGK
>gi568815597f:198132581_198419519|GENSCAN_predicted_CDS_5|690_bp
atgggctataatacattagccaactttcgaatagaaaagaaaattggtcgcggacaattt
agtgaagtttatagagcagcctgtctcttggatggagtaccagtagctttaaaaaaagtg
cagatatttgatttaatggatgccaaagcacgtgctgattgcatcaaagaaatagatctt
cttaagcaactcaaccatccaaatgtaataaaatattatgcatcattcattgaagataat
gaactaaacatagttttggaactagcagatgctggcgacctatccagaatgatcaagttg
gtacgccttattacatgtctccagagagaatacatgaaaatggatacaacttcaaatctg
acatctggtctcttggctgtctactatatgagtttcattgggggcattttacagatggct
gcattacaaagtcctttctatggtgacaaaatgaatttatactcactgtgtaagaagata
gaacagtgtgactacccacctcttccttcagatcactattcagaagaagattttaatgat
gataccactagtggttcgttgaagtcggcctgcctgcagggctacttcactgcagctaag
tgtgaagttggatatgcggattctcagccaaggggcaaatatcaaagtctggcccagata
gaaggtcttagagttgggaaaggaaaatag
>gi568815597f:198132581_198419519|GENSCAN_predicted_peptide_6|467_aa
MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLQPKSTENTFFSAPRHTYS
KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWV
HNEMKAEIKMFFETNENKDTTYKNLWDSFKAVCRGQFIALNAHKRNQERSKTDTLTSQLK
ELEKQEQTHSKASTRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRLLARLIK
KKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN
QEEVESLNKPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEQVPFLLKLFQSIEKE
RILPNSFYEASIILIPKPGRDTTKKGNFRPISLMNIDAKILNKILANRIQQHIKKLIHHD
QVGFIPGMQGWFNIHKSINVIQHINRTKQKNHMIISIDAEKAFDKIQ
>gi568815597f:198132581_198419519|GENSCAN_predicted_CDS_6|1404_bp
atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac
aaggatacccaggaattgaactcagctctacaccaagcggacctaatagacatctacaga
actctccaacccaaatcaacagaaaatacatttttttcagcaccacgccacacctattcc
aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt
ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc
actcaaaatcgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca
acatacaagaatctctgggactcatttaaagcagtgtgtagagggcaatttatagcacta
aatgcccacaagagaaaccaggaaagatccaaaactgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacattcaaaagctagcacaaggcaagaaataactaaa
atcagagcagaactgaaggaaatagagacgcaaaaaacccttcaaaaaattaatgaatcc
aggagctggttttttgaaaggatcaacaaaattgatagactgctagcaagactaataaag
aaaaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggatatcaccacc
gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa
ctagaaaatctagaagaaatggataaattccttgacacatacactctcccaagactaaac
caggaagaagttgaatctctgaataaaccaataacaggagctgaaattgtggcaataatc
aatagcttaccaaccaaaaagagtccaggaccagatggatttacagccgaattctaccag
aggtacaaggaggaacaggtaccattccttctgaaactattccaatcaatagaaaaagag
agaatcctccctaactcattttatgaggccagcatcatcctgataccaaagccgggcaga
gacacaaccaaaaaagggaattttagaccaatatccttgatgaacattgatgcaaaaatc
ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat
caagtgggcttcatccctgggatgcaaggctggttcaacatacacaaatcaatcaatgta
atccagcatataaacagaaccaaacagaaaaaccacatgattatctcaatagatgcagaa
aaggcctttgacaaaattcaataa
>gi568815597f:198132581_198419519|GENSCAN_predicted_peptide_7|554_aa
MAEGKAEAATFFTRQQRECQTVIKPSDLMRTHSLSREQHGGNHPHDSIASTWFLPQHLKI
MGITIQGFQEFFPPRPSSTPPPPSRVSAVYRSSLGMAAAQQTQELGAHVHSCTAIKNYLR
LGIYKEKRFNWLLVPQAVPEACWGGLRKLAIMAEGKEERGTAYMGGAGKESEGGESDQVN
KVRDAGGKVVGDIILDSIVWEVLFKREPLNISGRKELAKRKDRAGAVCRTTAPEDIGDEE
AAAPAGHPGLRQRRWWKSASLRQGGCPSQAAVEGKLTNRNDIHTKNPPVHHHHQRPKVDK
TTKMGKKQSRKTGNSKKQSASPPPKECSSSPATEQSWTDNDFDKLREEDFRRSNYSELRE
DIQTKGKEVENFEKNLEECITRITITEKCLKELMELKTKARELREECRSLRSQCNQLEER
VSAMEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKGPNLRLIGVPESDGENGTKLENT
LQDIIRENFPNLARQVNIQIQEIQRMPQRYSSRRATPRHIIVTKVEMKEKMLGAAREKGR
VTLKGKPIRLTADL
>gi568815597f:198132581_198419519|GENSCAN_predicted_CDS_7|1665_bp
atggcggaaggcaaagcagaagcagctaccttcttcacaaggcagcagcgggaatgccag
acggttataaaaccatcagatctcatgagaactcactcactatcacgagaacagcacgga
ggaaaccacccccatgattcaattgcctccacctggttcctccctcaacacctgaaaatt
atggggattacaattcaagggtttcaagagttctttcctcctcgtccttcctcaacacct
ccaccaccatcaagggtgtcagcggtttacagaagtagtctgggaatggcagctgcccag
caaactcaagaacttggggctcatgtccattcttgcactgctataaagaactacctgaga
ctgggtatttataaagaaaagaggtttaattggctcctggttccacaggctgtaccagaa
gcatgctggggcggtctcaggaaacttgcaatcatggcagaaggcaaagaggaaagaggc
acagcttacatgggtggagcaggaaaagagagtgaagggggagagagtgaccaagtcaac
aaggtcagggatgctggaggaaaggtggttggtgacatcatattagatagcatagtctgg
gaagtcctcttcaagagggagcctttgaacatcagtggtaggaaggaattggctaagaga
aaagatagggctggagctgtctgcaggactacggctccagaggacattggagatgaggag
gcagctgcgcctgcagggcatccagggctgagacagaggaggtggtggaagtctgcaagt
ctgcgccaaggaggttgtccgagccaagcagcagtcgaaggaaaactaacaaacagaaac
gacatccacaccaaaaacccacctgtacatcaccatcatcaaagaccaaaagtagataaa
accacaaagatggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcgcc
tctcctcctccaaaggaatgcagttcctcaccagcaacggaacaaagctggactgacaat
gactttgacaagctgagagaagaagacttcagacgatcaaattactccgagctacgggag
gacattcaaaccaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtata
actagaataaccattacagagaagtgcttaaaggagctgatggagctgaaaaccaaggct
cgagaactacgtgaagaatgcagaagcctcaggagccaatgcaatcaactggaagaaagg
gtatcagcgatggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaa
agaataaaaagaaacgagcaaagcctccaagaaatatgggactatgtgaaaggaccaaat
ctacgtctgattggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacact
ctgcaggatattatccgggagaacttccccaatctagcaaggcaggtcaacattcagatt
caggaaatacaaagaatgccacaaagatactcctcgagaagagcaactccaagacacata
attgtcaccaaagttgaaatgaaggaaaaaatgttaggggcagccagagagaaaggtcga
gttaccctcaaagggaagcccatcagactaacagcggatctctag
>gi568815597f:198132581_198419519|GENSCAN_predicted_peptide_8|427_aa
MMFRQPTALEQDAELDIPIGTISRQLCMGKDLPNYISHILLPADSLLGSANRTALAGDEE
AGLEIRTLLLIQIVVALADSCWRCSSGTEGRMKRIQLLDCEDLSIAHGKVHLVIRNIDIQ
ATASEELTFAPADDFSPGQHLDCYLMRDSGPEPPSKAAPGFLTLRNTVNGFVVSLTSGVK
LQMQTLAVSVTAHKGGMSRVVCSSQWVHGLTDFRNEAADADPRARHRALIGAFLQSADWC
VYNPLARQKSSPSSHWTQEVQLASPLTVRISSAISDELGMLDRNKQDKERKHSKPKMLCD
EAGPCSPHCAQGTFQPFIKGPLATFHVVHYTDEAENLPVYLSVMSTTEAWLPLRNGGNSK
AIGCSLYLSRHSGLADHSIYHTVLQMPVHFSICTPSRNYVYLLHFISSVLATVSGPLQAL
SKYLSNE
>gi568815597f:198132581_198419519|GENSCAN_predicted_CDS_8|1284_bp
atgatgttcagacaacctacggcactagagcaggatgcggaacttgacatacctatagga
actatctccagacaactttgtatgggcaaggatttgccaaattacatttcccacattctt
ttgccagctgactctctgttaggttctgccaataggacagcactggcgggagatgaggag
gctggactggagataagaaccctgcttctgattcagatagttgttgcacttgctgacagt
tgctggaggtgctctagtggcactgagggtaggatgaaaagaatccaactgctagattgt
gaggatctttcaattgcccatggaaaggtgcaccttgtgataaggaacatagacatccag
gcaacagccagtgaggaactaacgtttgctcctgcagatgatttcagccctggccaacac
cttgactgctaccttatgagagactctggaccagaaccacccagtaaagcagcacctgga
ttcctgactctcagaaacactgttaatgggttcgtggtctcgctgacttcaggagtgaag
ctgcagatgcagactctggcagtgagtgttacagctcataaaggtggcatgtccagagtt
gtttgttcctcccagtgggttcatggtctcactgacttcaggaatgaagctgcagacgca
gaccctcgtgctagacacagagcgttgattggtgcatttttacagagtgctgattggtgc
gtttacaatcctttagctagacagaaaagttctccaagctcccactggacccaggaagtc
cagctggcttcacctctcacagttagaatatcttctgccatcagtgatgagcttggaatg
ttggatcgtaataaacaagacaaggagaggaaacacagcaaaccaaagatgctttgtgat
gaggcagggccatgcagcccacactgtgctcagggcacttttcagccatttatcaaagga
cccctggccacgtttcatgtggttcactacactgatgaagctgaaaatttgccagtctac
ctttcagtgatgagcaccacagaagcgtggttacctctcagaaatggaggcaacagcaag
gccattggatgcagtctttatttgtccaggcattcaggccttgctgatcatagcatttat
cacacagtattacaaatgcctgttcacttctctatttgcaccccaagcaggaactatgtc
taccttcttcactttatatcctcagtgcttgccacagtgtctggcccattgcaggcactc
agtaaatacctgtcaaatgaatga