GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:14:50 Sequence gi568815595f:98253854_98454816 : 200963 bp : 37.23% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 6044 6306 263 0 2 86 45 214 0.910 11.60 1.02 PlyA + 7826 7831 6 1.05 2.03 PlyA - 8884 8879 6 1.05 2.02 Term - 16459 16213 247 0 1 -4 53 203 0.195 1.88 2.01 Init - 37082 36769 314 2 2 60 64 220 0.150 13.64 2.00 Prom - 54840 54801 40 -3.75 3.00 Prom + 60054 60093 40 -4.85 3.01 Init + 73004 73118 115 1 1 73 66 94 0.955 6.12 3.02 Intr + 73779 73999 221 1 2 58 93 108 0.931 5.40 3.03 Intr + 74964 75187 224 2 2 27 91 89 0.448 -0.90 3.04 Intr + 75650 75756 107 2 2 64 81 90 0.860 4.84 3.05 Intr + 99987 100494 508 1 1 41 31 242 0.004 4.90 3.06 Intr + 108744 109007 264 0 0 62 74 114 0.029 3.10 3.07 Intr + 134159 134261 103 2 1 41 65 88 0.025 1.06 3.08 Term + 148587 148760 174 2 0 72 48 140 0.720 5.18 3.09 PlyA + 150428 150433 6 1.05 4.00 Prom + 150809 150848 40 -7.05 4.01 Init + 152726 152830 105 1 0 63 66 66 0.379 2.18 4.02 Term + 153284 153598 315 1 0 74 47 234 0.842 11.86 4.03 PlyA + 153854 153859 6 1.05 5.00 Prom + 155176 155215 40 -6.15 5.01 Sngl + 155269 156066 798 0 0 49 37 327 0.614 19.30 5.02 PlyA + 156103 156108 6 1.05 6.00 Prom + 157041 157080 40 -10.15 6.01 Init + 157101 157301 201 2 0 60 86 148 0.122 10.72 6.02 Intr + 160504 160564 61 0 1 94 89 -22 0.070 -4.11 6.03 Intr + 167668 167829 162 2 0 59 121 94 0.737 8.83 6.04 Intr + 168560 168853 294 0 0 20 73 159 0.417 3.66 6.05 Term + 174897 175393 497 1 2 48 48 266 0.049 12.34 6.06 PlyA + 175850 175855 6 1.05 7.02 PlyA - 177867 177862 6 1.05 7.01 Term - 191198 190954 245 0 2 45 42 178 0.529 3.98 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:98253854_98454816|GENSCAN_predicted_peptide_1|87_aa XRGASTCCHCHPWSQGVLQDYPGCSLKAQGLLNQLVVKLPSLRLTLQAVGSPQAQGKSRN IVEESTARIEDLKSQLGALPNCGDVRT >gi568815595f:98253854_98454816|GENSCAN_predicted_CDS_1|264_bp ngcagaggagcctccacctgttgccactgccacccctggtcacaaggagtactgcaagac taccctggatgttcccttaaggcccaaggactcttaaatcagcttgtggtaaagctgcct agcctgagactcacccttcaggcagtgggctcccctcaagcccagggaaagtctagaaat attgttgaagagtcaactgctagaattgaggacctaaagagccagcttggtgctctaccc aactgtggtgatgttcgtacctaa >gi568815595f:98253854_98454816|GENSCAN_predicted_peptide_2|186_aa MNFPELKEHIIAQCKKAKNHDKMMQVVTGKIASTKRNITDLIQLKNTLQELHNAITSIDS RIDQVEKKISELEDYLSEIKQADRNREKRMKRNKQNLKELRDGIMHEGTNPAEIEIWKLS DREFKITVLKKLRETQNNTKKEFRMISMKFNKEIERSKNNQTENLELKNAIGILKNALEP FNSKMD >gi568815595f:98253854_98454816|GENSCAN_predicted_CDS_2|561_bp atgaacttccctgagctaaaggagcacattatagcccaatgcaagaaagctaagaatcat gataaaatgatgcaggtggtgacaggcaaaatagccagtacaaagagaaatataactgac ctgatacagctgaaaaacacactacaagaacttcacaatgcaatcacaagtatcgatagc agaatagaccaagtggagaaaaaaatctcagagctcgaagactatctttctgaaataaaa caggcagacaggaatagagaaaaaagaatgaaaaggaataaacaaaacctcaaagaatta cgggatgggattatgcacgagggaaccaatcctgcagaaatagaaatatggaagctctca gacagagaattcaaaataactgtgttgaagaaactcagggaaactcaaaataacacaaag aaggaattcagaatgatatcaatgaaatttaacaaagagattgaaagaagtaaaaataat cagacagaaaatctggagctgaaaaatgcaattggcatactgaagaatgcattagagccc tttaatagcaaaatggattga >gi568815595f:98253854_98454816|GENSCAN_predicted_peptide_3|571_aa MAPPGPTNPSPVAPLRVIQCTEGQLQTPTISSLTQHISKTVHSPAVLPQEYTKKRRCILT HPPYCCVGFPWAGTGPHILYLSRLASNLELFKRGKGRGEQRKEEVTCGMLRKHLRKRCVR GLEDCPLSPQTSGNLIQGPHTPRMCELARGISQRLDGAEHRLAPAEPSHESPPARAACLP LRKSSPREPHGLKDLATTMTTTKILGRVPAIGGDFPRPTSRPEEKPGMARENHSLAAEFI LIGFTNYPELKTLLFVVFSAIYLVTMVGNLGLVALIYVERRLLTPMYIFLGNLALMDSCC SCAVTPKMLENFFSEDRIISLYECMAQFYFLCLAETTDCFLLATMAYDRYVAICHPLQYH TMMSKTLCIRMTTGAFKAGNLHSMIHVGLLLSQVGRETEKRNKTQRQSIEKQQWDQEDRH SAYQGPAPTPASESPQFLLIIILIISAKRNVVGQQGDNKEKISKKHVSKRTYVIIKFKGS LYANDSDGTGVTDFRCPNKYIVYRGVKCEEGNNRVGFTSLPHQQMEVQFASLLPTTIEDG ALAGPEAATIALSAPCTCTNTAKRTADPPPT >gi568815595f:98253854_98454816|GENSCAN_predicted_CDS_3|1716_bp atggctccacctggacccaccaacccatctcctgtggccccactcagagtgattcaatgc acagaaggacagcttcaaacccctacgatttcatctctgacccaacacatcagcaaaacg gtacactcaccagcagttctgccacaagagtacaccaaaaaaaggagatgcatcctgacg catccaccctactgctgtgtcgggtttccgtgggctggaacaggacctcacattctgtat ttgtcccgattggctagcaacttagaactttttaaaagaggcaaaggtagaggagaacaa agaaaggaggaagtcacttgtggaatgttgagaaagcacctgaggaagcggtgcgtgaga ggactggaggactgcccactctcaccacaaacctctgggaacctaatacaggggccccac acccctaggatgtgtgagctggcaagggggatctctcagagattagatggagcagagcac aggttagctccagcagaacccagtcatgaaagcccaccagccagggctgcctgtctgcct ctgagaaagtctagccccagggagccccatggcctgaaagacttagcaacaacaatgaca acaacaaaaatcttaggcagagtgccagcaatcggaggtgacttcccaaggcccacgagc agacctgaagagaagccaggaatggctagggaaaatcactccttagcagctgaattcatc ctcataggatttacaaattatccagagctgaagacgcttctgtttgtggtgttctctgcc atctatctggtcaccatggtggggaatcttggtctggtggcattaatttatgtagagcgt cgtcttctcacaccaatgtacatctttctgggcaacctggctctgatggattcctgctgt tcctgtgctgttacccccaagatgttagagaatttcttttctgaggatagaattatttcc ctgtatgaatgtatggcacaattttattttctctgtcttgctgaaaccacagactgcttt cttctggcgacaatggcctatgaccgctatgtggccatatgccacccactgcagtaccac accatgatgtccaagacgctctgcattcggatgaccacaggggccttcaaagctggaaac ctgcattccatgattcatgtagggcttttattaagtcaggtgggacgagagactgagaaa agaaataagacacagagacaaagtatagagaaacaacagtgggaccaggaggaccggcac tcagcataccaaggacctgcaccgacaccggcctctgagtcccctcagtttttattgatt attattctcattatttcagcaaaaaggaatgtagtaggacagcagggtgataataaggag aagatcagcaaaaaacatgtgagcaaaagaacctatgtcataattaagttcaagggaagc ctatatgctaatgatagtgatgggacaggagtgaccgattttagatgtcctaacaaatat atagtatatcgaggagtgaagtgtgaagaaggaaacaacagggtgggttttacttcattg cctcaccagcaaatggaagtgcagtttgcctctctgctaccaaccaccattgaagatgga gccttggcaggcccagaggcagcaaccattgccctgtcagcaccctgcacttgtactaac actgcaaagagaacagcagatcctcccccaacctga >gi568815595f:98253854_98454816|GENSCAN_predicted_peptide_4|139_aa MVMRWSQDGQIGTAPVYSSQCEQRRKRVISAFPPEGTDKQKDSSNFCRLKCPCLTALKRV VVRPARSWRSENGQTASSSGSLNPDQPNGLAPPSRGRLTPHMAGYSSETKLPEERSGSCI CGSPNSAVLQPPLFCSHRC >gi568815595f:98253854_98454816|GENSCAN_predicted_CDS_4|420_bp atggtcatgaggtggagccaagatggccaaataggaacagctccagtctacagctcccag tgtgagcaacgcagaaaacgggtgatttctgcatttccacctgagggcacagacaaacaa aaagacagcagtaacttctgcagacttaaatgtccctgtctgacagctttgaagagagta gtggttcgcccagcacgcagctggagatctgagaatgggcagactgcctcctcaagtggg tccctgaaccccgatcagcctaacgggttggcacccccaagtaggggcagactgacacct cacatggccgggtactcctctgagacaaaacttccagaggaacgatcaggcagctgcatt tgtgggtcaccaaattccgctgttctacagccaccgctgttctgcagccatcgctgctga >gi568815595f:98253854_98454816|GENSCAN_predicted_peptide_5|265_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTECTFFSAPHHTYS KVDHIVGSKALLSECKRTEIITNCLSDHSVIKQELRIKKLTQNCSTTWKLNNLLLNDYWV HDEMKAEIKMFFETNENKDTTYQKLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETHKTLQKINESRSWFLEKINKIDRLLARLIK KKRENNQIDAIKNDKGDVTTDLTEI >gi568815595f:98253854_98454816|GENSCAN_predicted_CDS_5|798_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagttaac aaggatacccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaacagaatgtacatttttttcagcaccacaccacacctattcc aaagttgaccacatagttggaagtaaagcactcctcagcgaatgtaaaagaacagaaatt ataacaaactgtctgtcagaccacagtgtaatcaaacaagaactcaggattaagaaactc actcaaaactgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta catgacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca acataccagaaactctgggacacattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaaacaagagcaaacacattcaaaagctagcagaaggcaagaaataactaag atcagagcagaactgaaggaaatagagacacataaaacccttcaaaaaatcaatgaatcc aggagctggtttttagaaaagatcaacaaaattgatagactgctagcaagactaataaag aagaaaagagagaacaatcaaatagatgcaataaaaaatgataaaggggatgtcaccact gatctcacagaaatataa >gi568815595f:98253854_98454816|GENSCAN_predicted_peptide_6|404_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRIKIV KMAIRPKAPVCMFFPHVSMCSHPLAPSWVLQKGSQRYWKKATGRRKSPLNFGTICTDREV SWPELRGGHESGVQTPQAEEEGKGGEYYIKGTCQGTKESEPQPSALDLSSDRARGNWKRN QKTNSGNMTKQGSLTPPKNCNSSLSMDPNQEEIPNLPEKEFRKLVIKLIRETAEKDEGQS GTYIIEVPHHRGPKDKPTELSFTPPMPQHAIWGPGDFPAQSTNVDTGALFLGAHGGPITL PLMSHTGLIIISPTTSQPLTTQLLTTLEPAACLATAIAITHATPTVQGPKNLPTNWLTTI TIHIQASYLRTQVLGHLDPLTLVSHVTLGPKDRHTWPTAVNTGA >gi568815595f:98253854_98454816|GENSCAN_predicted_CDS_6|1215_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctgggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactcctcaatgaaataaaa gaggatacaaataagtggaagaacattccatgctcatgggtaggaagaatcaagatcgtg aaaatggccatacggcccaaggccccagtgtgcatgtttttcccccatgtgtccatgtgt tctcatcctttagctccctcttgggtccttcagaagggcagccagaggtactggaaaaag gccactgggagaaggaaatctccactgaactttggaacaatttgtactgatcgagaagtc tcctggccagaactcaggggagggcatgaatctggtgtgcagactccacaggcagaggaa gaaggaaaaggaggagagtactatatcaagggaacatgccaggggacaaaagaatctgaa ccacagccttcagccctagacctttcctctgacagagccagaggaaactggaaaagaaac cagaaaaccaactctggtaatatgacaaaacaaggttctttaacaccccccaaaaattgc aatagctcactgtcaatggatccaaaccaagaagaaatccctaatttacctgaaaaagaa ttcaggaagttagttattaagctaatcagggagacagcagaaaaagatgaaggccaatct ggcacatacatcatagaggtcccacatcatagaggtcccaaggacaagcccactgagctc agcttcactccccctatgccacagcatgcaatctgggggcctggagatttcccggcccag tctaccaatgttgacactggagcactcttcctgggggcccatggtgggcctatcaccctg ccactcatgtcacacacaggcctaattattatttcaccaaccacatcacaaccactgaca acacaactgcttaccactctggaaccagcagcttgtcttgccactgctattgccatcacc catgccacaccaactgttcaggggcccaaaaacttacccactaactggctcaccactatc actatccacatccaagctagctacctgaggacccaagtactgggccatctggacccacta acactggtgtcccatgtcaccctggggcccaaggacaggcacacttggcccactgctgtc aacactggggcttga >gi568815595f:98253854_98454816|GENSCAN_predicted_peptide_7|81_aa XVLKFYIVTVSGYTQENGALANDLNRENAIQEIITKGVKSKNSNKVTCDKLLLEGLVLTE LRILSIGEVILQKLESLWGYS >gi568815595f:98253854_98454816|GENSCAN_predicted_CDS_7|246_bp naagttttaaagttttacattgtcactgtatcagggtatactcaggaaaacggagccctt gccaacgatttaaatagagaaaatgcaattcaggaaataattacaaagggtgtgaaaagt aaaaattcaaacaaggtaacctgtgataagctcctactggaaggactggtgttaacagaa ctcagaattctgtccataggagaggttattctgcagaagctggaatctctctggggatac agctaa