GENSCAN 1.0 Date run: 5-Nov-116 Time: 09:04:02 Sequence gi568815595f:182720259_183018101 : 297843 bp : 38.93% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 221 216 6 -0.45 1.04 Term - 3235 2924 312 1 0 74 40 190 0.223 6.82 1.03 Intr - 6002 5928 75 2 0 75 56 95 0.261 3.79 1.02 Intr - 12894 12659 236 1 2 90 98 80 0.512 5.68 1.01 Init - 22049 21950 100 2 1 41 41 133 0.280 4.38 1.00 Prom - 24570 24531 40 -4.75 2.00 Prom + 46214 46253 40 -2.65 2.01 Init + 62286 62339 54 2 0 52 53 54 0.028 -0.37 2.02 Intr + 73454 73528 75 1 0 76 82 105 0.764 7.49 2.03 Intr + 73962 74159 198 2 0 -8 12 227 0.011 4.13 2.04 Intr + 99141 99260 120 2 0 46 45 96 0.002 0.87 2.05 Intr + 100002 100118 117 2 0 55 116 81 0.003 7.34 2.06 Intr + 109414 109494 81 0 0 49 80 73 0.013 1.52 2.07 Intr + 115777 115884 108 0 0 65 67 119 0.991 7.06 2.08 Intr + 116084 116212 129 1 0 92 47 120 0.980 8.27 2.09 Intr + 116813 116916 104 1 2 62 80 101 0.707 4.75 2.10 Intr + 121817 121864 48 2 0 30 116 64 0.089 0.28 2.11 Intr + 137620 137770 151 1 1 106 63 28 0.680 1.34 2.12 Intr + 138904 139101 198 0 0 95 67 75 0.710 4.73 2.13 Intr + 145198 145440 243 0 0 100 94 168 0.999 15.37 2.14 Intr + 146010 146185 176 2 2 62 86 213 0.826 16.32 2.15 Intr + 148820 148893 74 2 2 81 91 61 0.999 3.83 2.16 Intr + 148970 149073 104 0 2 124 97 32 0.997 6.67 2.17 Intr + 152098 152279 182 0 2 91 111 187 0.999 19.04 2.18 Intr + 153554 153757 204 2 0 108 116 220 0.967 24.19 2.19 Intr + 178349 178514 166 2 1 120 91 64 0.897 8.94 2.20 Term + 185536 185661 126 0 0 108 43 82 0.702 3.00 2.21 PlyA + 187517 187522 6 1.05 3.12 PlyA - 188339 188334 6 1.05 3.11 Term - 191732 191611 122 0 2 67 32 138 0.223 3.76 3.10 Intr - 224915 224850 66 0 0 118 21 58 0.007 0.06 3.09 Intr - 227076 226980 97 0 1 71 90 115 0.997 8.66 3.08 Intr - 227374 227292 83 2 2 63 111 14 0.161 -0.36 3.07 Intr - 241098 240968 131 2 2 63 82 147 0.986 11.02 3.06 Intr - 243791 243623 169 0 1 67 96 161 0.999 12.88 3.05 Intr - 245495 245279 217 2 1 48 113 157 0.982 11.35 3.04 Intr - 259379 259199 181 1 1 41 32 108 0.269 -0.55 3.03 Intr - 260381 260229 153 1 0 70 77 99 0.892 5.27 3.02 Intr - 260628 260577 52 1 1 77 59 76 0.662 0.65 3.01 Init - 261595 261472 124 1 1 63 91 52 0.564 3.39 3.00 Prom - 264779 264740 40 -7.75 4.00 Prom + 266016 266055 40 -1.85 4.01 Init + 271161 271270 110 2 2 93 35 107 0.012 5.64 4.02 Intr + 278301 278485 185 0 2 50 61 61 0.001 -1.89 4.03 Intr + 279695 279810 116 0 2 64 61 72 0.001 1.35 4.04 Intr + 279954 280103 150 2 0 26 45 138 0.018 2.74 4.05 Term + 282258 282383 126 2 0 123 54 88 0.035 6.20 4.06 PlyA + 283120 283125 6 1.05 5.00 Prom + 286932 286971 40 -2.75 5.01 Sngl + 297305 297838 534 1 0 46 48 311 0.885 18.72 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 224915 224836 80 0 2 118 42 68 0.980 2.15 S.002 Sngl + 271161 271349 189 2 0 93 34 135 0.874 3.46 S.003 Intr - 285544 285402 143 0 2 54 115 66 0.858 4.98 S.004 Term - 295308 295180 129 0 0 73 37 145 0.877 5.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:182720259_183018101|GENSCAN_predicted_peptide_1|240_aa MLRFHSIGLQVVAALSSTLRKNVSESEANQQPKDPLTHTESIPESVRYGQVKYLRMTWLG NYMAISLAKLKAKTLQLIMKGNHARWCCLNGSSLSTQRASYAQSSPKGEDSTNYRKHWAS DGRLRNHESLPPVLKTHNPRQIKGDLGWFSDDPDRYIEAFQNVTQVLYLTCRDVMLLLSQ THTAAEKQAAPDIRRKLQKQAIGPGSTLEDLLKLDTSVFYNRDREEAQEKKRSTRERLRL >gi568815595f:182720259_183018101|GENSCAN_predicted_CDS_1|723_bp atgctacgctttcacagtatcggtctacaggtggtagctgctctttcctcaacactcagg aagaatgtgtctgagagtgaagccaaccaacagcccaaggaccccctcacacatactgag agcatccctgagtctgtaagatacggccaagtgaaatatttgagaatgacatggcttggg aactacatggccataagccttgctaaattaaaagcaaaaacattgcaattaatcatgaaa gggaaccatgctagatggtgctgtctaaatggaagctctttatctacacagagggcctca tatgcccagtcttcacccaagggagaggacagtacaaactatcgaaaacactgggcatct gatggccgattaaggaaccacgagagcctgccgccggtcttaaagactcataaccctagg caaataaagggagacttgggctggttttctgatgaccctgataggtatattgaggccttc caaaatgtaactcaagtgctttacctcacatgcagggacgttatgctgcttctaagccaa acccacactgcagctgaaaagcaggcagctccagatattagaaggaagctacagaaacag gccataggaccgggtagtaccttggaagacctcctgaagttggacacttcggtcttttac aatagggaccgggaggaggcccaggagaaaaagagaagcacaagagaaagactaaggctc tag >gi568815595f:182720259_183018101|GENSCAN_predicted_peptide_2|885_aa MREVCLRESTTESSTEPEPRGPRAPRDPDGDDGGMWRWIRQQLGKRRTCLLPPLVTDFWK RVTIQSCVDSRGGGCRRCLAPSARDVSPQDLERGKTVPAAGLASETIRITICRMLKCLEG SLATVEVEIQSSKNKVSVFLLPVKSFLPEGFDPPHQSDTRTIYVANRFPQNGLYTPQKFI DNRIISSKLMIDTPTSPVTSGLPLFFVITVTAIKQGYEDWLRHNSDNEVNGAPVYVVRSG GLVKTRSKNIRVGDIVRIAKDEIFPADLVLLSSDRLDGSCHVTTASLDGETNLKTHVAVP ETALLQTVANLDTLVAVIECQQPEADLYRFMGRMIITQQMEEIVRSMNTFLIIYLVILIS EAVISTILKYTWQAEEKWDEPWYNQKTEHQRNSSKILRFISDFLAFLVLYNFIIPISLYV TVEMQKFLGSFFIGWDLDLYHEESDQKAQVNTSDLNEELGQVEYVFTDKTGTLTENEMQF RECSINGMKYQEINGRLVPEGPTPDSSEGNLSYLSSLSHLNNLSHLTTSSSFRTSPENET ELIKEHDLFFKAVSLCHTVQISNVQTDCTGDGPWQSNLAPSQLEYYASSPDEKALVEAAA RYKLLHILEFDSDRRRMSVIVQAPSGEKLLFAKGAESSILPKCIGGEIEKTRIHVDEFAL KGLRTLCIAYRKFTSKEYEEIDKRIFEARTALQQREEKLAAVFQFIEKDLILLGATAVED RLQDKVRETIEALRMAGIKVWVLTGDKHETAVSVSLSCGHFHRTMNILELINQKSDSECA EQLRQLARRPFLGSQNMYFVFIQLLSSGSAWFAIILMVVTCLFLDIIKKVFDRHLHPTST EKAQMEEDKGSVSSAYEFAESQHRGEDSRHLLAEACSQQDSCCVH >gi568815595f:182720259_183018101|GENSCAN_predicted_CDS_2|2658_bp atgagggaagtatgtctgagggaatcaaccaccgaaagcagcacagagccagaaccccgc ggcccccgcgccccgcgggacccggacggcgacgacgggggaatgtggcgctggatccgg cagcagctggggaagcgaagaacctgtcttctgcccccactcgtcacggacttctggaag cgtgtcacgatccagagctgtgttgattcccgaggaggaggctgtcgtaggtgtctagcc ccctcagcgagagacgtttctccccaggacctggaaaggggaaagactgtgccagctgcc ggtctagcttcggaaacgatacgtattacaatatgccgaatgttaaagtgtcttgaaggt tcacttgcaactgtggaagtggaaatacagtcctctaagaataaagtttcagtgttcttg ttacccgtgaaatcttttctaccagagggttttgacccaccacatcagagtgacacaaga accatctacgtagccaacaggtttcctcagaatggcctttacacacctcagaaatttata gataacaggatcatttcatctaagcttatgattgatacacctaccagtccagttaccagt ggacttccattattctttgtgataacagtaactgccataaagcagggatatgaagattgg ttacggcataactcagataatgaagtaaatggagctcctgtttatgttgttcgaagtggt ggccttgtaaaaactagatcaaaaaacattcgggtgggtgatattgttcgaatagccaaa gatgaaatttttcctgcagacttggtgcttctgtcctcagatcgactggatggttcctgt cacgttacaactgctagtttggacggagaaactaacctgaagacacatgtggcagttcca gaaacagcattattacaaacagttgccaatttggacactctagtagctgtaatagaatgc cagcaaccagaagcagacttatacagattcatgggacgaatgatcataacccaacaaatg gaagaaattgtaaggtcaatgaatacatttttgataatttatctagtaattcttatatct gaagctgtcatcagcactatcttgaagtatacatggcaagctgaagaaaaatgggatgaa ccttggtataaccaaaaaacagaacatcaaagaaatagcagtaagattctgagatttatt tcagacttccttgcttttttggttctctacaatttcatcattccaatttcattatatgtg acagtcgaaatgcagaaatttcttggatcattttttattggctgggatcttgatctgtat catgaagaatcagatcagaaagctcaagtcaatacttccgatctgaatgaagagcttgga caggtagagtacgtgtttacagataaaactggtacactgacagaaaatgagatgcagttt cgggaatgttcaattaatggcatgaaataccaagaaattaatggtagacttgtacccgaa ggaccaacaccagactcttcagaaggaaacttatcttatcttagtagtttatcccatctt aacaacttatcccatcttacaaccagttcctctttcagaaccagtcctgaaaatgaaact gaactaattaaagaacatgatctcttctttaaagcagtcagtctctgtcacactgtacag attagcaatgttcaaactgactgcactggtgatggtccctggcaatccaacctggcacca tcgcagttggagtactatgcatcttcaccagatgaaaaggctctagtagaagctgctgca aggtacaaactgcttcatattctggaatttgattcagatcgtaggagaatgagtgtaatt gttcaggcaccttcaggtgagaagttattatttgctaaaggagctgagtcatcaattctc cctaaatgtataggtggagaaatagaaaaaaccagaattcatgtagatgaatttgctttg aaagggctaagaactctgtgtatagcatatagaaaatttacatcaaaagagtatgaggaa atagataaacgcatatttgaagccaggactgccttgcagcagcgggaagagaaattggca gctgttttccagttcatagagaaagacctgatattacttggagccacagcagtagaagac agactacaagataaagttcgagaaactattgaagcattgagaatggctggtatcaaagta tgggtacttactggggataaacatgaaacagctgttagtgtgagtttatcatgtggccat tttcatagaaccatgaacatccttgaacttataaaccagaaatcagacagcgagtgtgct gaacaattgaggcagcttgccagaaggccatttttgggctcccagaatatgtattttgtg tttattcagctcctgtcaagtggttctgcttggtttgccataatcctcatggttgttaca tgtctatttcttgatatcataaagaaggtctttgaccgacacctccaccctacaagtact gaaaaggcacagatggaagaagataagggttcagtcagctcagcatatgaatttgctgaa agccagcacagaggggaggacagtaggcacctgctagctgaggcttgcagccagcaggac agttgttgcgtgcactag >gi568815595f:182720259_183018101|GENSCAN_predicted_peptide_3|464_aa MDERFVAARNLPWYSAGRSQYMASSRKLGGLWELPKKFCCFACVDPAVVVRKEPRRKRRS GAASSGRAGPPRTAHEREPGKPVRGAAAAAAVHSLRSRRRRGEAWRTPTCLSSYSPFLFG GGKKSSPGLQTLVVELFTLVIKYLTLLDALRVSEALFGIVQSERELNSVQNKLKSSQKDK VRQFMIFTQSSEKTAVSCLSQNDWKLDVATDNFFQNPELYIRESVKGSLDRKKLEQLYNR YKDPQDENKIGIDGIQQFCDDLALDPASISVLIIAWKFRAATQCEFSKQEFMDGMTELGC DSIEKLKAQIPKMEQELKEPGRFKDFYQFTFNFAKNPGQKGLDLEMAIAYWNLVLNGRFK FLDLWNKFLLEHHKRSIPKDTWNLLLDFSTMIADDMSNYDEEGAWPVLIDDFVEFARPQI AGTKISTPANQCSLPVGNVSGPAINQGQNADPYISVQLKSYRLH >gi568815595f:182720259_183018101|GENSCAN_predicted_CDS_3|1395_bp atggatgaacggtttgtggccgccagaaacctgccttggtacagcgctgggcgctcccag tacatggctagctccaggaagttaggaggcctttgggagttaccaaagaagttctgttgt tttgcctgcgtagacccggccgtggtcgtccggaaggaaccgcgtaggaaacggaggtcc ggggctgcctcctcgggccgcgccgggccgccccgcactgcgcatgagcgggagccgggc aagcccgtgagaggagccgccgccgccgccgccgtccattcgctgcggagccggaggagg aggggagaggcctggaggacaccaacatgtttgtcaagttactctccttttttgtttggg ggggggaaaaagtcctctccaggcttgcagacccttgtggtcgagttgtttactttggta attaagtacctgactctgctggacgctcttcgcgtttcagaagcgctgtttggcattgta cagtctgagcgtgagttgaattccgtgcagaacaagttgaaatcatcgcagaaggataaa gttcgtcagtttatgatcttcacacaatctagtgaaaaaacagcagtaagttgtctttct caaaatgactggaagttagatgttgcaacagataattttttccaaaatcctgaactttat atacgagagagtgtaaaaggatcattggacaggaagaagttagaacagctgtacaataga tacaaagaccctcaagatgagaataaaattggaatagatggcatacagcagttctgtgat gacctggcactcgatccagccagcattagtgtgttgattattgcgtggaagttcagagca gcaacacagtgcgagttctccaaacaggagttcatggatggcatgacagaattaggatgt gacagcatagaaaaactaaaggcccagatacccaagatggaacaagaattgaaagaacca ggacgatttaaggatttttaccagtttacttttaattttgcaaagaatccaggacaaaaa ggattagatctagaaatggccattgcctactggaacttagtgcttaatggaagatttaaa ttcttagacttatggaataaatttttgttggaacatcataaacgatcaataccaaaagac acttggaatcttcttttagacttcagtacgatgattgcagatgacatgtctaattatgat gaagaaggagcatggcctgttcttattgatgactttgtggaatttgcacgccctcaaatt gctgggacaaaaatctccactccagccaaccaatgttccctgcctgtgggcaacgtgtcg gggcctgccatcaatcagggtcagaatgctgacccctacatcagcgttcagcttaaatcc taccgtctccattaa >gi568815595f:182720259_183018101|GENSCAN_predicted_peptide_4|228_aa MTASASEEASGNFSSWQKANWEQVPYMTGAEPSSVGRSSCKASLVVTNSLSVCMSEKVSK KDLISPLLMQLSLARYEILGWNFFSLRMLHIGPQSLLACIHHSGGGNTAGDVQGASAGDC ACGCAGGGVGLGAECWQAPIAMVISGGMGSGCIPTLAGRGMQNPPRQTRASKVMWGAAAD PREAAVWVHASKNRRADTETRIEAMEEMRGRLAGILKLRELQMVMANH >gi568815595f:182720259_183018101|GENSCAN_predicted_CDS_4|687_bp atgacggcttctgcttctgaagaggcctcaggaaacttttcctcatggcagaaggcaaac tgggagcaggtgccttacatgacaggagcagaaccaagcagtgtggggaggagctcttgt aaggcaagtctggtggtaacaaattccctcagcgtttgcatgtctgaaaaggtgtccaaa aaggatcttatttctcctttgcttatgcagcttagtttggccagatatgaaattctgggt tggaatttcttttctttaagaatgttgcatattggcccccagtctcttctggcttgcatt caccacagtggtggaggcaacacagctggggatgtgcagggggcctctgctggtgactgt gcatgtggttgtgctggaggtggtgttggcttgggggcagaatgctggcaggctccgatt gcaatggtgatcagtgggggaatgggaagtggctgcattcccacactggcagggcgagga atgcaaaacccacccaggcagacacgtgccagcaaagtgatgtggggagctgccgcggac cccagggaagctgcagtttgggtacacgcttctaagaaccgcagggcagatactgagacc cgcatagaggcaatggaggaaatgagaggcaggcttgctgggatcttaaagttgagagag cttcaaatggtgatggcgaatcactga >gi568815595f:182720259_183018101|GENSCAN_predicted_peptide_5|177_aa MWLCHLPNALVKSEAYSPNIQVKSETYSPNALVKSEAYRSNALVKSEAYSPNVQVKSETY SPNVLVKSEAYSPNVLVKSEAYSPNVLVKSEAYSPNALVKSEAYRSNVLVKSEAYSPNVL VKSEAYRSNALVKSEAYSPNALVKSEAYRSNALVKSEAYSPNALVKSEAYSPNVLVK >gi568815595f:182720259_183018101|GENSCAN_predicted_CDS_5|534_bp atgtggttgtgccatctgccaaatgcactggtcaaatcagaggcctacagcccaaatata caggtcaaatctgagacctacagcccaaatgcactggtcaaatcagaggcctacaggtca aatgcactggtcaaatcagaggcttacagcccaaatgtacaggtcaaatctgagacctac agcccaaatgtactggtcaaatcagaggcctacagcccaaatgtactggtcaaatcagag gcctacagcccaaatgtactggtcaaatcagaggcctacagcccaaatgcactggtcaaa tcagaggcctacaggtcaaatgtactggtcaaatcagaggcctacagcccaaatgtactg gtcaaatcagaggcctacaggtcaaatgcactggtcaaatcagaggcctacagcccaaat gcactggtcaaatcagaggcctacaggtcaaatgcactggtcaaatcagaggcctacagc ccaaatgcactggtcaaatcagaggcctacagcccaaatgtactcgtcaaataa