GENSCAN 1.0 Date run: 7-Nov-116 Time: 16:59:51 Sequence gi568815595r:20061108_20284027 : 222920 bp : 38.51% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 11226 11352 127 2 1 108 95 21 0.703 4.13 1.02 Intr + 34156 34301 146 2 2 18 100 79 0.247 1.18 1.03 Intr + 38755 38847 93 0 0 76 98 65 0.976 5.54 1.04 Intr + 40180 40361 182 0 2 57 84 135 0.945 7.74 1.05 Intr + 50489 50680 192 2 0 107 78 77 0.954 6.29 1.06 Intr + 53775 53881 107 0 2 71 111 17 0.931 1.24 1.07 Intr + 58491 58616 126 1 0 70 115 58 0.927 6.43 1.08 Intr + 61561 61706 146 2 2 48 59 189 0.472 11.08 1.09 Intr + 64798 65006 209 0 2 99 105 193 0.995 19.05 1.10 Intr + 66316 66442 127 1 1 83 116 69 0.978 8.96 1.11 Intr + 75835 75945 111 0 0 71 77 99 0.885 6.76 1.12 Intr + 79114 79257 144 0 0 78 99 118 0.998 11.46 1.13 Intr + 85209 85323 115 2 1 122 45 75 0.974 5.70 1.14 Term + 91225 91418 194 2 2 79 34 191 0.970 9.40 1.15 PlyA + 92307 92312 6 1.05 2.06 PlyA - 92401 92396 6 1.05 2.05 Term - 113943 113134 810 0 0 2 39 399 0.073 18.59 2.04 Intr - 122697 122501 197 1 2 118 68 159 0.974 15.11 2.03 Intr - 125131 124952 180 2 0 15 69 227 0.056 12.42 2.02 Intr - 130141 130026 116 0 2 72 58 141 0.321 8.67 2.01 Init - 133926 133817 110 0 2 62 70 57 0.330 1.04 2.00 Prom - 165541 165502 40 -3.15 3.00 Prom + 176783 176822 40 -3.45 3.01 Init + 178684 178776 93 0 0 61 81 54 0.393 2.43 3.02 Intr + 179176 179356 181 0 1 92 68 50 0.418 1.92 3.03 Intr + 180603 180743 141 1 0 -44 103 151 0.133 2.80 3.04 Intr + 189311 189456 146 0 2 69 3 140 0.434 2.68 3.05 Term + 189612 189743 132 2 0 101 48 89 0.486 3.31 3.06 PlyA + 191033 191038 6 1.05 4.03 PlyA - 191451 191446 6 1.05 4.02 Term - 193906 193649 258 1 0 10 42 220 0.334 4.17 4.01 Init - 199907 199899 9 2 0 89 74 1 0.293 -0.55 4.00 Prom - 200310 200271 40 -3.65 5.09 PlyA - 201074 201069 6 1.05 5.08 Term - 202071 201182 890 1 2 43 36 278 0.358 9.73 5.07 Intr - 207535 207308 228 2 0 44 87 169 0.724 9.32 5.06 Intr - 208860 208698 163 0 1 29 67 126 0.439 3.23 5.05 Intr - 209461 209242 220 0 1 70 49 134 0.229 5.08 5.04 Intr - 211147 210979 169 2 1 26 103 106 0.111 3.98 5.03 Intr - 215347 215130 218 0 2 72 73 148 0.421 8.92 5.02 Intr - 215567 215531 37 0 1 101 98 1 0.436 -1.20 5.01 Intr - 217801 217613 189 2 0 60 99 75 0.459 4.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 122920 122779 142 1 1 79 52 95 0.976 5.34 S.002 Init + 124683 124742 60 2 0 63 -15 119 0.923 0.40 S.003 Term + 124868 125305 438 1 0 122 37 259 0.924 18.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:20061108_20284027|GENSCAN_predicted_peptide_1|672_aa AEESCKCNGWKNPNPSPTPPRADLQQIIVSLTESCRSCSHALAAHVSHLENVSEEEMNRL LGIVLDVEYLFTCVHKEEDADTKQVYFYLFKLLRKSILQRGKPVVEGSLEKKPPFEKPSI EQGVNNFVQYKFSHLPAKERQTIVELAKMFLNRINYWHLEAPSQRRLRSPNDDISGYKEN YTRWLCYCNVPQFCDSLPRYETTQVFGRTLLRSVFTVMRRQLLEQARQEKDKLPLEKRTL ILTHFPKFLSMLEEEVYSQNSPIWDQDFLSASSRTSQLGIQTVINPPPVAGTISYNSTSS SLEQPNAGSSSPACKASSGLEANPGEKRKMTDSHVLEEAKKPRVMGDIPMELINEVMSTI TDPAAMLGPEVSRTNFLSAHSARDEAARLEERRGVIEFHVVGNSLNQKPNKKILMWLVGL QNVFSHQLPRMPKEYITRLVFDPKHKTLALIKDGRVIGGICFRMFPSQGFTEIVFCAVTS NEQVKGYGTHLMNHLKEYHIKHDILNFLTYADEYAIGYFKKQGFSKEIKIPKTKYVGYIK DYEGATLMGCELNPRIPYTEFSVIIKKQKEIIKKLIERKQAQIRKVYPGLSCFKDGVRQI PIESIPGINLKTMSERLKNRYYVSKKLFMADLQRVFTNCKEYNPPESEYYKCANILEKFF FSKIKEAGLIDK >gi568815595r:20061108_20284027|GENSCAN_predicted_CDS_1|2019_bp gccgaggagtcttgtaaatgtaatggctggaaaaaccctaacccctcacccactcccccc agagccgacctgcagcaaataattgtcagtctaacagaatcctgtcggagttgtagccat gccctagctgctcatgtttcccacctggagaatgtgtcagaggaagaaatgaacagactc ctgggaatagtattggatgtggaatatctctttacctgtgtccacaaggaagaagatgca gataccaaacaagtttatttctatctatttaagctcttgagaaagtctattttacaaaga ggaaaacctgtggttgaaggctctttggaaaagaaacccccatttgaaaaacctagcatt gaacagggtgtgaataactttgtgcagtacaaatttagtcacctgccagcaaaagaaagg caaacaatagttgagttggcaaaaatgttcctaaaccgcatcaactattggcatctggag gcaccatctcaacgaagactgcgatctcccaatgatgatatttctggatacaaagagaac tacacaaggtggctgtgttactgcaacgtgccacagttctgcgacagtctacctcggtac gaaaccacacaggtgtttgggagaacattgcttcgctcggtcttcactgttatgaggcga caactcctggaacaagcaagacaggaaaaagataaactgcctcttgaaaaacgaactcta atcctcactcatttcccaaaatttctgtccatgctagaagaagaagtatatagtcaaaac tctcccatctgggatcaggattttctctcagcctcttccagaaccagccagctaggcatc caaacagttatcaatccacctcctgtggctgggacaatttcatacaattcaacctcatct tcccttgagcagccaaacgcagggagcagcagtcctgcctgcaaagcctcttctggactt gaggcaaacccaggagaaaagaggaaaatgactgattctcatgttctggaggaggccaag aaaccccgagttatgggggatattccgatggaattaatcaacgaggttatgtctaccatc acggaccctgcagcaatgcttggaccagaggtcagcaggaccaattttctgtcagcacac tcggccagggatgaggcggcaaggttggaagagcgcaggggtgtaattgaatttcacgtg gttggcaattccctcaaccagaaaccaaacaagaagatcctgatgtggctggttggccta cagaacgttttctcccaccagctgccccgaatgccaaaagaatacatcacacggctcgtc tttgacccgaaacacaaaacccttgctttaattaaagatggccgtgttattggtggtatc tgtttccgtatgttcccatctcaaggattcacagagattgtcttctgtgctgtaacctca aatgagcaagtcaagggctatggaacacacctgatgaatcatttgaaagaatatcacata aagcatgacatcctgaacttcctcacatatgcagatgaatatgcaattggatactttaag aaacagggtttctccaaagaaattaaaatacctaaaaccaaatatgttggctatatcaag gattatgaaggagccactttaatgggatgtgagctaaatccacggatcccgtacacagaa ttttctgtcatcattaaaaagcagaaggagataattaaaaaactgattgaaagaaaacag gcacaaattcgaaaagtttaccctggactttcatgttttaaagatggagttcgacagatt cctatagaaagcattcctggaattaatctgaaaaccatgagtgaacgcctcaagaatagg tactacgtgtctaagaaattattcatggcagacttacagcgagtctttaccaattgcaaa gagtacaacccccctgagagtgaatactacaaatgtgccaatatcctggagaaattcttc ttcagtaaaattaaggaagctggattaattgacaagtga >gi568815595r:20061108_20284027|GENSCAN_predicted_peptide_2|470_aa MLDIINHQRNANEIHSEIPFIPTRMARIKKKDNDKCCSEGQTSKAKVSAGGTPSGGSERN PLLASSSFWQLSAFLANQWRSEAENFECGGGSSRRRRTVVRSWRAQCAGLAVAGELRRAL KCEEEEDSCCRSSGQANTSTLLKNYQDNNKMLVLALENEKSKVKEAQDIILQLRKECYYL TCQLYALKGKLTSQQTVEPAQIPTIPQDTLGVDFDSGEAKSTDNVLPRTVSVRSSLKKHC NSICQFDSLDDFETSHLAGKSFEFERVGFLDPLVNMHIPENVQHNACQWSKDQVNLSPKL IQPGTFTKTKEDILESKSEQTKSKQRDTQERKREEKRKANRRKSKRMSKYKENKSENKKT VPQKKMHKSVSSNDAYNFNLEEGVHLTPFRQKVSNDSNREENNESEVSLCESSGSGDDSD DLYLPTCKYIQNPTSNSDRPVTRPLAKRALKYTDEKETEGSKPTKTPTSK >gi568815595r:20061108_20284027|GENSCAN_predicted_CDS_2|1413_bp atgcttgatatcattaaccatcagagaaatgcaaatgaaatccacagtgagataccattc atacccactaggatggctagaatcaagaagaaagataatgacaagtgttgttctgaaggc cagacgtccaaagccaaggtgtcagcaggtggtactccctctggaggctcagaacgaaat ccactccttgcctcttccagtttctggcagctgtcggcattcctagccaaccaatggagg agcgaggcggaaaatttcgaatgtggcggcggtagttccaggcgacggcggacggtggta cggtcctggagggcccagtgcgcggggctagccgtggctggagagcttcgaagagccttg aaatgtgaggaggaggaagatagctgttgcagaagtagtggccaagccaacacttctaca ctgctgaaaaattaccaagacaacaacaaaatgttagttttagctttggaaaatgaaaaa tccaaagtgaaagaagcccaagatatcatcctacagctgagaaaagaatgttactatctc acatgtcagctatatgcattgaaaggaaaacttacatcacaacaaacagtagaacctgct cagatacctactattcctcaagacacactgggagttgattttgattcaggtgaagctaag tctactgataatgtcttacctagaactgtatctgttcgtagcagtttaaagaaacattgt aacagtatatgtcagtttgatagcttggatgattttgaaaccagtcatttggcagggaag tcttttgaattcgaaagagttggatttttagacccactagtaaacatgcacatacctgaa aatgtacaacacaatgcttgtcaatggagcaaggaccaagttaacttatcaccaaagctg attcagccaggaacgtttactaaaacaaaagaagacattttagaatctaaatctgaacaa actaaaagtaagcaaagagatacacaagaaagaaaaagagaagagaaaagaaaagctaac aggagaaaatcaaaacgtatgtcaaaatataaagagaataaaagcgaaaataaaaaaact gttccccaaaaaaaaatgcacaaatctgtcagttccaatgatgcttacaattttaatttg gaagagggtgttcatcttactcctttccgacaaaaagtgagcaatgactctaatagagaa gaaaacaacgagtctgaagtgagcctctgtgaatcaagtggttcaggagatgattccgat gacctctatttgcccacttgcaagtacattcagaatcccacgagcaattcagatagacca gtcaccaggcctctagctaaaagagcactgaaatacacagatgaaaaagagacggagggt tctaagccaacaaaaactcctaccagtaagtga >gi568815595r:20061108_20284027|GENSCAN_predicted_peptide_3|230_aa MGIHRNHEGRPRCQGGECGQTVPMMVLQLIEAWVGGELLQLQFLLGDETGSQGQFDDLEP VCVCHCWVPQPAPLRSGNSETFSATPLDRTPEVPVSPDEKSWQNNSGITRNLNVMTPTKD HTSSVAMVNNKNGNSEMTAFSPHGWLHRLELGASGVSGTGHKLLVDRLFTAVEDSGPIPT APLHSAPALTPSGCHQALPLVPFGVVARAVPGPLCAVTGAGEAGMQEAMS >gi568815595r:20061108_20284027|GENSCAN_predicted_CDS_3|693_bp atgggaatccacaggaatcatgaaggacgccccaggtgccagggaggagaatgtgggcaa acagtccccatgatggtgttacagctgatagaagcctgggtgggaggagagctgctacag ctgcagtttctcctaggtgacgagacaggcagccagggccagtttgatgacctggaacca gtctgtgtgtgtcattgctgggtgcctcagcctgctcctctgagatcaggaaacagtgaa actttctctgctacacccctagacagaactccagaagtgccagtatctccagatgagaag agctggcagaataattctggtatcacaagaaatctgaatgtcatgacaccaacaaaggat cacactagctctgtagcaatggtgaataataaaaatggaaactcagaaatgacagcattc agcccccatggctggttacaccggttggagttgggtgccagtggtgtttctggtacaggg cacaagctgctggtggatcgactattcacagctgtagaggacagcggccccattcccaca gctccactacacagtgctccagctttaacaccatctggatgccaccaagccttaccactt gtgccctttggagtggtggccagagctgtacctgggccgctttgtgctgtgactggagca ggagaggctgggatgcaggaagcaatgtcctga >gi568815595r:20061108_20284027|GENSCAN_predicted_peptide_4|88_aa MAQWVQPKEGEPKQGGASPHLGSLTASGNSLSQPKEAIRDCTVHSGPDTAIFPWSSKPID QEIPSDAYATRAMGVQHKTEQPFVQTPS >gi568815595r:20061108_20284027|GENSCAN_predicted_CDS_4|267_bp atggcacagtgggtgcagcccaaggagggcgagccaaagcagggtggggcatcgcctcac ctgggaagcctcacggcgtcggggaattccctctcccagccaaaggaagccattagggac tgtactgtgcactctggcccagatactgcaattttcccatggtcttcgaaacccatagac caggagattccctctgatgcctatgccaccagggccatgggtgtccagcacaaaactgag cagccatttgtgcagacacccagctag >gi568815595r:20061108_20284027|GENSCAN_predicted_peptide_5|704_aa XPQNQACLENSPLRKACFLAQLNCLDRLKPKANLEDPSCQSNLKEVSVLACLAVHNFSNW ADADSFLFKQAAVVFSVTKVCPELVPSSGFLISLTSRMKPRTLMVSVTVLKDGVSGVCSF RCSDVSRVSSFRWVRGLADFRSEATDLRSQQMFATPVPMICVEVPVGIHTGDSLLTGREF VPFFDCHSIIYLTNIPGISKLSGCRVNQQLVRTLELNGFPLCRPSAQPRSTGKAEADSKQ TNAPNSEESGLLEHPFPESLTPVSLVQQLCYSYLTGHQVRGLADFRSEAADLHSELLKLV RPELFVPPGGFVVSLTSGVKPQTFAVSVTAVKEIQATIREYYKHLSTNKLENLEEMDKFL DKYTFPRLSQEEVESLNRPITSSEIEAVIAHQPKKAQDQMDSQPNSTTDAEKAFDKIQQP FMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIV LEVLARAIRQEKEIKGIQSGKEEVKLSLFADDMIVYLENPIVSAQNLVKLISNFSKVSGY KINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIHLTRDVKHLLKENYKPLLNK IKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPMTFFTELEKATLKFIWNQ KRAHIAKSILSQKNKAGGITLPDFKVYYKATVTKTAWYWYQTEI >gi568815595r:20061108_20284027|GENSCAN_predicted_CDS_5|2115_bp ngtccacagaaccaagcctgtttggaaaacagtcctttaagaaaagcttgtttccttgct caactcaactgtcttgaccgccttaaaccaaaagcaaatttggaagacccctcctgccaa agcaatctaaaagaagtgagtgtactagcttgcttagcagtccacaacttcagcaactgg gctgatgcagattccttcctgtttaaacaggctgcagtcgttttcagtgtcacaaaagtg tgtccagaattggttccttccagtgggttcttgatctcgctgacttcacgaatgaagcca cggaccctcatggtgagtgttacagttcttaaagatggtgtatctggagtttgttccttc agatgttcagatgtgtccagagtttcttccttccggtgggttcgtggtctcgctgacttc aggagtgaagccacagaccttcgcagtcaacagatgtttgcaactccagtccccatgatc tgcgttgaggtcccagtggggatccatactggggacagcttgctgaccggtagggaattt gtccctttcttcgactgtcattctatcatttacttgactaacataccaggtatctccaaa ctctcaggctgcagggtcaaccaacaacttgtcaggaccctggagctgaatggctttcct ctctgccgaccctcggctcagcccagaagtacaggaaaagcggaagctgattccaagcaa accaatgctcccaattccgaagagtcggggttgttagagcaccctttcccagaaagcctg acacccgtgtctttagtccagcagctgtgctactcatatttaactggccatcaggttcgt ggtcttgctgacttcaggagtgaagccgcagaccttcacagtgagctcttaaagctggtg cgtccagagttgtttgttcctcccggtgggttcgtggtctcactgacttcaggagtgaag ccgcagaccttcgcagtgagtgttacagctgttaaagaaatacaagctaccatcagagaa tactataaacacctctccacaaacaaactagaaaatctagaggaaatggataaattcctg gacaaatacaccttcccaagattaagccaggaagaagttgaatccctgaatagaccaata acaagttctgaaattgaagcagtaatagcccaccaaccaaaaaaagcccaggaccagatg gattcacagccaaattctaccacagatgcagaaaaggcctttgacaaaattcaacaaccc ttcatgctaaaaactctcaataaattaggtattgatgggacgtatctcaaaataataaga gctatttatgacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattc cctttgaaaactggaacaagacagggatgccctctctcaccactcctattcaacatagtg ttggaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaatcagga aaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaacccc attgtctcagcccaaaatctcgttaagctgataagcaacttcagcaaagtctcaggatac aaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagc caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacttaggaatc caccttacaagggacgtgaagcacctcctcaaggagaactacaaaccactgctcaataaa ataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaat atcatgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaag ctaccaatgactttcttcacagaactggaaaaagctactttaaagttcatatggaaccaa aaaagagcccacatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcaca ctacctgacttcaaagtatactacaaggctacagtaaccaaaacagcatggtactggtac caaacagagatatag