GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:21:53 Sequence gi568815595r:20070707_20284027 : 213321 bp : 38.48% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1627 1753 127 0 1 108 95 21 0.650 4.13 1.02 Intr + 24557 24702 146 0 2 18 100 79 0.247 1.18 1.03 Intr + 29156 29248 93 1 0 76 98 65 0.976 5.54 1.04 Intr + 30581 30762 182 1 2 57 84 135 0.945 7.74 1.05 Intr + 40890 41081 192 0 0 107 78 77 0.954 6.29 1.06 Intr + 44176 44282 107 1 2 71 111 17 0.931 1.24 1.07 Intr + 48892 49017 126 2 0 70 115 58 0.927 6.43 1.08 Intr + 51962 52107 146 0 2 48 59 189 0.472 11.08 1.09 Intr + 55199 55407 209 1 2 99 105 193 0.995 19.05 1.10 Intr + 56717 56843 127 2 1 83 116 69 0.978 8.96 1.11 Intr + 66236 66346 111 1 0 71 77 99 0.885 6.76 1.12 Intr + 69515 69658 144 1 0 78 99 118 0.998 11.46 1.13 Intr + 75610 75724 115 0 1 122 45 75 0.974 5.70 1.14 Term + 81626 81819 194 0 2 79 34 191 0.970 9.40 1.15 PlyA + 82708 82713 6 1.05 2.06 PlyA - 82802 82797 6 1.05 2.05 Term - 104344 103535 810 1 0 2 39 399 0.073 18.59 2.04 Intr - 113098 112902 197 2 2 118 68 159 0.974 15.11 2.03 Intr - 115532 115353 180 0 0 15 69 227 0.056 12.42 2.02 Intr - 120542 120427 116 1 2 72 58 141 0.321 8.67 2.01 Init - 124327 124218 110 1 2 62 70 57 0.330 1.04 2.00 Prom - 155942 155903 40 -3.15 3.00 Prom + 167184 167223 40 -3.45 3.01 Init + 169085 169177 93 1 0 61 81 54 0.393 2.43 3.02 Intr + 169577 169757 181 1 1 92 68 50 0.418 1.92 3.03 Intr + 171004 171144 141 2 0 -44 103 151 0.133 2.80 3.04 Intr + 179712 179857 146 1 2 69 3 140 0.434 2.68 3.05 Term + 180013 180144 132 0 0 101 48 89 0.486 3.31 3.06 PlyA + 181434 181439 6 1.05 4.03 PlyA - 181852 181847 6 1.05 4.02 Term - 184307 184050 258 2 0 10 42 220 0.334 4.17 4.01 Init - 190308 190300 9 0 0 89 74 1 0.293 -0.55 4.00 Prom - 190711 190672 40 -3.65 5.09 PlyA - 191475 191470 6 1.05 5.08 Term - 192472 191583 890 2 2 43 36 278 0.358 9.73 5.07 Intr - 197936 197709 228 0 0 44 87 169 0.724 9.32 5.06 Intr - 199261 199099 163 1 1 29 67 126 0.439 3.23 5.05 Intr - 199862 199643 220 1 1 70 49 134 0.229 5.08 5.04 Intr - 201548 201380 169 0 1 26 103 106 0.111 3.98 5.03 Intr - 205748 205531 218 1 2 72 73 148 0.421 8.92 5.02 Intr - 205968 205932 37 1 1 101 98 1 0.436 -1.20 5.01 Intr - 208202 208014 189 0 0 60 99 75 0.459 4.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 113321 113180 142 2 1 79 52 95 0.976 5.34 S.002 Init + 115084 115143 60 0 0 63 -15 119 0.923 0.40 S.003 Term + 115269 115706 438 2 0 122 37 259 0.924 18.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:20070707_20284027|GENSCAN_predicted_peptide_1|672_aa AEESCKCNGWKNPNPSPTPPRADLQQIIVSLTESCRSCSHALAAHVSHLENVSEEEMNRL LGIVLDVEYLFTCVHKEEDADTKQVYFYLFKLLRKSILQRGKPVVEGSLEKKPPFEKPSI EQGVNNFVQYKFSHLPAKERQTIVELAKMFLNRINYWHLEAPSQRRLRSPNDDISGYKEN YTRWLCYCNVPQFCDSLPRYETTQVFGRTLLRSVFTVMRRQLLEQARQEKDKLPLEKRTL ILTHFPKFLSMLEEEVYSQNSPIWDQDFLSASSRTSQLGIQTVINPPPVAGTISYNSTSS SLEQPNAGSSSPACKASSGLEANPGEKRKMTDSHVLEEAKKPRVMGDIPMELINEVMSTI TDPAAMLGPEVSRTNFLSAHSARDEAARLEERRGVIEFHVVGNSLNQKPNKKILMWLVGL QNVFSHQLPRMPKEYITRLVFDPKHKTLALIKDGRVIGGICFRMFPSQGFTEIVFCAVTS NEQVKGYGTHLMNHLKEYHIKHDILNFLTYADEYAIGYFKKQGFSKEIKIPKTKYVGYIK DYEGATLMGCELNPRIPYTEFSVIIKKQKEIIKKLIERKQAQIRKVYPGLSCFKDGVRQI PIESIPGINLKTMSERLKNRYYVSKKLFMADLQRVFTNCKEYNPPESEYYKCANILEKFF FSKIKEAGLIDK >gi568815595r:20070707_20284027|GENSCAN_predicted_CDS_1|2019_bp gccgaggagtcttgtaaatgtaatggctggaaaaaccctaacccctcacccactcccccc agagccgacctgcagcaaataattgtcagtctaacagaatcctgtcggagttgtagccat gccctagctgctcatgtttcccacctggagaatgtgtcagaggaagaaatgaacagactc ctgggaatagtattggatgtggaatatctctttacctgtgtccacaaggaagaagatgca gataccaaacaagtttatttctatctatttaagctcttgagaaagtctattttacaaaga ggaaaacctgtggttgaaggctctttggaaaagaaacccccatttgaaaaacctagcatt gaacagggtgtgaataactttgtgcagtacaaatttagtcacctgccagcaaaagaaagg caaacaatagttgagttggcaaaaatgttcctaaaccgcatcaactattggcatctggag gcaccatctcaacgaagactgcgatctcccaatgatgatatttctggatacaaagagaac tacacaaggtggctgtgttactgcaacgtgccacagttctgcgacagtctacctcggtac gaaaccacacaggtgtttgggagaacattgcttcgctcggtcttcactgttatgaggcga caactcctggaacaagcaagacaggaaaaagataaactgcctcttgaaaaacgaactcta atcctcactcatttcccaaaatttctgtccatgctagaagaagaagtatatagtcaaaac tctcccatctgggatcaggattttctctcagcctcttccagaaccagccagctaggcatc caaacagttatcaatccacctcctgtggctgggacaatttcatacaattcaacctcatct tcccttgagcagccaaacgcagggagcagcagtcctgcctgcaaagcctcttctggactt gaggcaaacccaggagaaaagaggaaaatgactgattctcatgttctggaggaggccaag aaaccccgagttatgggggatattccgatggaattaatcaacgaggttatgtctaccatc acggaccctgcagcaatgcttggaccagaggtcagcaggaccaattttctgtcagcacac tcggccagggatgaggcggcaaggttggaagagcgcaggggtgtaattgaatttcacgtg gttggcaattccctcaaccagaaaccaaacaagaagatcctgatgtggctggttggccta cagaacgttttctcccaccagctgccccgaatgccaaaagaatacatcacacggctcgtc tttgacccgaaacacaaaacccttgctttaattaaagatggccgtgttattggtggtatc tgtttccgtatgttcccatctcaaggattcacagagattgtcttctgtgctgtaacctca aatgagcaagtcaagggctatggaacacacctgatgaatcatttgaaagaatatcacata aagcatgacatcctgaacttcctcacatatgcagatgaatatgcaattggatactttaag aaacagggtttctccaaagaaattaaaatacctaaaaccaaatatgttggctatatcaag gattatgaaggagccactttaatgggatgtgagctaaatccacggatcccgtacacagaa ttttctgtcatcattaaaaagcagaaggagataattaaaaaactgattgaaagaaaacag gcacaaattcgaaaagtttaccctggactttcatgttttaaagatggagttcgacagatt cctatagaaagcattcctggaattaatctgaaaaccatgagtgaacgcctcaagaatagg tactacgtgtctaagaaattattcatggcagacttacagcgagtctttaccaattgcaaa gagtacaacccccctgagagtgaatactacaaatgtgccaatatcctggagaaattcttc ttcagtaaaattaaggaagctggattaattgacaagtga >gi568815595r:20070707_20284027|GENSCAN_predicted_peptide_2|470_aa MLDIINHQRNANEIHSEIPFIPTRMARIKKKDNDKCCSEGQTSKAKVSAGGTPSGGSERN PLLASSSFWQLSAFLANQWRSEAENFECGGGSSRRRRTVVRSWRAQCAGLAVAGELRRAL KCEEEEDSCCRSSGQANTSTLLKNYQDNNKMLVLALENEKSKVKEAQDIILQLRKECYYL TCQLYALKGKLTSQQTVEPAQIPTIPQDTLGVDFDSGEAKSTDNVLPRTVSVRSSLKKHC NSICQFDSLDDFETSHLAGKSFEFERVGFLDPLVNMHIPENVQHNACQWSKDQVNLSPKL IQPGTFTKTKEDILESKSEQTKSKQRDTQERKREEKRKANRRKSKRMSKYKENKSENKKT VPQKKMHKSVSSNDAYNFNLEEGVHLTPFRQKVSNDSNREENNESEVSLCESSGSGDDSD DLYLPTCKYIQNPTSNSDRPVTRPLAKRALKYTDEKETEGSKPTKTPTSK >gi568815595r:20070707_20284027|GENSCAN_predicted_CDS_2|1413_bp atgcttgatatcattaaccatcagagaaatgcaaatgaaatccacagtgagataccattc atacccactaggatggctagaatcaagaagaaagataatgacaagtgttgttctgaaggc cagacgtccaaagccaaggtgtcagcaggtggtactccctctggaggctcagaacgaaat ccactccttgcctcttccagtttctggcagctgtcggcattcctagccaaccaatggagg agcgaggcggaaaatttcgaatgtggcggcggtagttccaggcgacggcggacggtggta cggtcctggagggcccagtgcgcggggctagccgtggctggagagcttcgaagagccttg aaatgtgaggaggaggaagatagctgttgcagaagtagtggccaagccaacacttctaca ctgctgaaaaattaccaagacaacaacaaaatgttagttttagctttggaaaatgaaaaa tccaaagtgaaagaagcccaagatatcatcctacagctgagaaaagaatgttactatctc acatgtcagctatatgcattgaaaggaaaacttacatcacaacaaacagtagaacctgct cagatacctactattcctcaagacacactgggagttgattttgattcaggtgaagctaag tctactgataatgtcttacctagaactgtatctgttcgtagcagtttaaagaaacattgt aacagtatatgtcagtttgatagcttggatgattttgaaaccagtcatttggcagggaag tcttttgaattcgaaagagttggatttttagacccactagtaaacatgcacatacctgaa aatgtacaacacaatgcttgtcaatggagcaaggaccaagttaacttatcaccaaagctg attcagccaggaacgtttactaaaacaaaagaagacattttagaatctaaatctgaacaa actaaaagtaagcaaagagatacacaagaaagaaaaagagaagagaaaagaaaagctaac aggagaaaatcaaaacgtatgtcaaaatataaagagaataaaagcgaaaataaaaaaact gttccccaaaaaaaaatgcacaaatctgtcagttccaatgatgcttacaattttaatttg gaagagggtgttcatcttactcctttccgacaaaaagtgagcaatgactctaatagagaa gaaaacaacgagtctgaagtgagcctctgtgaatcaagtggttcaggagatgattccgat gacctctatttgcccacttgcaagtacattcagaatcccacgagcaattcagatagacca gtcaccaggcctctagctaaaagagcactgaaatacacagatgaaaaagagacggagggt tctaagccaacaaaaactcctaccagtaagtga >gi568815595r:20070707_20284027|GENSCAN_predicted_peptide_3|230_aa MGIHRNHEGRPRCQGGECGQTVPMMVLQLIEAWVGGELLQLQFLLGDETGSQGQFDDLEP VCVCHCWVPQPAPLRSGNSETFSATPLDRTPEVPVSPDEKSWQNNSGITRNLNVMTPTKD HTSSVAMVNNKNGNSEMTAFSPHGWLHRLELGASGVSGTGHKLLVDRLFTAVEDSGPIPT APLHSAPALTPSGCHQALPLVPFGVVARAVPGPLCAVTGAGEAGMQEAMS >gi568815595r:20070707_20284027|GENSCAN_predicted_CDS_3|693_bp atgggaatccacaggaatcatgaaggacgccccaggtgccagggaggagaatgtgggcaa acagtccccatgatggtgttacagctgatagaagcctgggtgggaggagagctgctacag ctgcagtttctcctaggtgacgagacaggcagccagggccagtttgatgacctggaacca gtctgtgtgtgtcattgctgggtgcctcagcctgctcctctgagatcaggaaacagtgaa actttctctgctacacccctagacagaactccagaagtgccagtatctccagatgagaag agctggcagaataattctggtatcacaagaaatctgaatgtcatgacaccaacaaaggat cacactagctctgtagcaatggtgaataataaaaatggaaactcagaaatgacagcattc agcccccatggctggttacaccggttggagttgggtgccagtggtgtttctggtacaggg cacaagctgctggtggatcgactattcacagctgtagaggacagcggccccattcccaca gctccactacacagtgctccagctttaacaccatctggatgccaccaagccttaccactt gtgccctttggagtggtggccagagctgtacctgggccgctttgtgctgtgactggagca ggagaggctgggatgcaggaagcaatgtcctga >gi568815595r:20070707_20284027|GENSCAN_predicted_peptide_4|88_aa MAQWVQPKEGEPKQGGASPHLGSLTASGNSLSQPKEAIRDCTVHSGPDTAIFPWSSKPID QEIPSDAYATRAMGVQHKTEQPFVQTPS >gi568815595r:20070707_20284027|GENSCAN_predicted_CDS_4|267_bp atggcacagtgggtgcagcccaaggagggcgagccaaagcagggtggggcatcgcctcac ctgggaagcctcacggcgtcggggaattccctctcccagccaaaggaagccattagggac tgtactgtgcactctggcccagatactgcaattttcccatggtcttcgaaacccatagac caggagattccctctgatgcctatgccaccagggccatgggtgtccagcacaaaactgag cagccatttgtgcagacacccagctag >gi568815595r:20070707_20284027|GENSCAN_predicted_peptide_5|704_aa XPQNQACLENSPLRKACFLAQLNCLDRLKPKANLEDPSCQSNLKEVSVLACLAVHNFSNW ADADSFLFKQAAVVFSVTKVCPELVPSSGFLISLTSRMKPRTLMVSVTVLKDGVSGVCSF RCSDVSRVSSFRWVRGLADFRSEATDLRSQQMFATPVPMICVEVPVGIHTGDSLLTGREF VPFFDCHSIIYLTNIPGISKLSGCRVNQQLVRTLELNGFPLCRPSAQPRSTGKAEADSKQ TNAPNSEESGLLEHPFPESLTPVSLVQQLCYSYLTGHQVRGLADFRSEAADLHSELLKLV RPELFVPPGGFVVSLTSGVKPQTFAVSVTAVKEIQATIREYYKHLSTNKLENLEEMDKFL DKYTFPRLSQEEVESLNRPITSSEIEAVIAHQPKKAQDQMDSQPNSTTDAEKAFDKIQQP FMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIV LEVLARAIRQEKEIKGIQSGKEEVKLSLFADDMIVYLENPIVSAQNLVKLISNFSKVSGY KINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIHLTRDVKHLLKENYKPLLNK IKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPMTFFTELEKATLKFIWNQ KRAHIAKSILSQKNKAGGITLPDFKVYYKATVTKTAWYWYQTEI >gi568815595r:20070707_20284027|GENSCAN_predicted_CDS_5|2115_bp ngtccacagaaccaagcctgtttggaaaacagtcctttaagaaaagcttgtttccttgct caactcaactgtcttgaccgccttaaaccaaaagcaaatttggaagacccctcctgccaa agcaatctaaaagaagtgagtgtactagcttgcttagcagtccacaacttcagcaactgg gctgatgcagattccttcctgtttaaacaggctgcagtcgttttcagtgtcacaaaagtg tgtccagaattggttccttccagtgggttcttgatctcgctgacttcacgaatgaagcca cggaccctcatggtgagtgttacagttcttaaagatggtgtatctggagtttgttccttc agatgttcagatgtgtccagagtttcttccttccggtgggttcgtggtctcgctgacttc aggagtgaagccacagaccttcgcagtcaacagatgtttgcaactccagtccccatgatc tgcgttgaggtcccagtggggatccatactggggacagcttgctgaccggtagggaattt gtccctttcttcgactgtcattctatcatttacttgactaacataccaggtatctccaaa ctctcaggctgcagggtcaaccaacaacttgtcaggaccctggagctgaatggctttcct ctctgccgaccctcggctcagcccagaagtacaggaaaagcggaagctgattccaagcaa accaatgctcccaattccgaagagtcggggttgttagagcaccctttcccagaaagcctg acacccgtgtctttagtccagcagctgtgctactcatatttaactggccatcaggttcgt ggtcttgctgacttcaggagtgaagccgcagaccttcacagtgagctcttaaagctggtg cgtccagagttgtttgttcctcccggtgggttcgtggtctcactgacttcaggagtgaag ccgcagaccttcgcagtgagtgttacagctgttaaagaaatacaagctaccatcagagaa tactataaacacctctccacaaacaaactagaaaatctagaggaaatggataaattcctg gacaaatacaccttcccaagattaagccaggaagaagttgaatccctgaatagaccaata acaagttctgaaattgaagcagtaatagcccaccaaccaaaaaaagcccaggaccagatg gattcacagccaaattctaccacagatgcagaaaaggcctttgacaaaattcaacaaccc ttcatgctaaaaactctcaataaattaggtattgatgggacgtatctcaaaataataaga gctatttatgacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattc cctttgaaaactggaacaagacagggatgccctctctcaccactcctattcaacatagtg ttggaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaatcagga aaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaacccc attgtctcagcccaaaatctcgttaagctgataagcaacttcagcaaagtctcaggatac aaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagc caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacttaggaatc caccttacaagggacgtgaagcacctcctcaaggagaactacaaaccactgctcaataaa ataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaat atcatgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaag ctaccaatgactttcttcacagaactggaaaaagctactttaaagttcatatggaaccaa aaaagagcccacatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcaca ctacctgacttcaaagtatactacaaggctacagtaaccaaaacagcatggtactggtac caaacagagatatag